[DB-1625] Indexing without sequences (suggestions) #5238

shaan1337 · 2025-08-14T07:00:41Z

No description provided.

…scribing to a position but the position is not yet available in the stream/log. In that case the subscriptions will go live directly and wait - there are no retries involved

linear · 2025-08-14T07:00:44Z

DB-1625 Secondary Indexes - Second Review Pass

github-actions · 2025-08-14T07:13:20Z

Qodana for .NET

3 new problems were found

Inspection name	Severity	Problems
`Redundant using directive`	🔶 Warning	2
`Redundant 'partial' modifier on type declaration`	🔶 Warning	1

💡 Qodana analysis was run in the pull request mode: only the changed files were checked
☁️ View the detailed Qodana report

Contact Qodana team

Contact us at [email protected]

Or via our issue tracker: https://jb.gg/qodana-issue
Or share your feedback: https://jb.gg/qodana-discussions

…reading them: * If in-flight records are committed/cleared while reading them, stop where we have reached and return a partial result. * Always read in-flight records before reading database records: * If in-flight records are committed while reading, we can be sure that they will be present in the subsequent query of database records. * After we've read the database records, some of them may overlap with in-flight records: deduplicate the result. * Simplify inclusive/exclusive log position queries * Simplify sorting by RowId: Always sort in ascending order (return negative RowIds for backwards database reads)

…es event by event, not transaction by transaction (Using only the `PreparePosition` means that the checkpoint could move backwards when processing events from an explicit transaction)

…dditional complexity. There were some issues with the implementation: * "" ($all) was being passed as the index name to SubscriptionMessage.PollStream * The position being polled was incorrect (it was using the main index's position instead of the secondary index's position) * Long polls are not suitable for backward index reads - only forward reads * MissedEvents: the lastIndexedPosition is the main index's position, not the secondary index's position

* cosmetic fixes * don't exclude first event when subscribing to the beginning of the index

…n reached (instead of potentially reading an extra page)

- Throw ObjectDisposedException when disposed - Use the prepare/commit positions from resolvedEvent.OriginalPosition - Track the stream index processor's commit time as part of the commit time

- Separate initialization of the last appended record to avoid the need for synchronization. If the last event in the log was a system event, it was also previously filtered out by RecordAppended() - Use the commit position instead of prepare position of events

…tion when reading events This is required as prepares written with explicit transactions will not have an expected version set - the event number is derived from the commit record.

…reading.

- Clear in-flight records after flushing the appender to make sure that concurrent reads will see the record either in the in-flight records or in the database

The data in the secondary indexes is already committed to a quorum number of nodes, so it might be ok. But it might be risky, as disposal could be happening due to a fatal error. for e.g consider the case where the system ran out of disk space: we are trying to write even more data.

…irectory by default: - Indexes are usually stored on a separate disk so that IOPS are not shared with the DB for better performance - It's better if DuckDB related files (like the write-ahead log, etc.) are stored in its own directory so that the database code doesn't accidentally modify them. (for e.g all .tmp files are deleted on startup)

shaan1337 added 20 commits August 11, 2025 11:18

suggestion

204804e

suggestion

f6752b1

suggestion

698387a

suggestion

b88a923

suggestion

625843a

suggestion

166057f

suggestion

77c47fc

suggestion: i think this comment refers to the case where you are sub…

efe8e4f

…scribing to a position but the position is not yet available in the stream/log. In that case the subscriptions will go live directly and wait - there are no retries involved

suggestion

8f63df3

suggestion

09de87b

suggestion

5072a63

suggestion

0f92f0f

suggestion: missing break?

3a9c1cc

remove unused class

220b800

make respects_configuration_feature_flag_and_dev_mode test work

4202c5a

suggestion

d99508a

suggestion

2b21379

suggestion

a4ba96b

suggestion: update queries (untested)

806dc2a

suggestion: fix failing test: Index_streams_should_not_be_found

be60c9c

shaan1337 and others added 8 commits August 15, 2025 15:00

* Use the full TFPos as checkpoint for the SecondaryIndex as it index…

fe9f10b

…es event by event, not transaction by transaction (Using only the `PreparePosition` means that the checkpoint could move backwards when processing events from an explicit transaction)

Fixed optimistic lock

92eb4a7

Fixed memory fence

a01595a

Removed redundant comments

b1fe954

suggestion

0ac1447

Fix build error

7c90be6

suggestion

29a5029

shaan1337 added 16 commits August 25, 2025 10:55

don't allow stream subscriptions to index streams

d592b43

Enumerator.IndexSubscription:

2319385

* cosmetic fixes * don't exclude first event when subscribing to the beginning of the index

Enumerator.ReadIndex: immediately stop reading when max count has bee…

3dc8790

…n reached (instead of potentially reading an extra page)

DefaultIndexProcessor:

a64dd2e

- Throw ObjectDisposedException when disposed - Use the prepare/commit positions from resolvedEvent.OriginalPosition - Track the stream index processor's commit time as part of the commit time

off by one?

7d868e2

use correct query for backward reads

1d462b8

remove unused code

b81eef8

ReaderExtensions.ReadRecords: Take the commit position into considera…

b420464

…tion when reading events This is required as prepares written with explicit transactions will not have an expected version set - the event number is derived from the commit record.

Read events sequentially. TFReaderLease is not designed for parallel …

1da2065

…reading.

DuckDbSchema: Move transaction.Commit into try block

c6696ea

StreamIndexProcessor:

d8b6883

- Clear in-flight records after flushing the appender to make sure that concurrent reads will see the record either in the in-flight records or in the database

Must these tests be uncommented or deleted?

2f85acd

shaan1337 force-pushed the alexey/secondary-indexes-suggestions branch from 9b4b036 to 2f85acd Compare September 1, 2025 08:17

shaan1337 added 3 commits September 1, 2025 16:30

SecondaryIndexSubscription: small improvements

95c51d4

StreamSql: Why are the other ACL role types not stored?

183117b

Do these tests need to be re-enabled?

ef4694b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DB-1625] Indexing without sequences (suggestions) #5238

[DB-1625] Indexing without sequences (suggestions) #5238

Uh oh!

shaan1337 commented Aug 14, 2025

Uh oh!

linear bot commented Aug 14, 2025

Uh oh!

github-actions bot commented Aug 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

[DB-1625] Indexing without sequences (suggestions) #5238

Are you sure you want to change the base?

[DB-1625] Indexing without sequences (suggestions) #5238

Uh oh!

Conversation

shaan1337 commented Aug 14, 2025

Uh oh!

linear bot commented Aug 14, 2025

Uh oh!

github-actions bot commented Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Qodana for .NET

Uh oh!

Uh oh!

github-actions bot commented Aug 14, 2025 •

edited

Loading