Skip to content

Conversation

jianoaix
Copy link
Contributor

@jianoaix jianoaix commented Feb 10, 2025

Why are these changes needed?

The blob feed needs to use pagination to get all blob, because Dynamo has a 1MB limit on the response size, so if there are more blobs to fetch, it won't be able to get them all.

Checks

  • I've made sure the tests are passing. Note that there might be a few flaky tests, in that case, please comment that they are not relevant.
  • I've checked the new test coverage and the coverage percentage didn't drop.
  • Testing Strategy
    • Unit tests
    • Integration tests
    • This PR is not tested :(

@jianoaix jianoaix requested a review from dmanc February 10, 2025 18:13
// queryBucketBlobMetadata returns blobs (as metadata) within range [startKey, endKey] from a single bucket.
// Results are ordered by <RequestedAt, Bobkey> in ascending order.
//
// The function handles DynamoDB's 1MB response size limitation by performing multiple queries if necessary.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't GetBlobMetadataByRequestedAt already doing pagination?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It has no information about 1MB underneath in dynamo

@jianoaix jianoaix merged commit 6382132 into Layr-Labs:master Feb 10, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants