fix: Huge entries fail to load outside RDB / replication #4154

chakaz · 2024-11-19T18:41:38Z

We have an internal utility tool that we use to deserialize values in some use cases:

* `RESTORE`
* Cluster slot migration
* `RENAME`, if the source and target shards are different

We [recently](#3760) changed this area of the code, and that change caused this regression because it only handled RDB / replication streams.

Fixes #4143

romange · 2024-11-20T04:26:27Z

We will have to cut a patch release for that. I would also like to include all the INFO improvements in 1.25.2.

adiholden · 2024-11-20T07:52:20Z

tests/dragonfly/cluster_test.py

+
+    await push_config(json.dumps(generate_config(nodes)), [node.admin_client for node in nodes])
+
+    logging.debug(f"Generating huge {type}")


Check out `test_big_containers` and how it uses `StaticSeeder`.
Can we do the same here and use the capture to compare?
That would also make this a single test case instead of one run per data type.
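
To illustrate what "use the capture to compare" could mean, here is a minimal sketch of a content digest taken on both sides of a migration and compared. It is a toy model under stated assumptions: `capture` and `_canonical` are hypothetical helpers operating on plain Python dicts, and this is not how `StaticSeeder` computes its captures.

```python
import hashlib
from typing import Mapping


def _canonical(value) -> str:
    """Render a value deterministically; unordered containers are sorted first."""
    if isinstance(value, dict):
        return "hash:" + repr(sorted(value.items()))
    if isinstance(value, (set, frozenset)):
        return "set:" + repr(sorted(value))
    if isinstance(value, list):
        return "list:" + repr(value)  # list order is significant
    return "string:" + repr(value)


def capture(store: Mapping[str, object]) -> str:
    """Hypothetical helper: one digest over all keys and values of a keyspace."""
    digest = hashlib.sha256()
    for key in sorted(store):  # key order must not affect the result
        digest.update(key.encode())
        digest.update(b"\0")
        digest.update(_canonical(store[key]).encode())
        digest.update(b"\0")
    return digest.hexdigest()


# Same contents inserted in a different order should compare equal.
source = {"k1": ["x", "y"], "k2": {"b", "a"}, "k3": {"f": "1"}}
target = {"k3": {"f": "1"}, "k2": {"a", "b"}, "k1": ["x", "y"]}
same = capture(source) == capture(target)
```

Comparing a single digest keeps the assertion cheap even for containers with many elements, at the cost of not reporting which key differs.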

adiholden · 2024-11-20T07:54:42Z

tests/dragonfly/cluster_test.py

+@pytest.mark.parametrize("type", ["list", "hash", "string", "set", "zset"])
+@dfly_args({"proactor_threads": 2, "cluster_mode": "yes"})
+@pytest.mark.asyncio
+async def test_cluster_migration_huge_list(df_factory: DflyInstanceFactory, type):


Rename `test_cluster_migration_huge_list` -> `test_cluster_migration_huge_container`.
I see you test the string data type in the params above, but it's not clear to me what this does:
`debug populate 1 k 10000 RAND TYPE string ELEMENTS 1000`
What is the effect of the `ELEMENTS` param here?

Also, let's add streams to the tested types.

> What is the effect of the `ELEMENTS` param here?

It doesn't have any effect. I thought it was a "free test", so why not, but I removed it.

> Also, let's add streams to the tested types.

I don't have support for breaking streams, and neither `StaticSeeder` nor `debug populate` supports streams :|

adiholden · 2024-11-20T08:36:42Z

tests/dragonfly/generic_test.py

@@ -168,3 +169,41 @@ async def test_denyoom_commands(df_factory):

    # mget should not be rejected
    await client.execute_command("mget x")
+
+
+@pytest.mark.parametrize("type", ["list", "hash", "string", "set", "zset"])


We can do this test as a unit test; see `TEST_F(RdbTest, LoadHugeSet)` for an example.
By the way, we can check the `DUMP` and `RESTORE` commands directly.

It's cleaner, and with the seeder (your suggestion) it's also easier to do in pytest :)
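
For reference, a direct dump-and-restore check could look roughly like the sketch below. It assumes a client exposing `dump(name)` and `restore(name, ttl, value, replace=...)`, which is how I understand redis-py's synchronous client to be shaped. `FakeClient` is a hypothetical in-memory stand-in so the example runs without a server; its pickle payload has nothing to do with Dragonfly's real serialization format.

```python
import pickle


class FakeClient:
    """Hypothetical in-memory stand-in for a server client (assumption, not a real API)."""

    def __init__(self):
        self.data = {}

    def dump(self, name):
        # Return an opaque payload, or None when the key does not exist.
        return pickle.dumps(self.data[name]) if name in self.data else None

    def restore(self, name, ttl, value, replace=False):
        # ttl is accepted for call-shape compatibility but ignored by this fake.
        if name in self.data and not replace:
            raise ValueError("BUSYKEY Target key name already exists")
        self.data[name] = pickle.loads(value)
        return "OK"


def dump_restore_roundtrip(client, src, dst):
    """Serialize `src` with dump and load it back under `dst` with restore."""
    payload = client.dump(src)
    if payload is None:
        raise KeyError(src)
    client.restore(dst, 0, payload, replace=True)
    return payload


client = FakeClient()
client.data["huge"] = [f"item-{i}" for i in range(10_000)]  # one large list value
dump_restore_roundtrip(client, "huge", "huge_copy")
roundtrip_ok = client.data["huge_copy"] == client.data["huge"]
```

Against a real server the same `dump_restore_roundtrip` function would exercise the `RESTORE` path named in the PR description, which is one of the cases affected by the regression.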

adiholden · 2024-11-20T09:04:03Z

src/server/generic_family.cc

+      config.streamed = true;
+    }
+
+    if (auto ec = FromOpaque(*opaque_res, config, &pv); ec) {


Can we remove the `std::error_code RdbLoaderBase::FromOpaque(const OpaqueObj& opaque, CompactObj* pv)` overload?

adiholden · 2024-11-20T13:18:36Z

tests/dragonfly/cluster_test.py

+        collection_size=10_000,
+        variance=1,
+        samples=1,
+        types=[type],


Instead of running a test case for each data type, I suggest passing all the types here.
This will make CI and regression runs a little faster.
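
The trade-off can be sketched as follows: one seeder plan per type means one test case (and one cluster start) per type, while a single plan listing every type covers the same data in one case. `SeederPlan` and the helper functions are hypothetical stand-ins for illustration, not the real `StaticSeeder` API; only the type list and `collection_size=10_000` come from the diff.

```python
from dataclasses import dataclass

# Type list taken from the PR's parametrize decorator.
ALL_TYPES = ["list", "hash", "string", "set", "zset"]


@dataclass
class SeederPlan:
    """Hypothetical stand-in for seeder arguments (assumption, not the real API)."""

    types: list
    collection_size: int = 10_000
    samples: int = 1


def plans_per_type():
    # Parametrized style: one plan, and therefore one test case, per data type.
    return [SeederPlan(types=[t]) for t in ALL_TYPES]


def plan_all_types():
    # Suggested style: a single plan that seeds every data type at once.
    return [SeederPlan(types=list(ALL_TYPES))]


def cluster_setups(plans):
    # Each plan maps to one test case, hence one cluster start and teardown.
    return len(plans)


def covered_types(plans):
    return {t for plan in plans for t in plan.types}
```

Both styles cover the same set of types; the difference is only in how many times the cluster is brought up.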

fix: Huge entries fail to load outside RDB / replication

chakaz requested a review from adiholden November 20, 2024 06:15

adiholden reviewed Nov 20, 2024

shahar added 2 commits November 20, 2024 14:54

improve tests

5e43a8d

remove unused overload

e040a70

chakaz requested a review from adiholden November 20, 2024 13:09

adiholden reviewed Nov 20, 2024

!parameterize

34a1987

chakaz requested a review from adiholden November 20, 2024 13:29

adiholden approved these changes Nov 20, 2024

chakaz enabled auto-merge (squash) November 20, 2024 13:34

chakaz merged commit 24a1ec6 into main Nov 20, 2024
9 checks passed

chakaz deleted the chakaz/migration-huge-restore branch November 20, 2024 14:00
