fix(replication): Correctly replicate commands even when OOM #2428
Conversation
Before this change, OOM in shard callbacks could lead to data inconsistency between the master and the replica, for example with commands that mutated data on one shard but failed on another, like `LMOVE`.

After this change, callbacks that result in an OOM will correctly replicate their work (none, partial, or complete) to replicas.

Note that `MSET` and `MSETNX` required special handling: they are the only commands that can _create_ multiple keys, so some of those keys can fail while others succeed.

Fixes #2381
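To make the failure mode concrete, here is a minimal, self-contained sketch (not Dragonfly's real API; the `ShardJournal` and `Entry` types are invented for illustration) of why every shard of a multi-shard transaction has to record an entry, even an empty one, when its callback fails:

```cpp
// Toy model, not Dragonfly's actual journal API: shows why a shard whose
// callback failed with OOM must still record an entry (a NOOP), so that
// multi-shard transactions stay aligned between master and replica.
#include <cstdint>
#include <iostream>
#include <string>
#include <vector>

enum class Op { COMMAND, NOOP };

struct Entry {
  uint64_t txid;
  Op op;
  std::string payload;  // serialized command; empty for NOOP
};

// One journal per shard, mimicking a sharded store.
struct ShardJournal {
  std::vector<Entry> entries;
  void Record(uint64_t txid, Op op, std::string payload = "") {
    entries.push_back({txid, op, std::move(payload)});
  }
};

int main() {
  ShardJournal shard0, shard1;
  const uint64_t txid = 42;

  // An LMOVE-like transaction: the source-key shard succeeded,
  // the destination-key shard hit OOM.
  shard0.Record(txid, Op::COMMAND, "LPOP src");
  // Before the fix, the failing shard journaled nothing, so the replica
  // applied a half-transaction. Recording a NOOP keeps both sides in step.
  shard1.Record(txid, Op::NOOP);

  std::cout << "shard0: " << shard0.entries.size()
            << " entries, shard1: " << shard1.entries.size() << " entries\n";
  return 0;
}
```

As the review below points out, a bare NOOP is not actually forwarded by `JournalStreamer`, which is why the thread converges on sending a `PING` marked as `COMMAND` instead.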
```cpp
if (result.status != OpStatus::OK) {
  // We log NOOP even for NO_AUTOJOURNAL commands because the non-success status could have been
  // due to OOM in a single shard, while other shards succeeded
  journal->RecordEntry(txid_, journal::Op::NOOP, db_index_, unique_shard_cnt_,
```
Check out `JournalStreamer::Start`. It does not replicate NOOP; today we use it only to trigger journal writes to the sink, so this will not get to the replica. Did you run your tests with the flag `enable_multi_shard_sync` set to true? I think this will fail.
Nice catch. How about I send a `PING`, but mark the opcode as `COMMAND` instead? Do you see any problem with that?
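If I follow that suggestion, the call from the diff above would become something like this (the payload and trailing arguments are my assumption; I haven't checked `RecordEntry`'s exact signature):

```cpp
// Sketch of the PING idea, not verified against the real signature:
// record an actual command that JournalStreamer will serialize. PING is
// harmless on the replica, unlike Op::NOOP, which is never forwarded.
journal->RecordEntry(txid_, journal::Op::COMMAND, db_index_, unique_shard_cnt_,
                     make_pair("PING", ArgSlice{}));  // payload form assumed
```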
BTW, it does pass with `--df enable_multi_shard_sync=true`.
I think `PING` is OK.
Oh, I see that the test is using `proactor_threads=1`, so there is only one shard. Let's change the test to run on 2 shards; you will need to increase the max memory size as well.
I still see the test with `proactor_threads=1`. Can you change it? See my comment above.
Sorry, I forgot about it :(
Done now.
```python
@pytest.mark.asyncio
async def test_policy_based_eviction_propagation(df_local_factory, df_seeder_factory):
```
We have this test above; I asked Yue on his PR to add it and mark it as skip. You can remove his test.
Note that his test passes `unsupported_types` to the seeder because of another bug that Kostas fixed.
It messes up the diff, but the change is really only adding the test below.
src/server/string_family.cc
```cpp
  // journal [0, i)
  payload = make_pair("MSET", ArgSlice(&args[0], i));
}
tx->LogJournalOnShard(op_args.shard, std::move(payload), tx->GetUniqueShardCnt(), false, false);
```
I see that you still don't use the `RecordJournal` function. Why?
I left this for later and did it now :)
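For reference, a sketch of what the switch might look like, assuming `RecordJournal` is a thin helper over the per-shard logging call (the signature here is guessed from this thread, not taken from the code):

```cpp
// Hypothetical: replace the direct LogJournalOnShard call with the
// RecordJournal helper the reviewer mentions. Signature assumed, not
// verified; it would journal the successful prefix args[0..i) of MSET.
RecordJournal(op_args, "MSET", ArgSlice(&args[0], i), tx->GetUniqueShardCnt());
```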