feat: Add support for multiple histories #170

bradhilton · 2025-06-28T00:34:02Z

No description provided.

corbt · 2025-06-28T01:12:39Z

src/art/local/tokenize.py

-            ):
-                results.append(result)
+            trajectory_results: list[TokenizedResult] = []
+            for history in [


Probably doesn't matter but we could make Trajectory inherit from History to avoid having to create a new one here.

corbt

I do think it would be good to have an explicit way to mark an assistant message as trainable or not. I guess we get that right now by passing Choice vs Message objects but if we end up standardizing on using Message objects everywhere (or someone wants to pass allow_training_without_logprobs for any reason) we'll have an issue. That can come later though I suppose.

Also, just checking that these are being masked out somewhere so the different histories in the same trajectory don't attend to each other?

bradhilton · 2025-06-28T03:26:47Z

@corbt no masking, here's a caveat I shared in our discussion:

For the qwen 3 reasoning use case you'll want to add an additional message and choices list for each history. You'll also want to convert previous choice objects to plain assistant messages so that you don't multiplicatively train on earlier messages with the wrong contents to boot

I also mused "maybe we could add support for auto-splitting reasoning trajectories once we figure out what works," but I'm not ready to tackle that atm

corbt · 2025-06-28T07:22:33Z

Ah I may have been unclear; my question was about attention masking, not loss masking. Are we masking to make sure the different message histories within a trajectory aren't able to attend to each other?

bradhilton · 2025-06-28T20:46:25Z

@corbt yes, each history will yield it's own TokenizedResult, which gets a unique token group id in the packing step

corbt

Sweet, excited to have this in. @arcticfly we'll definitely have to document how this works.

bradhilton added 2 commits June 27, 2025 18:33

feat: Add support for multiple histories

3716e27

refactor: Adopt History type

9e18e42

corbt reviewed Jun 28, 2025

View reviewed changes

corbt approved these changes Jun 28, 2025

View reviewed changes

bradhilton marked this pull request as ready for review June 30, 2025 18:46

bradhilton merged commit d96e539 into main Jun 30, 2025
1 check passed

bradhilton deleted the feat/multiple-histories branch June 30, 2025 18:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add support for multiple histories #170

feat: Add support for multiple histories #170

Uh oh!

bradhilton commented Jun 28, 2025

Uh oh!

corbt Jun 28, 2025

Uh oh!

corbt left a comment

Uh oh!

bradhilton commented Jun 28, 2025 •

edited

Loading

Uh oh!

corbt commented Jun 28, 2025

Uh oh!

bradhilton commented Jun 28, 2025

Uh oh!

corbt left a comment

Uh oh!

Uh oh!

Uh oh!

feat: Add support for multiple histories #170

feat: Add support for multiple histories #170

Uh oh!

Conversation

bradhilton commented Jun 28, 2025

Uh oh!

corbt Jun 28, 2025

Choose a reason for hiding this comment

Uh oh!

corbt left a comment

Choose a reason for hiding this comment

Uh oh!

bradhilton commented Jun 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

corbt commented Jun 28, 2025

Uh oh!

bradhilton commented Jun 28, 2025

Uh oh!

corbt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bradhilton commented Jun 28, 2025 •

edited

Loading