Release v0.3.12

corbt released this 11 Jul 06:26

7032d6c

What's Changed

Tau bench async (#168)
Refactor dev/tau-bench for true async (#167)
ART-E updates (#166)
Add langfuse tracing to run_rl.py (#165)
Make rollout_tau_bench_task synchronous (#164)
feat: Multi-device training (#163)
Create run_training.py for remote training (#162)
Create run_rl.py with ART RL loop (#161)
Wandb weave (#158)
Basic W&B Weave integration (#157)
Properly read base model from CLI (#156)
Deploy model locally (#155)
Fix s3 utils typo (#153)
Fix busy wait in vllm test client (#152)
Fix comment (#151)
dev: swebench (#149)
fix: Improve retry util typing (#148)
Add get_guided_completion_params and use in tic tac toe self play (#147)
Pin vllm to 0.8.5 (#146)

Full Changelog: v0.3.11...v0.3.12
