Release v0.3.12
What's Changed
- Tau bench async (#168)
- Refactor dev/tau-bench for true async (#167)
- ART-E updates (#166)
- Add langfuse tracing to run_rl.py (#165)
- Make rollout_tau_bench_task synchronous (#164)
- feat: Multi-device training (#163)
- Create run_training.py for remote training (#162)
- Create run_rl.py with ART RL loop (#161)
- Wandb weave (#158)
- Basic W&B Weave integration (#157)
- Properly read base model from CLI (#156)
- Deploy model locally (#155)
- Fix s3 utils typo (#153)
- Fix busy wait in vllm test client (#152)
- Fix comment (#151)
- dev: swebench (#149)
- fix: Improve retry util typing (#148)
- Add
get_guided_completion_params
and use in tic tac toe self play (#147) - Pin vllm to 0.8.5 (#146)
Full Changelog: v0.3.11...v0.3.12