Release v0.3.12

@corbt released this 11 Jul 06:26
· 354 commits to main since this release
7032d6c

What's Changed

  • Tau bench async (#168)
  • Refactor dev/tau-bench for true async (#167)
  • ART-E updates (#166)
  • Add langfuse tracing to run_rl.py (#165)
  • Make rollout_tau_bench_task synchronous (#164)
  • feat: Multi-device training (#163)
  • Create run_training.py for remote training (#162)
  • Create run_rl.py with ART RL loop (#161)
  • Wandb weave (#158)
  • Basic W&B Weave integration (#157)
  • Properly read base model from CLI (#156)
  • Deploy model locally (#155)
  • Fix s3 utils typo (#153)
  • Fix busy wait in vllm test client (#152)
  • Fix comment (#151)
  • dev: swebench (#149)
  • fix: Improve retry util typing (#148)
  • Add get_guided_completion_params and use in tic tac toe self play (#147)
  • Pin vllm to 0.8.5 (#146)

Full Changelog: v0.3.11...v0.3.12