[feature request] log generation data to help debugging #2188

@chenhaiq

Description

Many people hit training failures after 100 steps, especially in multi-turn agentic RL.
For example, see 0russwest0/Agent-R1#30 (comment).

This kind of problem is very difficult to debug because the tooling for inspecting generations is lacking.

The idea in this issue is to log the inputs and outputs of LLM calls and tool calls to an external tracking system such as wandb.
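
To make the request more concrete, here is a minimal sketch of what such logging could look like. It is not an existing API in this project: `TurnRecord`, `GenerationLogger`, the column names, and the `"generations"` key are all hypothetical names chosen for illustration. The sketch assumes each LLM call or tool call in a rollout can be reduced to one input string and one output string, and that `wandb.init()` has already been called before `flush_to_wandb` is used.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical column layout; one row per LLM call or tool call.
COLUMNS = ["step", "sample_id", "turn", "role", "input_text", "output_text"]


@dataclass
class TurnRecord:
    step: int          # training step the rollout belongs to
    sample_id: str     # identifier of the rollout / trajectory
    turn: int          # turn index within the multi-turn rollout
    role: str          # "llm" or "tool"
    input_text: str    # prompt sent to the LLM, or arguments sent to the tool
    output_text: str   # LLM response, or tool result


class GenerationLogger:
    """Collects per-turn records in memory and can push them to wandb."""

    def __init__(self) -> None:
        self.records: List[TurnRecord] = []

    def log_turn(self, step, sample_id, turn, role, input_text, output_text):
        self.records.append(
            TurnRecord(step, sample_id, turn, role, input_text, output_text)
        )

    def rows(self, step: int) -> List[list]:
        # Rows for one training step, ordered like COLUMNS.
        return [
            [getattr(r, c) for c in COLUMNS]
            for r in self.records
            if r.step == step
        ]

    def flush_to_wandb(self, step: int) -> None:
        # Imported lazily so the rest of the sketch works without wandb.
        import wandb

        table = wandb.Table(columns=COLUMNS, data=self.rows(step))
        wandb.log({"generations": table}, step=step)


# Usage inside a (hypothetical) rollout loop:
logger = GenerationLogger()
logger.log_turn(100, "s0", 0, "llm", "What is 2+2?", "<tool>calc(2+2)</tool>")
logger.log_turn(100, "s0", 0, "tool", "calc(2+2)", "4")
# logger.flush_to_wandb(step=100)  # requires an active wandb run
```

Keeping one row per call, rather than one row per trajectory, would make it possible to filter the table by step, sample, or role when looking for the turn where a rollout went wrong. Logging only a sample of trajectories per step is probably necessary to keep table sizes manageable.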
