Open
Description
It is useful to see environment-specific metrics during training runs to understand how the model is progressing. To do so, we should support SkyRL-Gym environments returning metrics that will be reported to wandb (or other logging/reporting tools).
TODOs
- add a `metrics` field of type `Dict[str, Any]` to `BaseTextEnvStepOutput`
- add the logic in `SkyRLGymGenerator` to aggregate these metrics across the training batch and return them to the trainer for logging (under `env/*` on wandb)
- update one of the example environments to report metrics (to serve as an example)
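
A rough sketch of what the first two TODOs could look like, for discussion only. `StepOutputSketch` is a hypothetical stand-in for `BaseTextEnvStepOutput` (only the `metrics` field comes from this issue; the other fields are placeholders), `aggregate_env_metrics` is a made-up helper name, and averaging is just one possible reduction since the issue does not specify one.

```python
from collections import defaultdict
from typing import Any, Dict, List, TypedDict


class StepOutputSketch(TypedDict, total=False):
    # Hypothetical stand-in for BaseTextEnvStepOutput.
    # Only `metrics` is proposed in this issue; the rest are placeholders.
    reward: float
    done: bool
    metrics: Dict[str, Any]


def aggregate_env_metrics(
    step_outputs: List[StepOutputSketch], prefix: str = "env/"
) -> Dict[str, float]:
    """Average numeric metrics over a batch and namespace them under `prefix`."""
    values: Dict[str, List[float]] = defaultdict(list)
    for out in step_outputs:
        for name, value in out.get("metrics", {}).items():
            # Assumption: only ints, floats and bools (counted as 0/1) are reduced;
            # other value types are skipped because no reduction is defined for them.
            if isinstance(value, (int, float)):
                values[name].append(float(value))
    return {f"{prefix}{name}": sum(v) / len(v) for name, v in values.items()}


batch: List[StepOutputSketch] = [
    {"reward": 1.0, "done": True, "metrics": {"num_turns": 3, "solved": True}},
    {"reward": 0.0, "done": True, "metrics": {"num_turns": 5, "solved": False, "note": "timeout"}},
]
logged = aggregate_env_metrics(batch)
print(logged)  # {'env/num_turns': 4.0, 'env/solved': 0.5}
```

The resulting flat dict could then be handed to whatever logger the trainer already uses. Steps that report no `metrics` are simply ignored, so existing environments would keep working unchanged.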
SumanthRH and alex-dr