Skip to content

Request to Open Source RL Trained Models Mentioned in the Paper #4

@pspdada

Description

@pspdada

Great work on the paper!

We noticed that the paper mentions several models (like Qwen2.5-Math-7B-base model) that have undergone several RL training methods. These models seem to show promising performance, and it would be highly beneficial for the research community to have access to them. Would it be possible to open-source the weights of these RL-trained models to facilitate further research and reproducibility?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions