About using vLLM integration as general generation tool for custom training loops #3623

daniel-dona · 2025-06-20T09:58:46Z

daniel-dona
Jun 20, 2025

A lot of work have been put to integrate GRPO with vLLM as a way to speed up online inferencing during training, could be great to generalize the integration in a way it could be used in custom training loops.

I'm thinking for example as a way of completions sampling during training, or performing custom evaluations that require completions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

About using vLLM integration as general generation tool for custom training loops #3623

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

About using vLLM integration as general generation tool for custom training loops #3623

Uh oh!

daniel-dona Jun 20, 2025

Replies: 0 comments

daniel-dona
Jun 20, 2025