Skip to content

Add GRPO/ Online DPO support for quantitative models when use vllm as infer backbone.#3133

Closed
maoulee wants to merge 0 commit intohuggingface:mainfrom
maoulee:main
Closed

Add GRPO/ Online DPO support for quantitative models when use vllm as infer backbone.#3133
maoulee wants to merge 0 commit intohuggingface:mainfrom
maoulee:main

Commits

No commits history

There isn't any commit history to show here.