Commit 9603b41
authored
😷 Refactor GRPO/RLOO to isolate
_generate (huggingface#4114)1 parent 5ee56ed commit 9603b41
File tree
4 files changed
+193
-462
lines changed- docs/source
- trl
- experimental/gfpo
- trainer
4 files changed
+193
-462
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
69 | | - | |
| 69 | + | |
70 | 70 | | |
71 | 71 | | |
72 | 72 | | |
| |||
0 commit comments