Skip to content

Commit 90ce1aa

Browse files
authored
Update README to align nomenclature correctness (#97)
* Update README to align nomenclature correctness * Update README.md
1 parent a216a3c commit 90ce1aa

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,9 @@
33

44
# Transformer Reinforcement Learning X
55

6-
TRLX allows you to fine-tune 🤗 Hugging Face supported language models (`gpt2`, `gpt-j`, `gpt-neo` and `gpt-neox` based) up to 20B parameters using reinforcement learning via either a provided reward function or reward-labeled dataset. Proximal Policy Optimization ([PPO](https://arxiv.org/pdf/1909.08593.pdf)) and Implicit Language Q-Learning ([ILQL](https://sea-snell.github.io/ILQL_site/)) are implemented.
6+
trlX allows you to fine-tune 🤗 Hugging Face supported language models (`gpt2`, `gpt-j`, `gpt-neo` and `gpt-neox` based) up to 20B parameters using reinforcement learning via either a provided reward function or reward-labeled dataset. Proximal Policy Optimization ([PPO](https://arxiv.org/pdf/1909.08593.pdf)) and Implicit Language Q-Learning ([ILQL](https://sea-snell.github.io/ILQL_site/)) are implemented.
77

8-
You can read more about TRLX in our [documentation](https://trlX.readthedocs.io).
8+
You can read more about trlX in our [documentation](https://trlX.readthedocs.io).
99

1010
## Installation
1111
```bash

0 commit comments

Comments
 (0)