⚔️ Fix bf16 fp16 config conflict issue #3598
Signed-off-by: YAO Matrix <[email protected]>
Yes thank you, this PR makes sense. By the way, that's why slow tests fail on main. The only thing that bothers me is that None by default isn't very informative, maybe even misleading. For the moment, I don't really see an alternative.
@qgallouedec Yes, since it's a Like "Whether to use bf16 (mixed) precision instead of 32-bit. Requires Ampere or higher NVIDIA " WDYT?
Yes, I think that's the best way to go at this stage. I'm fine with what you suggested, just delete this
I just realized that I asked you to remove something that was already in the doc. But still, can you remove it? I think bf16 is stable enough now.
Signed-off-by: YAO Matrix <[email protected]> Co-authored-by: Quentin Gallouédec <[email protected]> Co-authored-by: Quentin Gallouédec <[email protected]>
With PR #3515, `bf16` is set to `True` by default regardless of what the `fp16` setting is. When the user explicitly sets `fp16` to `True` (as in the unit test case here https://github.com/huggingface/trl/blob/main/tests/slow/test_dpo_slow.py#L132), this triggers the error message "At most one of fp16 and bf16 can be True, but not both".

This PR fixes the issue by setting `bf16` to `True` only when `fp16` is `False` and `bf16` is not set.

@qgallouedec, please help review, thanks.
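For reviewers, the intended resolution rule can be sketched roughly as follows. This is a minimal illustration, not the actual diff: `SketchConfig` is a hypothetical stand-in class, and the use of `None` as the "not set" sentinel for `bf16` is assumed from the discussion about the `None` default.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SketchConfig:
    """Hypothetical stand-in for a training config; not the actual TRL class."""

    fp16: bool = False
    bf16: Optional[bool] = None  # None means "not set by the user" (assumption)

    def __post_init__(self):
        # Default bf16 to True only when fp16 is off and bf16 was left unset.
        if self.bf16 is None:
            self.bf16 = not self.fp16
        # Mirror the mutual-exclusion check quoted in the description.
        if self.fp16 and self.bf16:
            raise ValueError("At most one of fp16 and bf16 can be True, but not both")
```

Under this sketch, `SketchConfig(fp16=True)` ends up with `bf16` set to `False` instead of raising, while explicitly passing both `fp16=True` and `bf16=True` still raises the error.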