When the Megatron backend computes log probabilities, it casts the values to float32, which increases CPU usage in `ray.get`. Would it be possible to retain the original bfloat16 dtype, or is float32 required to avoid precision loss?
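
To make the precision side of the question concrete, here is a rough sketch of how much a single log-probability value moves when it is rounded to bfloat16. It is plain Python that emulates bfloat16 rounding on the float32 bit pattern; it is not code from the Megatron backend, and the helper names (`to_bfloat16`, `max_rounding_error`) and the sample values are made up for illustration.

```python
import struct


def to_bfloat16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value (ties to even).

    bfloat16 keeps the sign, the 8 exponent bits, and the top 7 mantissa
    bits of the float32 encoding, so we round away the low 16 bits.
    NaN and infinity are not handled in this sketch.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Round-to-nearest-even bias, then drop the low 16 bits.
    bits += 0x7FFF + ((bits >> 16) & 1)
    bits &= 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def max_rounding_error(values):
    """Largest absolute error introduced by rounding each value to bfloat16."""
    return max(abs(to_bfloat16(v) - v) for v in values)


# Hypothetical per-token log probabilities.
log_probs = [-0.0123, -2.3456, -11.7]

print(to_bfloat16(-11.7))             # -11.6875
print(max_rounding_error(log_probs))  # worst-case absolute error in this sample
```

With only 8 bits of significand precision, the relative rounding error is bounded by 2^-8 (about 0.4%), so the absolute error grows with the magnitude of the log probability. Whether that is acceptable depends on how the values are used downstream (for example, whether they are summed over long sequences or exponentiated), which is the part only the maintainers can answer. On the transfer side, bfloat16 uses 2 bytes per element versus 4 for float32, so the payload is half the size.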