Track entropy at training time #178

corbt · 2025-06-30T08:04:10Z

It may be useful to see how entropy changes over time, since once entropy collapses the model is unlikely to learn much more.

I'm not sure this is the best way to do it. Notably, this approach can only track entropy during training, not on the val set.

This is a subset of my changes from the increase-entropy branch, where I tried to explicitly reward increased entropy. That branch did maintain higher entropy, but it didn't seem to help training, so I only copied over the reporting changes here.
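For readers unfamiliar with the metric, here is a minimal sketch of how a per-token entropy could be computed from logits and averaged over the trained tokens. It is not the code from this PR: the function names `token_entropy` and `mean_entropy` and the boolean `mask` are hypothetical, and plain Python lists stand in for tensors.

```python
import math


def token_entropy(logits):
    """Shannon entropy (in nats) of softmax(logits) at one token position."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    probs = [e / z for e in exps]
    # Skip zero-probability entries, since 0 * log(0) is taken to be 0.
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def mean_entropy(batch_logits, mask):
    """Average entropy over positions where mask is True.

    `mask` is a hypothetical flag per position, e.g. marking the tokens
    that actually contribute to the training loss.
    """
    total, count = 0.0, 0
    for logits, keep in zip(batch_logits, mask):
        if keep:
            total += token_entropy(logits)
            count += 1
    return total / count if count else 0.0
```

A uniform distribution over `n` tokens gives the maximum value `log(n)`, and a sharply peaked one gives a value near zero, so a mean that drifts toward zero across training steps is the collapse described above.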

bradhilton

Looks great!

corbt requested a review from bradhilton June 30, 2025 08:04

bradhilton approved these changes Jun 30, 2025

bradhilton merged commit 9ece8d1 into main Jun 30, 2025
1 check passed

bradhilton deleted the entropy-tracking branch June 30, 2025 19:06
