Updated 1B Configs #866

aman-17 · 2025-07-16T18:36:52Z

Updated 1B configs wrt to peteish1-anneal.sh to avoid confusion for users.

2015aroras · 2025-07-16T21:35:20Z

configs/official-0425/OLMo2-1B-stage2-seed666.yaml

 eval_interval: 1000
 eval_subset_num_batches: -1
-device_eval_batch_size: ${device_train_microbatch_size}
+device_eval_batch_size: 8


Why is this needed?

Just matching all the overwritten comments from this bash script.

2015aroras · 2025-07-16T21:37:02Z

README.md

-torchrun --nproc_per_node=8 scripts/train.py {path_to_train_config}
+torchrun --nproc_per_node=8 scripts/train.py {path_to_train_config} --force_save_unsharded=True
 ```
+> Please use `--force_save_unsharded=True` argument to save checkpoints in unsharded format.


I think setting save_num_checkpoints_to_keep = 0 achieves a similar end result and also doesn't save sharded checkpoints through training.

Updated all the configs.

…ve sharded ckpts

aman-17 added 2 commits July 16, 2025 11:27

updated 1b config's and readme

0db5b52

updated the load_paths

09d9a7c

aman-17 requested a review from liujch1998 July 16, 2025 18:36

aman-17 mentioned this pull request Jul 16, 2025

Olmo2 1B Stage 2 config does not start from Stage 1 checkpoint #865

Closed

aman-17 requested a review from 2015aroras July 16, 2025 21:14

2015aroras reviewed Jul 16, 2025

View reviewed changes

aman-17 added 2 commits July 16, 2025 14:53

updated save_num_checkpoints_to_keep=0 in all the configs to not sa…

600a8d0

…ve sharded ckpts

updated README

acf344d

2015aroras approved these changes Jul 16, 2025

View reviewed changes

aman-17 merged commit 0482070 into main Jul 16, 2025
9 of 10 checks passed

aman-17 deleted the amanr/1b_configs branch July 16, 2025 22:10

This was referenced Jul 16, 2025

Initial Stage 2 learning rates in config and wandb do not match #849

Closed

Updated 1B Config #850

Closed

aman-17 mentioned this pull request Jul 24, 2025

Cannot unshard using unshard.py #862

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updated 1B Configs #866

Updated 1B Configs #866

Uh oh!

aman-17 commented Jul 16, 2025

Uh oh!

2015aroras Jul 16, 2025

Uh oh!

aman-17 Jul 16, 2025

Uh oh!

2015aroras Jul 16, 2025

Uh oh!

aman-17 Jul 16, 2025

Uh oh!

Uh oh!

Uh oh!

Updated 1B Configs #866

Updated 1B Configs #866

Uh oh!

Conversation

aman-17 commented Jul 16, 2025

Uh oh!

2015aroras Jul 16, 2025

Choose a reason for hiding this comment

Uh oh!

aman-17 Jul 16, 2025

Choose a reason for hiding this comment

Uh oh!

2015aroras Jul 16, 2025

Choose a reason for hiding this comment

Uh oh!

aman-17 Jul 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!