@bm-synth bm-synth commented Mar 6, 2025

Adds an example for the variable batch size and LR use case, introduced in DeepSpeed PR 7104.

@bm-synth bm-synth marked this pull request as ready for review March 6, 2025 08:52
@bm-synth bm-synth requested a review from tjruwase as a code owner March 6, 2025 08:52

bm-synth commented Mar 6, 2025

ping @tjruwase

bm-synth commented Mar 7, 2025

@tjruwase I just replaced T with S in all documentation, code, and images, as requested.

@tjruwase

@bm-synth, FYI, I added versioning guidance to reduce user confusion.

@tjruwase tjruwase merged commit 420352c into deepspeedai:master Mar 11, 2025
2 checks passed
@bm-synth bm-synth deleted the variable_batch_size_and_lr_example branch March 11, 2025 17:46
hwchen2017 pushed a commit that referenced this pull request Jun 8, 2025
* moved example from DeepSpeed PR #7104 to this repo

* Update training/data_efficiency/variable_batch_size_and_lr/README.md

Co-authored-by: Olatunji Ruwase <[email protected]>

* Update training/data_efficiency/variable_batch_size_and_lr/README.md

Co-authored-by: Olatunji Ruwase <[email protected]>

* replaced T by S for sequence length

* replaced T by S for sequence length

* replaced T by S for sequence length

* more detailed explanation

* --pipeline-num-stages is now a command-line argument

* cleaner syntax

* Update training/data_efficiency/variable_batch_size_and_lr/README.md

---------

Co-authored-by: Olatunji Ruwase <[email protected]>