[Train] Training Pipeline #1214

horheynm · 2025-02-28T20:57:23Z

Order of reviews:
#1206
#1207
#1209
#1212
#1214 <-- Here

SUMMARY:

Refactor Training pipeline
Remove initialize, finalize from the session functions
Add training information on entrypoints/readme.md on the different types of training that can be carried out on llm-compressor
Decouple training from text_generation.py::main. The new logic loves in llmcompressor/entrypoints/train.py that takes the flow of pre-process, carry out training logic and then post-process
Delete outdated info on transformers/finetune/readme.md
Update session_mixin.py to use session().initialize or session().finalize.
Deprecate train.py in text_generation.py, raising deprecation message if used.
Update tests to use llmcompressor's train, not llmcompressor.transformers' train

TEST PLAN:

Pass tests

Signed-off-by: George Ohashi <[email protected]>

github-actions · 2025-02-28T20:57:35Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.

Order of reviews: #1206 #1207 <-- Here #1209 #1212 #1214 SUMMARY: * Decouple arg parser to be used for both oneshot and train TEST PLAN: * Pass tests

Order of reviews: #1206 <-- Here #1207 #1209 #1212 #1214 SUMMARY: Rename data_args to dataset_args TEST PLAN: Pass tests FInd `data_args` using `grep` --------- Signed-off-by: George Ohashi <[email protected]> Co-authored-by: Dipika Sikka <[email protected]>

Order of reviews: #1206 #1207 #1209 <-- Here #1212 #1214 SUMMARY: * Move dataset logic out of transformers module `src/llmcompressor/transformers/finetune/data/data_helpers.py`, add it to `src/llmcompressor/datasets/utils.py` TEST PLAN: Pass tests

…ot (#1212) Order of reviews: #1206 #1207 #1209 #1212 <-- Here #1214 SUMMARY: * Move the preprocessing and postprocessing logic out of `src/llmcompressor/transformers/finetune/text_generation.py` and into `src/llmcompressor/entrypoints/utils.py` TEST PLAN: Pass tests

rahul-tuli · 2025-03-10T14:21:20Z

LGTM pending tests!

…ot (#1212) Order of reviews: #1206 #1207 #1209 #1212 <-- Here #1214 SUMMARY: * Move the preprocessing and postprocessing logic out of `src/llmcompressor/transformers/finetune/text_generation.py` and into `src/llmcompressor/entrypoints/utils.py` TEST PLAN: Pass tests Signed-off-by: Brian Dellabetta <[email protected]>

…rain

dsikka

Please write a more descriptive PR description, summarizing changes and test steps.
The current description isn't very helpful.

src/llmcompressor/core/session_functions.py

dsikka

LGTM given one comment.
We've validated with our current examples:

run to completion, outputs make sense?

src/llmcompressor/transformers/finetune/text_generation.py

train - main refac

82b1b62

Signed-off-by: George Ohashi <[email protected]>

This was referenced Feb 28, 2025

[Training] Datasets - update Module #1209

Merged

[Training] Unifying Preprocess + Postprocessing logic for Train/Oneshot #1212

Merged

[Training] Decouple Argument parser #1207

Merged

[Cosmetic] Rename data_args to dataset_args #1206

Merged

dsikka pushed a commit that referenced this pull request Mar 3, 2025

[Training] Decouple Argument parser (#1207)

7bb517f

Order of reviews: #1206 #1207 <-- Here #1209 #1212 #1214 SUMMARY: * Decouple arg parser to be used for both oneshot and train TEST PLAN: * Pass tests

Merge branch 'main' into train

11a8ad6

brian-dellabetta previously approved these changes Mar 6, 2025

View reviewed changes

Merge branch 'main' into train

5edf461

horheynm dismissed brian-dellabetta’s stale review via 5edf461 March 6, 2025 19:07

horheynm added 2 commits March 6, 2025 14:40

merge main

0ce00ad

add train logic

11d0cc3

horheynm added the ready When a PR is ready for review label Mar 6, 2025

add type checks

fdcfc0d

horheynm changed the title ~~[Train] Main refac~~ [Train] Training Pipeline Mar 6, 2025

Merge branch 'main' into train

0b0dd59

brian-dellabetta previously approved these changes Mar 7, 2025

View reviewed changes

horheynm added 3 commits March 11, 2025 11:09

Merge branch 'main' into train

75c8e92

remove link on markdown

3d23bb0

Merge branch 'train' of github.com:vllm-project/llm-compressor into t…

9b0dee4

…rain

horheynm dismissed brian-dellabetta’s stale review via 9b0dee4 March 11, 2025 15:24

brian-dellabetta approved these changes Mar 11, 2025

View reviewed changes

dsikka requested changes Mar 11, 2025

View reviewed changes

kylesayrs requested changes Mar 11, 2025

View reviewed changes

src/llmcompressor/core/session_functions.py Show resolved Hide resolved

dsikka reviewed Mar 12, 2025

View reviewed changes

src/llmcompressor/transformers/finetune/text_generation.py Show resolved Hide resolved

Merge branch 'main' into train

62dd640

dsikka approved these changes Mar 13, 2025

View reviewed changes

kylesayrs approved these changes Mar 13, 2025

View reviewed changes

kylesayrs merged commit d43ea79 into main Mar 13, 2025
8 checks passed

kylesayrs deleted the train branch March 13, 2025 14:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Train] Training Pipeline #1214

[Train] Training Pipeline #1214

Uh oh!

horheynm commented Feb 28, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Feb 28, 2025

Uh oh!

rahul-tuli commented Mar 10, 2025

Uh oh!

dsikka left a comment

Uh oh!

Uh oh!

dsikka left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[Train] Training Pipeline #1214

[Train] Training Pipeline #1214

Uh oh!

Conversation

horheynm commented Feb 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 28, 2025

Uh oh!

rahul-tuli commented Mar 10, 2025

Uh oh!

dsikka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dsikka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

horheynm commented Feb 28, 2025 •

edited

Loading