v0.5.0
What's Changed
- re-add vllm e2e test now that bug is fixed by @brian-dellabetta in #1162
- Fix Readme Imports by @kylesayrs in #1165
- Remove event_called by @kylesayrs in #1155
- Update: Test name by @rahul-tuli in #1172
- Remove lifecycle initialized_structure attribute by @kylesayrs in #1156
- [VLM] Qwen 2.5 VL by @kylesayrs in #1113
- Revert bump by @dsikka in #1178
- Remove CLI by @dsikka in #1144
- Add group act order case to lm_eval test by @dsikka in #1080
- Update e2e test timings ouputs by @dsikka in #1179
- [Oneshot Refactor] Main refactor by @horheynm in #1110
- [StageRunner Removal] Remove Evalulate / validate pathway by @horheynm in #1145
- [StageRemoval] Remove Predict pathway by @horheynm in #1146
- Fix 2of4 Apply Example by @dsikka in #1181
- Fix Sparse2of4 Example by @dsikka in #1182
- Add qwen moe w4a16 example by @mgoin in #1186
- [Callbacks] Consolidate Saving Methods by @kylesayrs in #1168
- lmeval tests multimodal by @brian-dellabetta in #1150
- [Dataset Performance] Add num workers on dataset processing - labels, tokenization by @horheynm in #1189
- Fix a minor typo by @eldarkurtic in #1191
- [Callbacks] Remove pre_initialize_structure by @kylesayrs in #1160
- Make
transformers-tests
job conditional on files changed by @dbarbuzzi in #1197 - Update finetune tests to decrease execution time by @dsikka in #1208
- Update transformers tests to speed-up execution by @dsikka in #1211
- Fix logging bug in oneshot.py by @aman2304 in #1213
- [Training] Decouple Argument parser by @horheynm in #1207
- Remove MonkeyPatch for GPUs by @dsikka in #1227
- [Cosmetic] Rename data_args to dataset_args by @horheynm in #1206
- [Training] Datasets - update Module by @horheynm in #1209
- [BugFix] Fix logging disabling bug and add tests by @aman2304 in #1218
- [Training] Unifying Preprocess + Postprocessing logic for Train/Oneshot by @horheynm in #1212
- [Docs] Add info on when to use which PTQ/Sparsification by @horheynm in #1157
- [Callbacks] Remove
MagnitudePruningModifier.leave_enabled
by @kylesayrs in #1198 - Replace Xenova model stub with nm-testing model stub by @kylesayrs in #1239
- Offload Cache Support torch.dtype by @kylesayrs in #1141
- Remove unused/duplicated/non-applicable utils from pytorch/utils/helpers by @kylesayrs in #1174
- [Bugfix] Staged 2of4 example by @kylesayrs in #1238
- wandb/tensorboard loggers set default init to False by @brian-dellabetta in #1235
- fixing reproducibility of lmeval tests by @brian-dellabetta in #1220
- [Audio] People's Speech dataset and tracer tool by @kylesayrs in #1086
- Use KV cache constant names provided by compressed tensors by @kylesayrs in #1200
- [Bugfix] Raise error for processor remote code by @kylesayrs in #1184
- Remove missing weights silencers in favor of HFQuantizer solution by @kylesayrs in #1017
- Fix run_compressed tests by @dsikka in #1246
- [Train] Training Pipeline by @horheynm in #1214
- [Tests] Increase maximum quantization error by @kylesayrs in #1245
- [Callbacks] Remove EventLifecycle and on_start event by @kylesayrs in #1170
- [Bugfix] Disable generation of deepseek models with transformers>=4.48 by @kylesayrs in #1259
- Remove clear_ml by @dsikka in #1261
- [Tests] Remove clear_ml test from GHA by @kylesayrs in #1265
- Remove click by @dsikka in #1262
- [Bugfix] Remove constant pruning from 2of4 examples by @kylesayrs in #1267
- Addback: ConstantPruningModifier for finetuning cases by @rahul-tuli in #1272
- Remove docker by @kylesayrs in #1255
- move failing mulitmodal lmeval tests to skipped folder by @brian-dellabetta in #1273
- Replace tj-action/changed-files by @dbarbuzzi in #1270
- [BugFix]: Sparse2of4 example sparsity-only case by @rahul-tuli in #1282
- Revert "update" by @dsikka in #1296
- Fix Multi-Context Manager Syntax for Python 3.9 Compatibility by @rahul-tuli in #1287
- Revert "Fix Multi-Context Manager Syntax for Python 3.9 Compatibility… by @dsikka in #1300
- [StageRunner] Stage Runner entrypoint and pipeline by @horheynm in #1202
- Bump: Min python version to 3.9 by @rahul-tuli in #1288
- Keep quantization enabled during calibration by @kylesayrs in #1299
- [BugFix] TRL distillation bug fix by @horheynm in #1278
- Update: Readme for fp8 support by @rahul-tuli in #1304
- [GPTQ] Add inversion fallback by @kylesayrs in #1283
- fix typo by @eldarkurtic in #1290
- [Tests] Fix oneshot + finetune test by passing splits to oneshot by @kylesayrs in #1316
- [Tests] Remove the
compress
entrypoint by @dsikka in #1317 - Fix Multi-Context Manager Syntax for Python 3.9 Compatibility by @rahul-tuli in #1313
- [BugFix] Directly Convert Modifiers to Recipe Instance by @rahul-tuli in #1271
- bump version, tag ct by @dsikka in #1318
New Contributors
Full Changelog: 0.4.1...0.5.0