Release v0.5.0 · vllm-project/llm-compressor

What's Changed

re-add vllm e2e test now that bug is fixed by @brian-dellabetta in #1162
Fix Readme Imports by @kylesayrs in #1165
Remove event_called by @kylesayrs in #1155
Update: Test name by @rahul-tuli in #1172
Remove lifecycle initialized_structure attribute by @kylesayrs in #1156
[VLM] Qwen 2.5 VL by @kylesayrs in #1113
Revert bump by @dsikka in #1178
Remove CLI by @dsikka in #1144
Add group act order case to lm_eval test by @dsikka in #1080
Update e2e test timings ouputs by @dsikka in #1179
[Oneshot Refactor] Main refactor by @horheynm in #1110
[StageRunner Removal] Remove Evalulate / validate pathway by @horheynm in #1145
[StageRemoval] Remove Predict pathway by @horheynm in #1146
Fix 2of4 Apply Example by @dsikka in #1181
Fix Sparse2of4 Example by @dsikka in #1182
Add qwen moe w4a16 example by @mgoin in #1186
[Callbacks] Consolidate Saving Methods by @kylesayrs in #1168
lmeval tests multimodal by @brian-dellabetta in #1150
[Dataset Performance] Add num workers on dataset processing - labels, tokenization by @horheynm in #1189
Fix a minor typo by @eldarkurtic in #1191
[Callbacks] Remove pre_initialize_structure by @kylesayrs in #1160
Make transformers-tests job conditional on files changed by @dbarbuzzi in #1197
Update finetune tests to decrease execution time by @dsikka in #1208
Update transformers tests to speed-up execution by @dsikka in #1211
Fix logging bug in oneshot.py by @aman2304 in #1213
[Training] Decouple Argument parser by @horheynm in #1207
Remove MonkeyPatch for GPUs by @dsikka in #1227
[Cosmetic] Rename data_args to dataset_args by @horheynm in #1206
[Training] Datasets - update Module by @horheynm in #1209
[BugFix] Fix logging disabling bug and add tests by @aman2304 in #1218
[Training] Unifying Preprocess + Postprocessing logic for Train/Oneshot by @horheynm in #1212
[Docs] Add info on when to use which PTQ/Sparsification by @horheynm in #1157
[Callbacks] Remove MagnitudePruningModifier.leave_enabled by @kylesayrs in #1198
Replace Xenova model stub with nm-testing model stub by @kylesayrs in #1239
Offload Cache Support torch.dtype by @kylesayrs in #1141
Remove unused/duplicated/non-applicable utils from pytorch/utils/helpers by @kylesayrs in #1174
[Bugfix] Staged 2of4 example by @kylesayrs in #1238
wandb/tensorboard loggers set default init to False by @brian-dellabetta in #1235
fixing reproducibility of lmeval tests by @brian-dellabetta in #1220
[Audio] People's Speech dataset and tracer tool by @kylesayrs in #1086
Use KV cache constant names provided by compressed tensors by @kylesayrs in #1200
[Bugfix] Raise error for processor remote code by @kylesayrs in #1184
Remove missing weights silencers in favor of HFQuantizer solution by @kylesayrs in #1017
Fix run_compressed tests by @dsikka in #1246
[Train] Training Pipeline by @horheynm in #1214
[Tests] Increase maximum quantization error by @kylesayrs in #1245
[Callbacks] Remove EventLifecycle and on_start event by @kylesayrs in #1170
[Bugfix] Disable generation of deepseek models with transformers>=4.48 by @kylesayrs in #1259
Remove clear_ml by @dsikka in #1261
[Tests] Remove clear_ml test from GHA by @kylesayrs in #1265
Remove click by @dsikka in #1262
[Bugfix] Remove constant pruning from 2of4 examples by @kylesayrs in #1267
Addback: ConstantPruningModifier for finetuning cases by @rahul-tuli in #1272
Remove docker by @kylesayrs in #1255
move failing mulitmodal lmeval tests to skipped folder by @brian-dellabetta in #1273
Replace tj-action/changed-files by @dbarbuzzi in #1270
[BugFix]: Sparse2of4 example sparsity-only case by @rahul-tuli in #1282
Revert "update" by @dsikka in #1296
Fix Multi-Context Manager Syntax for Python 3.9 Compatibility by @rahul-tuli in #1287
Revert "Fix Multi-Context Manager Syntax for Python 3.9 Compatibility… by @dsikka in #1300
[StageRunner] Stage Runner entrypoint and pipeline by @horheynm in #1202
Bump: Min python version to 3.9 by @rahul-tuli in #1288
Keep quantization enabled during calibration by @kylesayrs in #1299
[BugFix] TRL distillation bug fix by @horheynm in #1278
Update: Readme for fp8 support by @rahul-tuli in #1304
[GPTQ] Add inversion fallback by @kylesayrs in #1283
fix typo by @eldarkurtic in #1290
[Tests] Fix oneshot + finetune test by passing splits to oneshot by @kylesayrs in #1316
[Tests] Remove the compress entrypoint by @dsikka in #1317
Fix Multi-Context Manager Syntax for Python 3.9 Compatibility by @rahul-tuli in #1313
[BugFix] Directly Convert Modifiers to Recipe Instance by @rahul-tuli in #1271
bump version, tag ct by @dsikka in #1318

New Contributors

@aman2304 made their first contribution in #1213

Full Changelog: 0.4.1...0.5.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.5.0

What's Changed

New Contributors

Contributors

Uh oh!