Update examples/quantization_24_sparse_w4a16 README #52

dbarbuzzi · 2024-08-02T15:25:23Z

SUMMARY:
This PR fixes incorrect file and folder name references in the example README: https://github.com/vllm-project/llm-compressor/blob/main/examples/quantization_24_sparse_w4a16/README.md

Fix filenames/paths

* group size
* add logic in base observer
* group size full lifecycle run
* before vectorize the for loop
* comments, todo add channelwise
* chan wise impl
* comments
* fix channel wise
* comments, validators
* fix typo
* tensor return error fix
* fix sparseml-side of code and add per channel
* pyndatic defaults
* token wise quant
* Update src/compressed_tensors/quantization/quant_args.py
  Co-authored-by: Benjamin Fineran <[email protected]>
* comments'
* update dim
* shape consistency
* Update src/compressed_tensors/quantization/lifecycle/forward.py
  Co-authored-by: Benjamin Fineran <[email protected]>
* comments
* pass test_quant_args
* fix channelwise
* new tests, some fail
* WIP
* group compression
* fix output type on decompress
* fix channelwise
* revert
* more tests

---------

Co-authored-by: George Ohashi <[email protected]>
Co-authored-by: Benjamin Fineran <[email protected]>

Update example README.md

7963549

Fix filenames/paths

bfineran approved these changes Aug 5, 2024

bfineran merged commit 0a62ffc into vllm-project:main Aug 5, 2024
8 of 12 checks passed

dbarbuzzi deleted the patch-1 branch October 7, 2024 14:01
