Skip to content

Commit 84c3806

Browse files
authored
add quantization tools (#477)
1 parent 2cced56 commit 84c3806

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -512,6 +512,7 @@ This repository contains a curated list of awesome open source libraries that wi
512512
* [AWQ](https://github.com/mit-han-lab/llm-awq) ![](https://img.shields.io/github/stars/mit-han-lab/llm-awq.svg?style=social) - Activation-aware Weight Quantization for LLM Compression and Acceleration.
513513
* [GPTQ](https://github.com/IST-DASLab/gptq) ![](https://img.shields.io/github/stars/IST-DASLab/gptq.svg?style=social) - Accurate Post-training Quantization of Generative Pretrained Transformers.
514514
* [neural-compressor](https://github.com/intel/neural-compressor) ![](https://img.shields.io/github/stars/intel/neural-compressor.svg?style=social) - Intel® Neural Compressor aims to provide popular model compression techniques such as quantization, pruning (sparsity), distillation, and neural architecture search on mainstream frameworks.
515+
* [Quanto](https://github.com/huggingface/quanto) ![](https://img.shields.io/github/stars/huggingface/quanto.svg?style=social) - Quanto aims to simplify quantizing deep learning models.
515516

516517

517518
## Industry Strength CV

0 commit comments

Comments
 (0)