Skip to content

Pull requests: NVIDIA/TensorRT-Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix hf_quant_config with kv cache type
#557 opened Nov 14, 2025 by jenchen13 Loading…
Support Wan2.2 t2v diffusers quantization
#556 opened Nov 13, 2025 by shengliangxu Loading…
GPTQ Lite implementation
#555 opened Nov 13, 2025 by sugunav14 Draft
Fix QLoRA example test
#553 opened Nov 13, 2025 by sugunav14 Loading…
Feat: Eagle3 HF Online - support nemotron models
#548 opened Nov 13, 2025 by h-guo18 Loading…
[5336870] AutoCast: Unblock LSTM from conversion
#544 opened Nov 12, 2025 by galagam Loading…
[OMNIML-3015]Add per tensor/per channel MSE calibrator
#540 opened Nov 12, 2025 by Fridah-nv Loading…
2 tasks
Optimize NVFP4 Triton kernel
#533 opened Nov 11, 2025 by mxinO Draft
parallel eagle draft
#523 opened Nov 6, 2025 by yeyu-nvidia Draft
[Bug #193] fix fp8 blockwise real quantization
#522 opened Nov 6, 2025 by meenchen Loading…
Fix BMM style MoE export in fp8_pc_pt recipe
#515 opened Nov 5, 2025 by Edwardf0t1 Loading…
Yeyu/set block
#480 opened Oct 28, 2025 by yeyu-nvidia Draft
feat: add onnxslim support
#478 opened Oct 28, 2025 by inisis Loading…
Feat: Eagle3 HF Online - support nemotron models
#463 opened Oct 25, 2025 by h-guo18 Loading…
ProTip! Adding no:label will show everything without a label.