-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
test: align kv_frac in perf test with perflab and add more cases for 4 gpus GB200
#6632
opened Aug 5, 2025 by
ruodil
Loading…
[https://nvbugs/5436461][infra] Adjust free_gpu_memory_fraction of test_eagle3 to prevent OOM on CI
#6631
opened Aug 5, 2025 by
leslie-fang25
Loading…
[TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow
#6629
opened Aug 5, 2025 by
Yuening-wa
Loading…
[TRTLLM-6637][feat] Resolve KV cache divergence issue
#6628
opened Aug 5, 2025 by
ziyixiong-nv
Loading…
[None][fix] Fix unnecessary GPU synchronization in torch sampler caused by incorrect tensor reference
Community want to contribute
PRs initiated from Community
#6626
opened Aug 5, 2025 by
zhanghaotong
Loading…
[None][perf] Improve the performance of online EPLB on Hopper by better overlapping
#6624
opened Aug 5, 2025 by
jinyangyuan-nvidia
Loading…
[TRTLLM-6772][feat] Multimodal benchmark_serving support
#6622
opened Aug 5, 2025 by
yechank-nvidia
Loading…
[None][Doc] Add doc for multimodal feature support matrix
#6619
opened Aug 5, 2025 by
chang-l
Loading…
[TRTLLM-6898][feat] make fused_moe_cute_dsl work on blackwell
#6616
opened Aug 5, 2025 by
limin2021
Loading…
refactor: Refactor Torch Compile Backend, MoeLoadBalancer and warmup Logic
#6615
opened Aug 5, 2025 by
yizhang-nv
Loading…
[None][fix] Adjust default moe_max_num_tokens to fix OOM.
#6614
opened Aug 5, 2025 by
yuxianq
Loading…
[None][doc] Created Deployment Guide for SGLang DeepSeek-R1 FP8 and NVFP4
Community want to contribute
PRs initiated from Community
#6610
opened Aug 4, 2025 by
jamieliNVIDIA
Loading…
Update CMakeLists.txt extend find_library names
Community want to contribute
PRs initiated from Community
#6609
opened Aug 4, 2025 by
mc-nv
Loading…
feat: Enable nanobind as the default binding library
#6608
opened Aug 4, 2025 by
Linda-Stadter
•
Draft
[TRTLLM-5633][infra] Change the TOT repo to default-llm-repo for merge waive list
#6605
opened Aug 4, 2025 by
yiqingy0
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.