-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Qwen3 runs seamlessly with Ollama. Here are some additional resources:
-
- The title "思深,行速" highlights its impressive speed and accuracy.
- Pre-trained on data covering 119 languages and dialects, including traditional chinese
-
Official model repository: Supports FP8 and GPT-Q.
-
Hugging Face dedicated article.
- still stress ollama is good for simlipcity(development). vllm is good for high concurrent(production).
-
Nvidia blog on TensorRT-LLM deployment noting Qwen3's unique features.
Other resources
Metadata
Metadata
Assignees
Labels
No labels