Skip to content

Conversation

sbalandi
Copy link
Contributor

@sbalandi sbalandi commented Sep 25, 2025

Description

To run with GenAI:
python tools/llm_bench/benchmark.py -m ./models/SmolLM2-135M-GGUF/SmolLM2-135M.F16.gguf -n 2 -d CPU

To run with transformers:
python tools/llm_bench/benchmark.py -m ./models/SmolLM2-135M-GGUF/SmolLM2-135M.F16.gguf -n 2 -d CPU -f pt

Ticket: CVS-171473

Checklist:

  • Tests have been updated or added to cover the new code
  • This patch fully addresses the ticket.
  • I have made corresponding changes to the documentation

@github-actions github-actions bot added category: llm_bench Label for tool/llm_bench folder category: GGUF GGUF file reader labels Sep 25, 2025
@sbalandi sbalandi force-pushed the gguf_bench branch 2 times, most recently from c877dbd to 5ab0158 Compare September 25, 2025 12:22
@sbalandi sbalandi requested a review from as-suvorov September 25, 2025 12:28
@sbalandi sbalandi force-pushed the gguf_bench branch 4 times, most recently from 3f98311 to 9724a9c Compare September 26, 2025 12:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: GGUF GGUF file reader category: llm_bench Label for tool/llm_bench folder
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant