Skip to content

Commit b8d74e5

Browse files
authored
chore(model gallery): add steiner-32b-preview (#4602)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent 62abe0d commit b8d74e5

File tree

1 file changed

+16
-0
lines changed

1 file changed

+16
-0
lines changed

gallery/index.yaml

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3122,6 +3122,22 @@
31223122
- filename: Qwen2.5-72b-RP-Ink-Q4_K_M.gguf
31233123
sha256: 2c2bf785dc5798403e0ccf6c4f5f9d7d53fcfb0c0b28855c584e09be88f91517
31243124
uri: huggingface://bartowski/Qwen2.5-72b-RP-Ink-GGUF/Qwen2.5-72b-RP-Ink-Q4_K_M.gguf
3125+
- !!merge <<: *qwen25
3126+
name: "steiner-32b-preview"
3127+
urls:
3128+
- https://huggingface.co/peakji/steiner-32b-preview
3129+
- https://huggingface.co/bartowski/steiner-32b-preview-GGUF
3130+
description: |
3131+
Steiner is a series of reasoning models trained on synthetic data using reinforcement learning. These models can explore multiple reasoning paths in an autoregressive manner during inference and autonomously verify or backtrack when necessary, enabling a linear traversal of the implicit search tree.
3132+
3133+
Steiner is a personal interest project by Yichao 'Peak' Ji, inspired by OpenAI o1. The ultimate goal is to reproduce o1 and validate the inference-time scaling curves. The Steiner-preview model is currently a work-in-progress. The reason for open-sourcing it is that I’ve found automated evaluation methods, primarily based on multiple-choice questions, struggle to fully reflect the progress of reasoning models. In fact, the assumption that "the correct answer is always among the options" doesn’t align well with real-world reasoning scenarios, as it encourages models to perform substitution-based validation rather than open-ended exploration. For this reason, I’ve chosen to open-source these intermediate results and, when time permits, to build in public. This approach allows me to share knowledge while also gathering more evaluations and feedback from real human users.
3134+
overrides:
3135+
parameters:
3136+
model: steiner-32b-preview-Q4_K_M.gguf
3137+
files:
3138+
- filename: steiner-32b-preview-Q4_K_M.gguf
3139+
sha256: 1d7bf6d6dc8db8c81b3e71dc89756cd23417bb0a645b7dcdd1f9457781a88652
3140+
uri: huggingface://bartowski/steiner-32b-preview-GGUF/steiner-32b-preview-Q4_K_M.gguf
31253141
- &smollm
31263142
## SmolLM
31273143
url: "github:mudler/LocalAI/gallery/chatml.yaml@master"

0 commit comments

Comments
 (0)