Commit 3e89975

README: fix model name and improve CUDA section

- Corrected the model name under the Benchmark section; previous name was not available in Ollama's registry.
- Added instructions to switch between CPU-only mode and using all available GPUs via CUDA_VISIBLE_DEVICES.

Signed-off-by: Mario Antonio Bortoli Filho <[email protected]>
1 parent 1d2e1a1 commit 3e89975

4 files changed, with 17 additions and 4 deletions

README.md

Lines changed: 2 additions & 2 deletions
@@ -224,7 +224,7 @@ $ cat /usr/share/ramalama/shortnames.conf
 <br>

 ```
-$ ramalama bench granite-moe3
+$ ramalama bench granite3-moe
 ```
 </details>

@@ -831,7 +831,7 @@ $ cat /usr/share/ramalama/shortnames.conf

 Perplexity measures how well the model can predict the next token with lower values being better
 ```
-$ ramalama perplexity granite-moe3
+$ ramalama perplexity granite3-moe
 ```
 </details>

docs/ramalama-bench.1.md

Lines changed: 1 addition & 1 deletion
@@ -148,7 +148,7 @@ Benchmark specified AI Model.
 ## EXAMPLES

 ```
-ramalama bench granite-moe3
+ramalama bench granite3-moe
 ```

 ## SEE ALSO

docs/ramalama-cuda.7.md

Lines changed: 13 additions & 0 deletions
@@ -137,6 +137,19 @@ ramalama run granite

 This is particularly useful in multi-GPU systems where you want to dedicate specific GPUs to different workloads.

+If the `CUDA_VISIBLE_DEVICES` environment variable is set to an empty string, RamaLama will default to using the CPU.
+
+```bash
+export CUDA_VISIBLE_DEVICES="" # Defaults to CPU
+ramalama run granite
+```
+
+To revert to using all available GPUs, unset the environment variable:
+
+```bash
+unset CUDA_VISIBLE_DEVICES
+```
+
 ## Troubleshooting

 ### CUDA Updates
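
Reviewer note: the added CUDA section depends on the difference between `CUDA_VISIBLE_DEVICES` being unset and being set to an empty string, which is easy to mix up in a shell. The following is a minimal sketch of how to tell the three states apart. The function name `describe_cuda_mode` and its output strings are hypothetical and not part of RamaLama; the sketch only inspects the environment and does not reproduce RamaLama's own detection logic.

```shell
# Hypothetical helper (not part of RamaLama): report which mode the current
# value of CUDA_VISIBLE_DEVICES would select, per the documentation above.
describe_cuda_mode() {
    # ${VAR+set} expands to "set" when VAR is defined, even if it is empty,
    # and to nothing when VAR is unset.
    if [ -z "${CUDA_VISIBLE_DEVICES+set}" ]; then
        echo "unset: all available GPUs"
    elif [ -z "$CUDA_VISIBLE_DEVICES" ]; then
        echo "empty: CPU only"
    else
        echo "restricted to GPU(s): $CUDA_VISIBLE_DEVICES"
    fi
}
```

Calling `describe_cuda_mode` before `ramalama run` shows which of the documented modes the current shell would select. For a one-off CPU run that leaves the session untouched, standard shell semantics also allow prefixing the assignment to a single command, as in `CUDA_VISIBLE_DEVICES="" ramalama run granite`.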

docs/ramalama-perplexity.1.md

Lines changed: 1 addition & 1 deletion
@@ -156,7 +156,7 @@ Calculate the perplexity of an AI Model. Perplexity measures how well the model
 ## EXAMPLES

 ```
-ramalama perplexity granite-moe3
+ramalama perplexity granite3-moe
 ```

 ## SEE ALSO
