Pinned
- llama-cpp-python-wheels (Public): Pre-built wheels for llama-cpp-python across platforms and CUDA versions.
- llama_32_3b_instruct_launcher (Public, Python): Quick and dirty launcher scripts for Llama-3.2-3B-Instruct using HuggingFace Transformers. Includes a full version with optimizations and a bare-bones minimal version. Just update the model path and run.
- llama_32_3b_instruct_launcher_gguf (Public, Python): Quick and dirty launcher scripts for Llama-3.2-3B-Instruct GGUF models using llama-cpp-python. Includes a full version with optimizations and a bare-bones minimal version. Just update the model path an…
- llama_32_3b_instruct_launcher_gguf_metal (Public, Python): Quick and dirty macOS launcher for Llama-3.2-3B-Instruct GGUF models with Metal GPU acceleration. Optimized for Apple Silicon (M1/M2/M3/M4). Just compile with Metal support, update the model path, …
- llama_32_3b_instruct_launcher_langchain (Public, Python): Quick and dirty LangChain launcher for Llama-3.2-3B-Instruct GGUF models. Minimal setup using LangChain abstractions with streaming chat. Just update the model path and run.
- llama_32_3b_instruct_launcher_ollama (Public, Python): Quick and dirty Ollama launcher for Llama-3.2-3B-Instruct. Uses the Ollama API for automatic model management and hardware optimization. Just install Ollama, pull the model, and run.