localllama

Star

Here are 43 public repositories matching this topic...

mostlygeek / llama-swap

Sponsor

Star

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

golang openai llama openai-api llamacpp vllm localllm localllama

Updated Jun 21, 2026
Go

awaescher / OllamaSharp

Sponsor

Star

The easiest way to use Ollama in .NET

library streaming ai llama gpt llm llamacpp ollama ollama-api localllama microsoft-extensions-ai ichatclient

Updated Mar 24, 2026
C#

sozercan / kubectl-ai

Star

✨ Kubectl plugin to create manifests with LLMs

kubernetes ai openai k8s kubectl gpt hacktoberfest kubectl-plugins openai-api gpt-4 llm chatgpt open-llm localllama

Updated Jan 27, 2025
Go

kaito-project / aikit

Star

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Updated Jun 22, 2026
Go

SqueezeAILab / KVQuant

Star

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

natural-language-processing compression text-generation transformer llama quantization mistral model-compression efficient-inference efficient-model large-language-models llm small-models localllm localllama

Updated Aug 13, 2024
Python

moritztng / fltr

Star

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

rust cli operating-system llama grep mistral grep-like llm llama-2 localllama mixtral mixtral-8x7b

Updated Mar 13, 2024
Rust

BrutalCoding / aub.ai

Sponsor

Star

AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.

Updated Apr 27, 2024
Dart

poloclub / wordflow

Star

Social and customizable AI writing assistant! ✍️

ai gemini gpt mlc gpt-4 writing-assistant llm prompt-engineering llama2 localllama gemini-pro

Updated Apr 7, 2026
TypeScript

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

golang metal vulkan cuda self-hosted moe inference-server multi-gpu rocm openai-api llm llamacpp llama-cpp local-llm gguf speculative-decoding localllama ollama-alternative

Updated Jun 21, 2026
Go

lordmathis / llamactl

Star

Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.

self-hosted mlx openai-api llm llamacpp llama-cpp vllm llm-inference localllm localllama llama-server llm-router mlx-lm

Updated Jun 21, 2026
Go

lef-fan / aria

Star

A local and uncensored AI entity.

python bot text-to-speech ai deep-learning speech pytorch tts assistant vad speech-to-text voice-assistant large-language-models llm xttsv2 localllama llamacpp-python kokoro-tts

Updated Aug 1, 2025
Python

yankeexe / llm-rag-with-reranker-demo

Star

LLM RAG Application with Cross-Encoders Re-ranking for YouTube video 🎥

awesome re-ranking rag streamlit cross-encoders langchain retrieval-augmented-generation ollama localllama

Updated Dec 8, 2025
Python

napmany / llmsnap

Star

Fast LLM swapping with sleep/wake support, compatible with vllm, llama.cpp, etc. llama-swap fork.

golang openai openai-api llm llm-serving llamacpp vllm localllm ai-gateway localllama llmrouter

Updated Apr 5, 2026
Go

grigio / opencode-benchmark-dashboard

Star

Benchmark system for testing opencode with various LLM models, measuring speed (latency) and correctness (accuracy).

benchmark opencode local-llm localllama openclaw

Updated Jun 5, 2026
TypeScript

BrunoArsioli / llama-optimus

Star

Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine

benchmark automation optimization optuna llamacpp localai localllama

Updated Jun 30, 2025
Python

thomas9120 / LLama-GUI

Star

User friendly GUI for Llama.cpp for easy configuration and launching.

gui llm llamacpp llama-cpp llm-inference gguf localllama

Updated Jun 18, 2026
JavaScript

cuolm / pi-sbx-llamacpp

Star

Run Pi coding agent isolated in a Docker Sandbox microVM with a local llama-server as the inference backend

sbx docker-sandbox microvm ai-agent llama-cpp local-llm gguf localllama llama-server pi-agent pi-coding-agent

Updated Jun 16, 2026

CloudToLocalLLM-online / CloudToLocalLLM

Star

Privacy-first desktop AI companion with 5 pillars: unified chat, OpenClaw Gateway, evolving avatar, desktop control, and vision. Auth0 with encrypted tunneling.

ai chatbot localllm ollama-api localllama

Updated Jun 20, 2026
Dart

slb350 / open-agent-sdk-rust

Star

Rust SDK for building AI agents with local OpenAI-compatible servers (LMStudio, Ollama, llama.cpp, vLLM). Features streaming, tools, hooks, retry logic, and comprehensive examples.

rust openai rust-library rust-crate llm llamacpp vllm localllm ollama localllama

Updated Jun 20, 2026
Rust

seyf1elislam / OneClick_LLM_API_onColab

Star

Run gguf LLM models in Latest Version TextGen-webui and koboldcpp

python colab-notebook llm llms gptq localllm exllama gguf localllama

Updated Aug 6, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the localllama topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the localllama topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

localllama

Here are 43 public repositories matching this topic...

mostlygeek / llama-swap

awaescher / OllamaSharp

sozercan / kubectl-ai

kaito-project / aikit

SqueezeAILab / KVQuant

moritztng / fltr

BrutalCoding / aub.ai

poloclub / wordflow

raketenkater / ggrun

lordmathis / llamactl

lef-fan / aria

yankeexe / llm-rag-with-reranker-demo

napmany / llmsnap

grigio / opencode-benchmark-dashboard

BrunoArsioli / llama-optimus

thomas9120 / LLama-GUI

cuolm / pi-sbx-llamacpp

CloudToLocalLLM-online / CloudToLocalLLM

slb350 / open-agent-sdk-rust

seyf1elislam / OneClick_LLM_API_onColab

Improve this page

Add this topic to your repo