llamagram is a starter repo for running a local LLM with MCP tools and chatting with it over Telegram. It is local-first by default: local model, local stack, and local/custom MCP servers in this repo. You can still plug in third-party MCP servers when needed.
- Define an agent (model path, system prompt, MCP tools)
- Run a local llama-server with your GGUF model
- Expose tools via mcp-proxy (local/custom MCP servers by default)
- Chat through a Telegram bot that bridges it all together
Three agents are included:

- `assistant`: general starter profile
- `beepboop`: playful assistant, no tools
- `timekeeper`: always responds with the current time, uses the local `time` MCP server
You can use local/custom MCP servers from this repo by default, and you can also wire in third-party MCP servers by editing each agent's `mcp.config.json`.
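As a rough sketch, an `mcp.config.json` could wire up both the bundled local `time` server and a third-party one. The `mcpServers` layout below follows the common MCP client convention, and the local server's run command and entry point are assumptions; check the bundled agents for the exact schema this repo expects:

```json
{
  "mcpServers": {
    "time": {
      "command": "uv",
      "args": ["run", "--directory", "mcps/time", "server.py"]
    },
    "fetch": {
      "command": "uvx",
      "args": ["mcp-server-fetch"]
    }
  }
}
```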
You'll need:

- Linux or macOS
- `llama-server` binary (llama.cpp)
- `uv` and `uvx`
- Python 3.10+
Edit `infra/llama/defaults.env`:

```
LLAMA_SERVER_BIN=/path/to/llama.cpp/build/bin/llama-server
```

Edit the existing `assistant` agent's `agents/assistant/agent.env` or create a new agent at `agents/<name>/agent.env`:
```
AGENT_LLAMA_MODEL=/path/to/your/model.gguf
AGENT_LLAMA_ALIAS=your/model-name
```

Any `AGENT_LLAMA_FLAG_*` var gets passed as a flag to llama-server:

```
AGENT_LLAMA_FLAG_CTX_SIZE=32768  # → --ctx-size 32768
AGENT_LLAMA_FLAG_JINJA=true      # → --jinja
```
`--model`, `--alias`, and `--port` are managed by the stack; don't set these.
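Putting it together, a minimal complete `agent.env` might look like this sketch (the model path, alias, and flag values are placeholders to adapt):

```
AGENT_LLAMA_MODEL=/models/my-model-q4_k_m.gguf   # placeholder path
AGENT_LLAMA_ALIAS=my/model-name                  # placeholder alias
AGENT_LLAMA_FLAG_CTX_SIZE=32768
AGENT_LLAMA_FLAG_JINJA=true
```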
```bash
./llm.sh start assistant
```

This validates your config, starts llama-server, and launches mcp-proxy.
If you prefer to stay fully local, you can stop here and use the llama-server web UI directly at http://127.0.0.1:8002.
Make sure to connect the correct MCP servers in the UI if you want tool access there.
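llama-server also speaks an OpenAI-compatible HTTP API, so you can script against the same endpoint. A quick smoke test, assuming the default port above and the alias you configured:

```bash
curl http://127.0.0.1:8002/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "your/model-name", "messages": [{"role": "user", "content": "Hello!"}]}'
```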
```bash
cd telegram-bot
cp .env.example .env
```

Fill in:

- `TELEGRAM_BOT_TOKEN`: from @BotFather
- `TELEGRAM_ALLOWED_USER_IDS`: your Telegram user ID(s), comma-separated
- `IMAGE_SUPPORT_ENABLED`: set to `true` if your model supports vision input (pass the `--mmproj` flag to llama-server)
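A filled-in `.env` might look like this (all values below are placeholders):

```
TELEGRAM_BOT_TOKEN=123456789:AAExampleTokenFromBotFather
TELEGRAM_ALLOWED_USER_IDS=11111111,22222222
IMAGE_SUPPORT_ENABLED=false
```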
```bash
cd telegram-bot
./start.sh
```

The bot auto-discovers the running model and active agent, and starts relaying your Telegram messages.
Handy `llm.sh` commands:

```bash
./llm.sh list-agents
./llm.sh start <agent>
./llm.sh stop
./llm.sh restart <agent>
./llm.sh status
./llm.sh logs llama
./llm.sh logs mcp
```

Each agent lives in `agents/<name>/` and requires three files:
| File | Purpose |
|---|---|
| `agent.env` | Model path, alias, llama flags |
| `mcp.config.json` | MCP tool configuration |
| `system.md` | System prompt |
Required vars in `agent.env`: `AGENT_LLAMA_MODEL`, `AGENT_LLAMA_ALIAS`
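One way to scaffold a new agent is to copy the bundled `assistant` and edit from there (a sketch; `myagent` is a placeholder name):

```bash
cp -r agents/assistant agents/myagent
# edit agent.env, mcp.config.json, and system.md in agents/myagent/, then:
./llm.sh start myagent
```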
llamagram runs a single agent stack at a time. Starting a second agent while one is running is blocked; use `restart` to switch. The active agent is tracked in `.state/active-agent`.
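For example, to switch from the running agent to `timekeeper`:

```bash
./llm.sh status               # shows the active agent
./llm.sh restart timekeeper   # stops the current stack and starts timekeeper
```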
Repo layout:

```
agents/        one folder per agent
mcps/          custom MCP servers (includes a time example)
telegram-bot/  Telegram bridge
bin/           shared shell helpers
infra/llama/   stack defaults
.state/        runtime files, logs, PIDs (generated at runtime)
```
This project is licensed under the MIT License. See LICENSE.