AlphaAvatar is a self-hostable Omni Personal Assistant framework designed to evolve into an intelligent personal butler β a continuous, personalized, and proactive assistant that can remember, understand, plan, and act on behalf of the user.
It is built around a plugin-based real-time Agent architecture, combining:
- π§ Memory for long-term user, assistant, and tool interaction history
- 𧬠Persona for user understanding, identity continuity, and personalization
- π‘ Reflection for self-improvement and long-term behavioral adaptation
- π Planning for task decomposition, reminders, and future-oriented actions
- βοΈ Behavior for response style, workflow policy, and proactive assistance
- π§° Tools through MCP, RAG, DeepResearch, and external integrations
- π Virtual Character for real-time voice/avatar interaction
β¨ Fully self-hostable and privacy-first β AlphaAvatar can run locally or on your own infrastructure, giving you control over your data, memory, tools, and behavior.
|
|
|
|
|
|
π‘ AlphaAvatar is not just a chatbot. It is a foundation for building stateful, proactive, multimodal, and self-evolving personal AI assistants.
|
Intermediate status system for reducing perceived latency during thinking, tool calls, and multi-step workflows. |
Omni interaction routing module that decides whether the Avatar should respond, how the request should be handled, and which status feedback should be emitted. |
|
Self-improving memory module for Omni-Avatar. |
Automatic extraction and real-time matching of multimodal user persona. |
|
A self-improvement module that reflects on memory, behavior, and interaction history. |
Long-horizon planning module for tasks, reminders, goals, and multi-step workflows. |
|
Controls response style, workflow policy, tool-use behavior, and proactive assistance rules. |
The real-time generated virtual character that visually represents the Avatar during interactions. |
|
Allow AlphaAvatar to access the network and perform single-step/multi-step inference through a separate Agent service to search for more accurate content. |
Allow AlphaAvatar to access Documents/Skills (user-uploaded/generated by the Reflection module/URL access) to obtain document-related information. |
|
Allows AlphaAvatar to discover and call real-world external tools such as databases, email, calendars, APIs, and productivity apps. |
Provide AlphaAvatar with a sandbox environment to interact with the external world or with other agents, thereby enabling multi-agent interaction and exploration. |
-
[2026/06] Released AlphaAvatar version 0.6.0: Added the Status plugin, sampled visual input support, and status-aware DeepResearch / RAG / MCP tool feedback.
-
[2026/05] Released AlphaAvatar version 0.5.4:
- Added LanceDB-backed MCP tool retrieval, enabling AlphaAvatar to semantically search relevant MCP tools from Agent queries.
- Refactored system prompt and runtime prompt composition, improved Persona runtime state tracking, added temporary-user to real-user identity merging, and improved RAG runtime behavior.
- Released AlphaAvatar version 0.5.5: Fixed the inference runner registration lifecycle for production
startmode, ensuring plugins runners are registered after config parsing and before LiveKit creates the inference executor.
-
[2026/04] Released AlphaAvatar version 0.5.3:
- Added localized Markdown backup for the Memory plugin.
- Added LanceDB as the default local VDB option when Qdrant credentials are not provided.
-
[2026/03] Released AlphaAvatar version 0.5.0:
-
[2026/02] Released AlphaAvatar version 0.4.0:
- Added RAG support through RAG-Anything.
- Optimized the Memory and DeepResearch modules.
- Released AlphaAvatar version 0.4.1: Fixed Persona plugin bugs and added a new MCP plugin.
-
[2026/01] Released AlphaAvatar version 0.3.0:
- Added DeepResearch support through the Tavily API.
- Released AlphaAvatar version 0.3.1: Added tool-call memory extraction during userβassistant interactions.
2025 Release History
-
[2025/12] Released AlphaAvatar version 0.2.0:
- Added AIRI Live2D-based virtual character display.
-
[2025/11] Released AlphaAvatar version 0.1.0:
- Added automatic memory extraction.
- Added automatic user persona extraction and matching.
Install stable AlphaAvatar version from PyPI:
uv venv .my-env --python 3.11
source .my-env/bin/activate
pip install alpha-avatar-agentsInstall latest AlphaAvatar version from GitHub:
git clone --recurse-submodules https://github.com/AlphaAvatar/AlphaAvatar.git
cd AlphaAvatar
uv venv .venv --python 3.11
source .venv/bin/activate
uv sync --all-packagesStart your agent in dev mode to connect it to LiveKit and make it available from anywhere on the internet.
π§© Step 1. Configure Environment Variables
cd AlphaAvatar
# Copy template
cp .env.template .env.devEdit .env.dev and set required environment variables.
π¦ Step 2. Download Required Files
alphaavatar download-filesβ Step 3. Run the Agent
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/voice/pipeline_openai_tools.yaml
# or
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/mm/pipeline_openai_tools.yamlTo see more supported modes, please refer to the LiveKit doc.
To see more examples, please refer to the Examples README
AlphaAvatar is built around a real-time multimodal Agent runtime. It separates streaming interaction, model reasoning, runtime context, plugin state, tool execution, and channel adapters.
AlphaAvatar Runtime
βββββββββββββββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β LiveKit Room β
β audio / text / video / data streams β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β AgentSession β
β real-time turn detection / streaming IO β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β AvatarEngine β
β Realtime Model Pipeline β
β STT / VAD / LLM / TTS / video-ready input β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β Prompt & Runtime Context Layer β
β system prompt / runtime prompt / time / mode β
β memory / persona / plan / reflection / rules β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
ββββββββββββββββββββββββΌββββββββββββββββββββββββ
β β β
βΌ βΌ βΌ
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β Memory Plugin β β Persona Plugin β β Tools Plugins β
β conversation β β user profile β β MCP / RAG β
β tool memory β β speaker / face β β DeepResearch β
β avatar memory β β runtime state β β external APIs β
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β Reflection Pluginβ β Planning Plugin β β Behavior Plugin β
β self-analysis β β tasks / plans β β response policy β
β adaptation β β reminders β β workflow rules β
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β UserPath Workspace β
β local data / cache / logs / memory / RAG β
β persona state / artifacts / user files β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
ββββββββββββββββββββββββ΄ββββββββββββββββββββββββ
β β
βΌ βΌ
ββββββββββββββββββββββββ ββββββββββββββββββββββββ
β Native Web / App β β Channel Adapters β
β voice / text / video β β WhatsApp / future β
β WebRTC / UI β β messaging platforms β
ββββββββββββββββββββββββ ββββββββββββββββββββββββ
AlphaAvatar supports multiple Access Channels, allowing different types of users β from end users to developers β to interact with the system.
AlphaAvatar now provides a browser-based realtime demo interface built on LiveKit.
π Try the Web Demo: https://www.alphaavatar.ai/demo
The Web Demo supports:
- ποΈ Real-time voice interaction
- π¬ Text chat with the Avatar
- π· Camera preview and video-ready interaction
- π Agent audio playback
- π Virtual character / avatar stage
- π§ Full plugin support, including Memory, Persona, RAG, MCP, and DeepResearch
- π Browser timezone metadata, enabling AlphaAvatar to understand local login time
The Web Demo is the recommended way to try AlphaAvatar with a full realtime multimodal experience.
Interact with AlphaAvatar directly inside messaging platforms.
Capabilities:
- π¬ Text-based conversation
- π€ Voice message interaction
- π§° Tool invocation via chat interface
π¦ Channel introduction: README
Make sure AlphaAvatar Agent is already running (see Quick Start above).
ENV_FILE=.env.dev sh examples/channels/start_whatsapp.shπ‘ The WhatsApp channel runs as an independent bridge process and connects to the Agent runtime.
A dedicated AlphaAvatar mobile application providing:
- ποΈ Real-time voice communication
- π Live2D / Virtual character visualization
- π§ Persistent memory & persona
Developers can immediately access AlphaAvatar via the LiveKit Playground.
π https://agents-playground.livekit.io/
After starting your AlphaAvatar server:
- Connect to your LiveKit instance
- Configure the Agent name in the Playground (must match
avatar_name, default:Assistant) to enable Explicit Dispatch. - Connect to the agent room
- Start testing real-time interaction
Supported capabilities:
- ποΈ Voice interaction
- π§ Memory extraction
- π RAG retrieval
- π§° MCP tool invocation
- π Virtual character display
π‘ AlphaAvatar is currently developer-first, with a Web Demo available for realtime interaction.
More user-facing web and mobile experiences are under active development.

