🛡️ ArbiterOS

Language: English | 简体中文

🛡️ ArbiterOS

ArbiterOS: Governance Kernel for AI Agents

ArbiterOS sits beneath or beside your agent runtime to enforce policy, emit authoritative traces, and intercept unsafe actions before side effects occur.

ArbiterOS is not another agent framework. It is a runtime governance layer for agent systems that route model calls through an OpenAI-compatible endpoint.

It focuses on three things first:

Authoritative trace for what the agent planned, called, and returned.
Policy enforcement over parsed instructions, tool calls, and taint-propagated dataflow.
Unsafe action interception before sensitive side effects happen.

ArbiterOS in Agent Systems

Why ArbiterOS

Drop-in governance boundary for agent runtimes that can customize LLM base URL and API key.
Taint-aware policy checks over instruction flow and tool execution.
Full local deployment support across Linux, macOS, and Windows.
Auditable runtime logs and optional Langfuse-based visualization.
Minimal host changes: point the agent runtime to ArbiterOS, then enforce policy before execution.

Supported Integration Pattern

ArbiterOS Kernel currently works best with agent runtimes that can override the model endpoint per request or per profile.

Supported in current repository: OpenClaw, Nanobot, Hermes Agent, and additional parser mappings documented in the Kernel codebase.
Compatible serving pattern: OpenAI-compatible / LiteLLM-based routing.
Default local endpoint after startup: http://127.0.0.1:4000/v1

See Value Quickly

The fastest path is:

Install and start ArbiterOS-Kernel.
Configure one upstream model in ArbiterOS-Kernel/litellm_config.yaml.
Point your agent runtime to http://127.0.0.1:4000/v1.
Run a tool-using workflow and inspect the resulting trace, policy decisions, and runtime logs.

Benchmarks

ArbiterOS has shown strong interception or warning gains in multiple agent safety evaluations:

Native OpenClaw (GPT + Claude): 6.17% -> 92.95%
Agent-SafetyBench (Claude Sonnet 4): 0% -> 94.25%
AgentDojo (GPT-4o): 0% -> 93.94%
WildClawBench (GPT-5.2): 55% -> 100% (warning-oriented metric)

These numbers should be interpreted with their benchmark-specific metric definitions and baselines. In the current repository, benchmark results are presented as headline outcomes rather than a standalone reproducibility pack.

What the Root Installer Does

The root installer helps you bootstrap the Kernel quickly:

verifies required commands (curl, git) and installs uv to user space when needed
ensures Python 3.12+
clones or updates ArbiterOS
installs Kernel dependencies with uv sync --group dev
creates ArbiterOS-Kernel/.env from .env.example when available
guides you through the first model entry in ArbiterOS-Kernel/litellm_config.yaml
configures ~/.openclaw/openclaw.json for the arbiteros provider
generates runnable scripts such as run-kernel.sh / run-kernel.ps1

Project Structure

ArbiterOS-Kernel: the core governance kernel, including instruction parsing, taint propagation, policy checks, replay assets, and runtime hooks.
assets/docs: technical docs for architecture, policy interfaces, registry behavior, new-agent integration, and visualization.
langfuse: optional visualization and observability stack for traces and governance workflows.
scripts: helper scripts for environment generation and local setup.
assets/readme: README images and supporting assets.

If you are new to the codebase, start from ArbiterOS-Kernel as the product core, then assets/docs for architecture and extension details.

Quick Start

Install

# Linux / macOS
git clone https://github.com/cure-lab/ArbiterOS.git
cd ArbiterOS
chmod +x install.sh
./install.sh

# Windows (PowerShell)
git clone https://github.com/cure-lab/ArbiterOS.git
cd ArbiterOS
Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass
.\install-windows.ps1

Start Kernel

# Linux / macOS
./run-kernel.sh

# Windows (PowerShell)
.\run-kernel.ps1

Connect Your Agent Runtime

After the Kernel starts:

Edit ArbiterOS-Kernel/litellm_config.yaml and fill in the upstream model, API key, and base URL.
Point your agent runtime or provider profile to http://127.0.0.1:4000/v1.
Run a tool-using task and inspect the Kernel output under runtime logs or Langfuse.

Optional: Langfuse UI

cd ArbiterOS/langfuse
cp .env.prob.example .env
docker compose -f docker-compose.yml up -d --build

Documentation

Kernel architecture: assets/docs/kernel.md
Policy interface: assets/docs/kernel-policy_interface.md
Registry and taint labels: assets/docs/registry_usage.md
Add support for a new agent: assets/docs/add_new_agent.md
Visualization guide: assets/docs/visualization.md
Documentation index: assets/docs/README.md

Optional: User systemd Service

If you want background auto-restart and simpler day-to-day operation, use a user-level service:

service name: arbiteros-kernel
service file: ~/.config/systemd/user/arbiteros-kernel.service
working directory: ArbiterOS/ArbiterOS-Kernel
start command: uv run poe litellm

Useful commands:

systemctl --user status arbiteros-kernel
journalctl --user -u arbiteros-kernel -f
systemctl --user restart arbiteros-kernel

Roadmap

Near-Term

Reproducible benchmark packaging and clearer metric definitions
Hardening across Linux, macOS, and Windows environments
More policy packs for common risky operations
Better operator-facing trace and policy inspection workflows

Research Direction

Long-term memory protection improvements
Prompt-injection detection with clustered dataflow signals
Self-evolving policy mechanisms
Multimodal model support
Optimize memory and resource management to support long-runing agent

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ ArbiterOS

ArbiterOS: Governance Kernel for AI Agents

ArbiterOS sits beneath or beside your agent runtime to enforce policy, emit authoritative traces, and intercept unsafe actions before side effects occur.

ArbiterOS in Agent Systems

Why ArbiterOS

Supported Integration Pattern

See Value Quickly

Benchmarks

What the Root Installer Does

Project Structure

Quick Start

Install

Start Kernel

Connect Your Agent Runtime

Optional: Langfuse UI

Documentation

Optional: User systemd Service

Roadmap

Near-Term

Research Direction

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 359 Commits
ArbiterOS-Kernel		ArbiterOS-Kernel
assets		assets
langfuse		langfuse
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
install-windows.ps1		install-windows.ps1
install.sh		install.sh

Folders and files

Latest commit

History

Repository files navigation

🛡️ ArbiterOS

ArbiterOS: Governance Kernel for AI Agents

ArbiterOS sits beneath or beside your agent runtime to enforce policy, emit authoritative traces, and intercept unsafe actions before side effects occur.

ArbiterOS in Agent Systems

Why ArbiterOS

Supported Integration Pattern

See Value Quickly

Benchmarks

What the Root Installer Does

Project Structure

Quick Start

Install

Start Kernel

Connect Your Agent Runtime

Optional: Langfuse UI

Documentation

Optional: User systemd Service

Roadmap

Near-Term

Research Direction

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages