Store data with cryptographic provenance on the Swarm decentralized network. Every upload is hashed, optionally signed by a notary, and can be anchored on-chain — giving you an immutable, verifiable record of what was stored, when, and by whom.
┌─────────────────────────────────────────────────────────┐
│ dataprovenance-app │
│ (Astro + React web UI) │
├──────────────────────────┬──────────────────────────────┤
│ SDK │ CLI │ MCP Server │
│ (TS/JS) │ (Python) │ (AI agent integration) │
└───┬─────────┴──┬─────────┴──────────────┬───────────────┘
│ │ │
├────────────┼────────────────────────┤
│ Chain Clients (direct) │
│ (viem / web3.py) │
├────────────┼────────────────────────┘
│ │ │
│ ┌────────┘ ┌─────────────┘
│ │ │
▼ ▼ ▼
┌─────────────────────────────────────────────────────────┐
│ Provenance Gateway │
│ (FastAPI — swarm_connect) │
│ │
│ stamps, upload/download, notary signing │
└──────────────────────────┬──────────────────────────────┘
│
▼
Swarm Network
(decentralized storage)
── All clients also connect directly ──
Blockchain RPC (Base Sepolia)
DataProvenance smart contracts (Solidity)
All three clients (SDK, CLI, MCP Server) can anchor data hashes directly on-chain via their own chain client. The gateway handles Swarm storage, stamp management, and notary signing — it has no blockchain endpoints.
| Component | Repo | Language | Description |
|---|---|---|---|
| Smart Contracts | ConsentsBasedDataProvenance | Solidity | Consent management and on-chain data provenance (Base Sepolia) |
| SDK | swarm_provenance_SDK | TypeScript | Library for browser and Node.js apps |
| CLI | swarm_provenance_CLI | Python | Command-line tool for uploads, downloads, chain anchoring, and stamp management |
| MCP Server | swarm_provenance_MCP | Python | AI agent integration via Model Context Protocol, with optional chain anchoring |
| App | dataprovenance-app | TypeScript | Web interface for recording, verifying, and tracing data provenance |
| Gateway | swarm_connect | Python | FastAPI server bridging clients to a Swarm Bee node |
Via the gateway (all clients):
- Provenance metadata — every upload wraps data with SHA256 hash, timestamp, and encoding info
- Notary signing — optional EIP-191 cryptographic signature proving data authenticity
- Stamp management — purchase, extend, and monitor postage stamps using duration and size presets
- Free tier — rate-limited access for development and testing (3 requests/min)
- x402 payments — pay-per-request access using USDC on Base chain
- Integrity verification — automatic SHA256 hash check on download
Direct to blockchain (SDK, CLI, and MCP Server):
- Blockchain anchoring — register data hashes on the DataProvenance smart contract (Base Sepolia)
- Transformation lineage — record data transformations and merges, building a provenance DAG
- Storage references — bidirectional lookup between Swarm hashes and external storage identifiers
pnpm add @datafund/swarm-provenanceimport { ProvenanceClient } from '@datafund/swarm-provenance';
const client = new ProvenanceClient();
const result = await client.upload('Hello, World!', { standard: 'v1' });
console.log('Reference:', result.reference);pip install -e "git+https://github.com/datafund/swarm_provenance_CLI.git#egg=swarm-provenance-uploader"# Upload a file
swarm-prov-upload upload --file /path/to/data.txt
# Download and verify
swarm-prov-upload download <swarm_hash> --output-dir ./downloadsOption A — Docker (recommended):
docker pull ghcr.io/datafund/swarm-provenance-mcp:latestOption B — From source:
git clone https://github.com/datafund/swarm_provenance_MCP.git
cd swarm_provenance_mcp
pip install -e .Add to your Claude Desktop config (claude_desktop_config.json):
{
"mcpServers": {
"swarm-provenance": {
"command": "docker",
"args": ["run", "-i", "--rm", "ghcr.io/datafund/swarm-provenance-mcp:latest"],
"env": {
"SWARM_GATEWAY_URL": "https://provenance-gateway.datafund.io",
"CHAIN_ENABLED": "true"
}
}
}
}Or if installed from source, replace "command" / "args" with "command": "swarm-provenance-mcp".
The public gateway is available at https://provenance-gateway.datafund.io. To self-host, see swarm_connect.
This toolkit is in alpha / proof-of-concept stage. Storage on Swarm is rented — data persistence depends on postage stamp validity.
- datafund.io — Datafund
- github.com/datafund — all repos
MIT