AIQuant — HFT Statistical Arbitrage Framework

AegisFintech · Professional-grade quantitative trading system for BTCUSD on Hyperliquid.

Default: 5-Year Backtest (1825 days · Jan 2021 – Jun 2026 · ~2.63M bars)

The system defaults to 5 years of BTCUSDT 1-minute data from Binance Vision. The table below shows validated results on the most recent 365-day window:

Metric	Value
Sharpe Ratio	+0.625
Total Return	+6.26%
Max Drawdown	-7.87%
Calmar Ratio	0.795
Win Rate	63.5%
Total Trades	19,302
Profit Factor	1.024x
Dataset	530,628 bars · BTC $60k → $126k

Note: The 365-day window includes the Jun–Nov 2025 bear market (BTC -52%). The ML ensemble achieves Sharpe 3.39 on the bull-run period (Mar–Jun 2026) when trained on that window alone. With 5 years of data the model learns multiple full market cycles (2021 bull, 2022 bear, 2023–2024 recovery, 2025–2026 bull).

Quick Start

Google Colab (Recommended)

Click the badge above
Set runtime to GPU (Runtime → Change runtime type → T4 GPU)
Run all cells in order — Step 3 lets you change DAYS (default 1825)

Local Installation

git clone https://github.com/AegisFintech/AIQuant.git
cd AIQuant
pip install -r requirements.txt

Data Setup

Place yearly zip files (provided or downloaded from Binance Vision) into the data/ folder, then run:

# Auto-detects local zips in data/ and extracts them.
# If no zips are found, downloads from Binance Vision automatically.
# Also fetches the last 7 days from Hyperliquid to bridge the gap to today.
python3 scripts/prepare_data.py

# Custom window (default is 1825 = 5 years)
python3 scripts/prepare_data.py --days 365

Accepted zip filenames: BTCUSDT_1m_2021.zip, 21.zip, or any *.zip containing BTCUSDT-1m-YYYY-MM.csv files.

CLI Usage

# Full 5-year ML ensemble backtest (default)
python3 run.py backtest

# Faster mode (skip LSTM, ~3x speedup)
python3 run.py backtest --fast

# Custom window
python3 run.py backtest --days 365

# Live trading with ML model (run backtest first to save model)
python3 run.py live --ml

# Live trading with rule-based strategy
python3 run.py live

Architecture

AIQuant/
├── run.py                          # CLI entry point (backtest + live)
├── aiquant/
│   ├── data/
│   │   └── fetcher.py              # Binance Vision + Hyperliquid data
│   ├── features/
│   │   ├── technical.py            # 92 technical indicators (fast NumPy)
│   │   ├── microstructure.py       # 55 microstructure features
│   │   ├── statarb.py              # 19 StatArb / regime features
│   │   └── gpu_features.py         # CuPy GPU-accelerated feature engineering
│   ├── models/
│   │   ├── gpu_ml.py               # GPU ML: XGBoost + LightGBM + LSTM
│   │   └── ml_signal.py            # ML signal generator
│   ├── execution/
│   │   ├── hyperliquid_trader.py   # Hyperliquid mainnet execution
│   │   ├── ml_live_trader.py       # ML live trading (loads saved model bundle)
│   │   └── live_trader.py          # Rule-based live trading orchestrator
│   ├── risk/
│   │   └── manager.py              # Kelly Criterion + drawdown limits
│   └── utils/
│       └── fast_math.py            # Numba JIT: Hurst, ADF, Kalman, OU
├── scripts/
│   ├── prepare_data.py             # Build Binance Vision dataset
│   ├── train_ml_ensemble.py        # Standalone ML training script
│   └── build_colab.py              # Regenerate AIQuant_Colab.ipynb
├── models/
│   └── ml_live_bundle.pkl          # Saved ML model bundle (after backtest)
├── config/
│   ├── settings.yaml               # System configuration
│   └── ml_best_params.json         # Saved ML best parameters
└── data/raw/                       # OHLCV data (gitignored)

ML Ensemble Pipeline

The backtest uses a walk-forward cross-validation pipeline with no lookahead bias:

Labels — 15-bar forward return, threshold 0.08% net of fees
Feature selection — Top 60 features by mutual information (from 171 total)
Walk-forward folds — Dynamic sizing: targets ~50 folds regardless of dataset length
- 365 days → 60d train / 6d step / ~50 folds
- 1825 days (5y) → 90d train / 30d step / ~57 folds
XGBoost — 200 estimators, depth 5, class-balanced weights, CUDA GPU
LightGBM — 200 estimators, 31 leaves, class-balanced weights, GPU
LSTM + Attention — 30-bar sequences, 2-layer LSTM, 64 hidden units, PyTorch CUDA
Ensemble — XGB 40% + LGB 40% + LSTM 20%
Threshold search — Grid search over long/short confidence thresholds
Model saving — Best model bundle saved to models/ml_live_bundle.pkl for live trading

Data Sources

Source	Coverage	Auth Required
Binance Vision	Monthly CSVs, Jan 2017–present	None
Hyperliquid	Real-time candles (last 7 days)	None (read)

Data scaling by DAYS setting:

DAYS	Files	Approx size	Approx bars
365 (1 year)	12	~40 MB	530k
730 (2 years)	24	~80 MB	1.05M
1095 (3 years)	36	~120 MB	1.58M
1825 (5 years, default)	60	~200 MB	2.63M

Live Trading

ML Live Trading (Recommended — uses trained model)

# Step 1: Train and save the model bundle
python3 run.py backtest

# Step 2: Start ML live trading
python3 run.py live --ml

Rule-Based Live Trading

# Requires Hyperliquid mainnet account
echo "HYPERLIQUID_PRIVATE_KEY=0x..." >> .env
python3 run.py live

Configuration

Edit config/settings.yaml or set environment variables in .env:

pair: BTCUSDT
interval: 1m
kelly_fraction: 0.5          # Half-Kelly position sizing
max_position_pct: 0.25       # Max 25% of capital per trade
max_drawdown_pct: 0.15       # Stop trading at 15% drawdown

License

Apache 2.0 — see LICENSE

Disclaimer

This software is for educational and research purposes only. Cryptocurrency trading involves substantial risk of loss. Past performance does not guarantee future results.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AIQuant — HFT Statistical Arbitrage Framework

Default: 5-Year Backtest (1825 days · Jan 2021 – Jun 2026 · ~2.63M bars)

Quick Start

Google Colab (Recommended)

Local Installation

Data Setup

CLI Usage

Architecture

ML Ensemble Pipeline

Data Sources

Live Trading

ML Live Trading (Recommended — uses trained model)

Rule-Based Live Trading

Configuration

License

Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
aiquant		aiquant
config		config
data/raw		data/raw
logs/paper_trading		logs/paper_trading
results		results
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
AIQuant_Colab.ipynb		AIQuant_Colab.ipynb
LICENSE		LICENSE
QUANT_RESEARCH_REPORT.md		QUANT_RESEARCH_REPORT.md
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py
setup.py		setup.py

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

AIQuant — HFT Statistical Arbitrage Framework

Default: 5-Year Backtest (1825 days · Jan 2021 – Jun 2026 · ~2.63M bars)

Quick Start

Google Colab (Recommended)

Local Installation

Data Setup

CLI Usage

Architecture

ML Ensemble Pipeline

Data Sources

Live Trading

ML Live Trading (Recommended — uses trained model)

Rule-Based Live Trading

Configuration

License

Disclaimer

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages