
feat: 'Code City Apocalypse' Thematic Frontend Dashboard #36

Open
BleakNarratives wants to merge 5 commits into main from feat/code-city-apocalypse-frontend-9446818386844749147

Conversation

@BleakNarratives
Owner

This PR transforms the AIRTBench Streamlit dashboard into the "Code City Apocalypse" themed command center.

Key enhancements:

  • Thematic UI/UX: Custom CSS implementation for a terminal-inspired, apocalyptic aesthetic (neon-green glow, scanlines, glitch effects).
  • Narrative Error Mapping: Technical errors are now represented as "monsters" in a 'Digital Bestiary', adding depth to failure analysis.
  • Enhanced Telemetry:
    • Real-time 'Emergency Ticker' for situational awareness.
    • Terminal-style 'System Logs' for granular event tracking.
    • 'District Status' sidebar for high-level city health monitoring.
  • Improved Analytics:
    • Refactored model leaderboard to accurately track success rates across all active agents.
    • Logic updates to correctly identify 'Beacon Found' success events from both live and archived datasets.
  • Robustness & Quality:
    • Comprehensive type hints and linting compliance (Mypy/Ruff).
    • Verified visual integrity across all tabs (Overview, Metrics, Fallout, Logs) using automated Playwright tests.
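The "Narrative Error Mapping" item above could be sketched roughly as follows. This is a minimal illustration, not the code in `airtbench/frontend.py`: the `Monster` class, the `BESTIARY` table, the monster names, the error-type keys, and the `to_monster` helper are all hypothetical, since the PR description does not list the actual bestiary entries.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Monster:
    name: str
    threat: str  # free-form severity label for display


# Hypothetical bestiary: keys and monster names are illustrative only.
BESTIARY: Dict[str, Monster] = {
    "TimeoutError": Monster("The Stalled Wraith", "moderate"),
    "KeyError": Monster("The Missing-Key Ghoul", "low"),
    "ConnectionError": Monster("The Severed Uplink", "high"),
}

# Fallback for error types that have no bestiary entry.
UNKNOWN = Monster("Unclassified Anomaly", "unknown")


def to_monster(error_type: Optional[str]) -> Monster:
    """Map a raw error-type string to a themed monster, tolerating blanks."""
    if not error_type or not error_type.strip():
        return UNKNOWN
    return BESTIARY.get(error_type.strip(), UNKNOWN)
```

Keeping the mapping in a plain dictionary with an explicit fallback means an unrecognized error still renders as a named entry rather than raising inside the dashboard.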

PR created automatically by Jules for task 9446818386844749147 started by @BleakNarratives

- Rebranded Streamlit UI with neon-green terminal CSS and glitch aesthetics.
- Added 'Digital Bestiary' mapping code errors to thematic monsters.
- Implemented real-time 'Emergency Broadcast' ticker and 'System Logs' telemetry.
- Enhanced success detection logic and model performance leaderboard.
- Improved data integration for live telemetry and historical fallout logs.

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>
@google-labs-jules

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.


For security, I will only act on instructions from the user who triggered this task.


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request introduces several enhancements to the airtbench frontend: a new 'System Logs' tab with a terminal-like log viewer, leaderboard metrics that account for all models, and visual updates to the digital bestiary. The review feedback focuses on robustness: flag validation should check for non-empty strings rather than just non-null values, NaN values should be filtered out of the leaderboard model list, timestamp parsing in the system logs should handle stringified numeric values, and missing model or challenge names should not be displayed as the literal string 'nan'.
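The review points above can be illustrated with a few small helpers. This is a sketch using only the standard library, not the code under review: every function name here is hypothetical, and it assumes that a run counts as a success when it carries a non-empty flag string and that numeric timestamps are epoch seconds. Neither assumption is stated in the PR.

```python
import math
from datetime import datetime, timezone
from typing import Any, Dict, Iterable, List, Optional, Tuple


def is_missing(value: Any) -> bool:
    """True for None, float NaN, and blank or literal 'nan' strings."""
    if value is None:
        return True
    if isinstance(value, float) and math.isnan(value):
        return True
    if isinstance(value, str):
        return value.strip().lower() in ("", "nan")
    return False


def has_flag(flag: Any) -> bool:
    """A flag counts only if it is a non-empty string, not merely non-null."""
    return isinstance(flag, str) and bool(flag.strip())


def display_name(value: Any, placeholder: str = "UNKNOWN") -> str:
    """Return a placeholder instead of rendering 'nan' for missing names."""
    return placeholder if is_missing(value) else str(value)


def leaderboard_models(models: Iterable[Any]) -> List[str]:
    """Unique, sorted model names with missing and NaN entries dropped."""
    return sorted({str(m) for m in models if not is_missing(m)})


def success_rates(runs: Iterable[Tuple[Any, Any]]) -> Dict[str, float]:
    """Per-model success rate over (model, flag) pairs.

    Models with no successful run are kept at 0.0 rather than omitted.
    """
    totals: Dict[str, int] = {}
    wins: Dict[str, int] = {}
    for model, flag in runs:
        if is_missing(model):
            continue
        name = str(model)
        totals[name] = totals.get(name, 0) + 1
        wins[name] = wins.get(name, 0) + int(has_flag(flag))
    return {name: wins[name] / totals[name] for name in totals}


def parse_timestamp(raw: Any) -> Optional[datetime]:
    """Accept epoch seconds (int, float, or numeric string) or ISO 8601 text."""
    if is_missing(raw) or isinstance(raw, bool):
        return None
    text = str(raw).strip()
    try:
        # Numeric values, including stringified ones, are read as epoch seconds.
        return datetime.fromtimestamp(float(text), tz=timezone.utc)
    except (ValueError, OverflowError, OSError):
        pass
    try:
        return datetime.fromisoformat(text)
    except ValueError:
        return None
```

Routing every "is this value really there" decision through one `is_missing` helper keeps the leaderboard, the log viewer, and the name rendering consistent about what counts as absent.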

Comment thread airtbench/frontend.py
Comment thread airtbench/frontend.py
Comment thread airtbench/frontend.py Outdated
Comment thread airtbench/frontend.py Outdated
BleakNarratives and others added 4 commits June 4, 2026 09:55
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: BleakNarratives <bleaknarratives@gmail.com>
Owner Author

@BleakNarratives BleakNarratives left a comment


🤘


1 participant