Skip to content
View SQLicious's full-sized avatar

Block or report SQLicious

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SQLicious/README.md

Profile Views

Ruby (Roopmathi) Gunna

Senior Data Engineer Β β†’Β  AI Platform Engineer
Databricks Lakehouse Β Β·Β  Azure Β Β·Β  LLM Pipelines Β Β·Β  Agentic RAG Β Β·Β  Banking & Cybersecurity

Open to Work TN Visa


πŸ‘©β€πŸ’» About Me

I'm a Senior Data Engineer (VP-level) with 10+ years building production-grade data platforms across banking, cybersecurity, risk technology, and healthcare β€” and I'm now extending that foundation into AI platform engineering.

Most AI systems fail because the data layer is broken. I build both.

I've led enterprise Lakehouse programs at Tier-1 financial institutions, processing 30M+ records daily across full Bronze/Silver/Gold medallion architectures with real security tooling data. I design the metadata-driven frameworks, governance layers, and CI/CD foundations that teams actually adopt β€” not just POCs that get shelved.

I'm now building LLM pipelines, Agentic RAG systems, and AI-powered DataOps tools on top of the same governed data infrastructure I've spent a decade perfecting β€” bridging the gap between enterprise data platforms and the AI workloads companies are rushing to deploy.

ruby = {
    "shipped":            ["Agentic RAG Knowledge Assistant 🧠", "M&A Oracle (team capstone) 🏦"],
    "currently_building": ["CyberLens πŸ”΅", "QueryForge 🟒", "PipelineGuardian 🟑"],
    "production_stack":   ["Databricks", "Delta Lake", "PySpark", "Azure", "Unity Catalog"],
    "ai_stack":           ["Claude API", "LangGraph", "RAG", "MLflow", "ChromaDB", "RAGAS"],
    "domain_expertise":   ["Banking", "Cybersecurity", "Risk Technology", "Healthcare", "M&A / PE"],
    "open_to":            ["Senior Data Engineer", "AI Platform Engineer", "ML Platform Engineer"],
}

βœ… Shipped β€” Live AI Projects

Real systems. Real code. Running now.

Project What It Does Key Stack Status
🧠 Claude Cert Knowledge Assistant Domain-specific Agentic RAG system with a 3-tier adaptive execution model: Tier 0 (direct LLM), Tier 1 (single-source retrieval), Tier 2 (multi-hop decomposition across sources). Implements Corrective RAG with query rewriting, Self-RAG hallucination grading, semantic caching, and hierarchical parent-child chunking. RAGAS evaluation suite included. LangGraph Β· LangChain Β· ChromaDB Β· OpenAI API Β· RAGAS Β· Brave Search Β· Pydantic βœ… Live
🏦 M&A Oracle (team capstone) [Shipping Apr 20, 2026] Enterprise RAG system for private equity due diligence β€” surfaces contradictions between management earnings calls and SEC filing footnotes. Integrates 7+ data sources (SEC EDGAR 10-K/10-Q, USPTO patents, earnings transcripts) via a Knowledge Graph, RAG Router, and multi-step agentic reasoning. Enterprise-grade observability, audit trails, and Slack notifications. Knowledge Graph Β· RAG Router Β· SEC EDGAR Β· USPTO API Β· LangGraph Β· Agentic AI Β· Multimodal RAG πŸ”¨ Shipping Apr 20, 2026

🚧 Currently Building β€” Data + AI Platform Projects

Bridging 10 years of enterprise Data Engineering into the AI platform layer. Each project applies production-grade patterns from real regulated environments β€” banking, cybersecurity, and healthcare.

Project What It Does Key Stack Status
πŸ”΅ CyberLens Security Data Lakehouse + Agentic RAG platform. Ingests multi-source security telemetry into a Bronze/Silver/Gold medallion architecture, then layers a LangGraph agent that answers natural language questions about enterprise security posture by reasoning over both structured Delta data and unstructured threat intel. Databricks Β· Delta Lake Β· LangGraph Β· Claude API Β· ChromaDB Β· FastAPI Β· MLflow Β· Docker πŸ”¨ Building
🟒 QueryForge Production-grade Text-to-SQL LLMOps platform β€” the open-source version of what Databricks Genie and Snowflake Cortex Analyst are commercializing. Prompt versioning in MLflow, RAGAS evaluation CI/CD that fails builds on accuracy drops, SQL validation layer, and a user feedback loop for continuous improvement. Claude API Β· MLflow Β· RAGAS Β· FastAPI Β· PostgreSQL Β· GitHub Actions Β· Streamlit Β· Databricks πŸ”¨ Building
🟑 PipelineGuardian Agentic DataOps monitor that detects pipeline anomalies (schema drift, volume drops, SLA breaches), gathers lineage context, generates LLM-powered root cause analysis via a 4-node LangGraph workflow, and auto-creates incident tickets with AI-written descriptions β€” turning hours of manual triage into 60-second automated resolution. LangGraph Β· Claude API Β· Delta Lake Β· Docker Β· FastAPI Β· MLflow Β· Streamlit Β· Databricks πŸ”¨ Building

πŸ› οΈ Tech Stack

Data Platforms & Engineering

Databricks Apache Spark Delta Lake Azure Data Factory Snowflake ADLS Gen2

AI & LLM Engineering

Claude API LangGraph LangChain MLflow RAG ChromaDB

Languages

Python SQL T-SQL Spark SQL

Cloud & DevOps

Azure GitHub Actions Docker Azure DevOps Azure Functions

Databases

PostgreSQL Oracle SQL Server Cosmos DB


πŸ“œ Certifications

πŸ”΅ Microsoft 🟠 Databricks 🟀 Anthropic
βœ… Azure Data Engineer Associate (DP-203)
βœ… Azure Data Scientist Associate (DP-100)
βœ… Fabric Analytics Engineer (DP-600)
βœ… Power BI Data Analyst (PL-300)
βœ… Azure AI Fundamentals (AI-900)
βœ… Azure Data Fundamentals (DP-900)
βœ… Azure Fundamentals (AZ-900)
βœ… Lakehouse Fundamentals
πŸ”„ Data Engineer Professional (May 2026)
βœ… Building Single-Agent Apps on Databricks
βœ… GenAI App Deployment & Monitoring
βœ… Building Retrieval Agents on Databricks
πŸ”„ Claude Code Architect Foundations (Apr 2026)
βœ… Claude Code 101
πŸ”„ Building with the Claude API
πŸ”„ Introduction to Agent Skills

πŸ“‘ Right Now β€” April 2026

This is a live job search sprint. I'm building in public.

πŸ”¨ Active Build CyberLens Β· QueryForge Β· PipelineGuardian β€” all three in parallel
πŸ“š Active Learning Databricks Mosaic AI Β· Azure AI Foundry Β· RAG Architect Boot Camp (datasenseai.com)
πŸŽ“ Certs in Progress Databricks Data Engineer Professional Β· Claude Code Architect Foundations
🌍 Available Immediately · New Jersey / Remote / Hybrid
🀝 Looking for Senior DE · AI Platform Engineer · ML Platform Engineer

Profile Views Β  Repos Β  Stack


🌱 Currently Deepening

  • Databricks Mosaic AI β€” Vector Search, Model Serving, AI Gateway, Genie patterns
  • LLMOps β€” Prompt versioning, RAGAS evaluation, agent observability, cost tracking
  • Azure AI Foundry β€” Prompt Flow, Azure AI Search, multi-provider LLM abstraction
  • Agentic System Design β€” Multi-agent orchestration, tool use, context engineering, evaluation harnesses
  • RAG Architect Boot Camp β€” 8-week enterprise RAG solution builder (datasenseai.com)

πŸ“« Let's Connect

I'm actively exploring Senior Data Engineering and AI Platform Engineering roles where I can bridge enterprise data platforms with production AI workloads.

  • 🌍 Location: New Jersey β€” open to remote & hybrid roles across the US
  • πŸ“§ Email: roopmathi.gj@gmail.com
  • πŸ’Ό LinkedIn: linkedin.com/in/roopmathi
  • πŸ‡¨πŸ‡¦ Work Auth: Canadian citizen Β· TN visa Β· processes at US border in 1 day Β· no USCIS wait Β· no lottery


"Most AI systems fail because of the data layer. I build both."

Pinned Loading

  1. Tableau-Project--User-Retention-Analysis-for-Dognition- Tableau-Project--User-Retention-Analysis-for-Dognition- Public

    This repository houses the final capstone project for the "Data Visualization with Tableau" course on Coursera. It features a comprehensive dashboard and an accompanying video presentation, demonst…

  2. Steel-Data-SQL-Python-Challenges Steel-Data-SQL-Python-Challenges Public

    My solutions to a series of engaging SQL challenges designed by Matthew Steel.

    1

  3. Agentic_RAG_Assignment_3 Agentic_RAG_Assignment_3 Public

    A domain-specific AI Agent utilizing Retrieval-Augmented Generation (RAG) to navigate Anthropic’s certification paths and course documentation. Built for RAG Assignment 3. Features End-to_End Imple…

    Python

  4. Microsoft-Azure-Data-Engineering-Associate-DP-203-Professional-Certificate Microsoft-Azure-Data-Engineering-Associate-DP-203-Professional-Certificate Public

    Master designing and implementing data solutions that use Microsoft Azure data services

  5. salesforce-es-data-ai-poc-portfolio salesforce-es-data-ai-poc-portfolio Public

    POC portfolio for Salesforce Employee Success: three solution concepts demonstrating how I think about data engineering, metrics architecture, and agentic AI opportunities for enterprise people ana…

    HTML

  6. SOLVED--super-duper-SQL-Challenges SOLVED--super-duper-SQL-Challenges Public