Skip to content

NemoClaw + Dynamo + Nemotron-3 Super FP8#215

Open
antonslutskyms wants to merge 16 commits into
mainfrom
aks-nemoclaw
Open

NemoClaw + Dynamo + Nemotron-3 Super FP8#215
antonslutskyms wants to merge 16 commits into
mainfrom
aks-nemoclaw

Conversation

@antonslutskyms
Copy link
Copy Markdown
Collaborator

Summary
Adds/updates the Azure Kubernetes Service (AKS) workshop under nim-deploy/cloud-service-providers/azure/workshops/aks-nemoclaw. The workshop shows NVIDIA NemoClaw agents using NVIDIA Dynamo inference on-cluster, with nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 as the reference model.

What’s in this tree
README.md — Workshop goals, plain-language governable application surface vs inference plane, how they connect (Mermaid), Azure AKS tables (identity, secrets, policy, GPU, PVC, scaling, LB), economics vs metered APIs (with illustrative charts), disaggregated serving overview, KV routing caveat for this Nemotron disagg recipe, links to upstream Nemotron recipes and recipe README (PVC / model-cache / prerequisites), and install/ops for deploy_nemoclaw_k8s.sh.
deploy_nemoclaw_k8s.sh — Optional Dynamo apply + wait, NemoClaw source checkout, linux/amd64 image build, ACR push with login retry, apply nemoclaw-install manifests with ACR name substitution.
nemoclaw-base/ / nemoclaw-install/ — DinD + workspace pattern, NemoClaw install, endpoint wiring toward Dynamo frontend.
dynamo/ — Vendored Dynamo tree including default dynamo/dynamo/recipes/nemotron-3-super-fp8/sglang/disagg/deploy.yaml (other recipe paths documented: agg / TRT-LLM disagg, etc.).
How to test / review

README renders (tables + Mermaid) on your Git host.

Links to GitHub
ai-dynamo/dynamo
recipe paths resolve.

./deploy_nemoclaw_k8s.sh --help
matches documented flags and env vars.

(If code changed) Run deploy against a dev ACR + AKS with GPU recipe prerequisites satisfied.
Notes / follow-ups
Default inference manifest is SGLang disaggregated; DYNAMO_DEPLOY_MANIFEST can point at other upstream deploy.yaml layouts.
Nemotron KV-aware routing limitations for this disagg path are documented; align with upstream recipe README if behavior changes.
Related
Upstream recipes: https://github.com/ai-dynamo/dynamo/tree/main/recipes/nemotron-3-super-fp8
Recipe README (prereqs, PVC, quick start): https://github.com/ai-dynamo/dynamo/blob/main/recipes/nemotron-3-super-fp8/README.md

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant