dantp-ai

Follow

Daniel Plop dantp-ai

Follow

15 followers · 3 following

Achievements

Achievements

Organizations

dantp-ai/README.md

Howdy, I'm Daniel 👋.

I am a Senior Machine Learning Software Engineer.

Focused on reinforcement learning, AI infrastructure, and building reliable and scalable software for AI systems.

Current Projects

Reinforcement Learning

💪🏋️💧 gym-puddle: Off-policy PAC algorithm implemented on the Puddle World Gymnasium environment using TorchRL

Tooling

🧪⚙️🔁 AlphaEx: Sweep parameters and dispatch thousands of Slurm jobs from one Python script

Educational

📉 nabla: Educational numpy implementations of 15 optimizers (SGD → Muon), animated on a 2D saddle & benchmarked on matrix LS.

GitHub Activity

Latest Blog Posts

Review on the Technical Report: Gemini Robotics 1.5

Pinned Loading

tianshou tianshou Public

Forked from thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

Python
clawloop clawloop Public

Forked from aganthos/clawloop

Make your agents learn from experience. One protocol for weights, harness and routing.

Python
deep-rl-algos-methods deep-rl-algos-methods Public

Jupyter Notebook
minitorch minitorch Public template

Forked from minitorch/minitorch

The full minitorch student suite.

Python
DLRC_2018 DLRC_2018 Public

Forked from georgosgeorgos/DLRC_2018

Statistical Models for Robotic Perception

Jupyter Notebook
gym-puddle gym-puddle Public

Forked from EhsanEI/gym-puddle

Continuous grid-world environment for RL using Gymnasium

Python