Skip to content
View dantp-ai's full-sized avatar

Organizations

@AmiiThinks

Block or report dantp-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dantp-ai/README.md

Howdy, I'm Daniel πŸ‘‹.

I am a Senior Machine Learning Software Engineer.

Focused on reinforcement learning, AI infrastructure, and building reliable and scalable software for AI systems.

Claude Google Assistant Cursor NumPy JAX CUDA Docker PyTorch Python

Current Projects

Reinforcement Learning

  • πŸ’ͺπŸ‹οΈπŸ’§ gym-puddle: Off-policy PAC algorithm implemented on the Puddle World Gymnasium environment using TorchRL

Tooling

  • πŸ§ͺβš™οΈπŸ” AlphaEx: Sweep parameters and dispatch thousands of Slurm jobs from one Python script

Educational

  • πŸ“‰ nabla: Educational numpy implementations of 15 optimizers (SGD β†’ Muon), animated on a 2D saddle & benchmarked on matrix LS.

GitHub Activity

GitHub Contribution Graph

Latest Blog Posts

Pinned Loading

  1. tianshou tianshou Public

    Forked from thu-ml/tianshou

    An elegant PyTorch deep reinforcement learning library.

    Python

  2. clawloop clawloop Public

    Forked from aganthos/clawloop

    Make your agents learn from experience. One protocol for weights, harness and routing.

    Python

  3. deep-rl-algos-methods deep-rl-algos-methods Public

    Jupyter Notebook

  4. minitorch minitorch Public template

    Forked from minitorch/minitorch

    The full minitorch student suite.

    Python

  5. DLRC_2018 DLRC_2018 Public

    Forked from georgosgeorgos/DLRC_2018

    Statistical Models for Robotic Perception

    Jupyter Notebook

  6. gym-puddle gym-puddle Public

    Forked from EhsanEI/gym-puddle

    Continuous grid-world environment for RL using Gymnasium

    Python