I am a Senior Machine Learning Software Engineer.
Focused on reinforcement learning, AI infrastructure, and building reliable and scalable software for AI systems.
- πͺποΈπ§ gym-puddle: Off-policy PAC algorithm implemented on the Puddle World Gymnasium environment using TorchRL
- π§ͺβοΈπ AlphaEx: Sweep parameters and dispatch thousands of Slurm jobs from one Python script
- π nabla: Educational numpy implementations of 15 optimizers (SGD β Muon), animated on a 2D saddle & benchmarked on matrix LS.




