
Hey! I'm Alex Wa, a second-year Math and CS double major at Yale and a YES Scholar. My research interests currently span RL, NLP, and computer vision, and I'm especially interested in model architecture and how models work under the hood. Previously, I've done research in geometric algebra (APOLLO Labs), algebraic topology (SUMaC '23), abstract algebra (SUMaC '22), and biostatistics (Emory).
In my free time, I enjoy drawing, drinking coffee or tea, and exploring other ML and math disciplines.
Posts
August 04, 2025
Whirlwind of PPO and RLHF for LLMs from scratch
RLHF with PPO from scratch, plus lots of fine-tuning of GPT-2 models for movie sentiment classification. Transformer environments, adaptive KL control, logit/temperature scaling, whitening, and more. Full implementation here.