Alex Wa

Hey! I'm Alex Wa, a 2nd year Math and CS double major at Yale and a YES Scholar. My research interests currently span RL, NLP, and computer vision, and I'm especially interested in model architecture and how models work under the hood. Previously, I've done research in geometric algebra (APOLLO Labs), algebraic topology (SUMaC '23), abstract algebra (SUMaC '22), and biostatistics (Emory).

In my free time, I enjoy drawing, drinking coffee or tea, and exploring other ML and math disciplines.

Posts

Whirlwind of PPO and RLHF for LLMs from scratch

RLHF with PPO from scratch and lots of fine-tuning GPT-2 models for movie sentiment classification. Transformer environments, adapative KL control, logit/temperature scaling, whitening, and more. Full implementation here.