Triton vs. Cutlass vs. ThunderKittens in Flash Attention 3 Oct 1, 2025 • Alex Wa Share on: Coming soon! We develop Triton, Cutlass, and TK kernel implementations of Flash Attention 3 from scratch and test on H100s. <Previous PostWhirlwind of PPO and RLHF for LLMs from scratch >Blog ArchiveArchive of all previous blog posts