Blog Archive
Other
- December 2026 - a paper a day keeps the rust away
- January 2026 - [WIP] frontier model training methodologies
- December 2025 - activation engineering for privacy protection in LLMs
- November 2025 - combinatorial reasoning environments for LLMs and RL
- August 2025 - whirlwind of PPO and RLHF for LLMs from scratch