2025  3

June  2

Exploring Distributed Deep Learning with LizardDist

June 30, 2025 · 1 min · 129 words

Part 1: How I Built My Own (tiny) Distributed Data Parallel Engine (LizarDist)

June 30, 2025 · 6 min · 1258 words

April  1

Adam vs. AdamW: A Practical Deep Dive into Optimizer Differences

April 4, 2025 · 11 min · 2167 words