🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
COMMITS
January 22, 2026
V
Merge pull request #265 from thanhtcptit/master
vpj committed
November 11, 2025
V
link to jax transformer
Varuna Jayasiri committed
September 19, 2025
V
gepa paper highlighted
Varuna Jayasiri committed
August 21, 2025
V
cleanup jax
Varuna Jayasiri committed
V
jax transformer
Varuna Jayasiri committed
August 8, 2025
V
flash attention
Varuna Jayasiri committed
August 1, 2025
V
all comments
Varuna Jayasiri committed
V
all comments
Varuna Jayasiri committed
V
all comments
Varuna Jayasiri committed
V
sm scale log2
Varuna Jayasiri committed
V
backward pass formulas
Varuna Jayasiri committed
July 31, 2025
V
backward pass formulas
Varuna Jayasiri committed
V
flash comments
Varuna Jayasiri committed
V
flash comments
Varuna Jayasiri committed
July 30, 2025
V
Merge remote-tracking branch 'origin/master'
Varuna Jayasiri committed
V
triton flash wip
Varuna Jayasiri committed
July 20, 2025
V
Update readme.md
Varuna Jayasiri committed
V
cleanup save checkpoint
Varuna Jayasiri committed
V
cleanup log activations
Varuna Jayasiri committed
V
cleanup hook model outputs
Varuna Jayasiri committed
V
remove labml_helpers dep
Varuna Jayasiri committed
July 18, 2025
V
Merge pull request #280 from kommentlezz/patch-1
Varuna Jayasiri committed
V
seq length in rope
Varuna Jayasiri committed
V
cleanup some unused imports
Varuna Jayasiri committed
V
remove labml_helpers dependency: replace Module with nn.Module
Varuna Jayasiri committed
December 24, 2024
P
Fix typo: language
Pavel Dmitriev committed