🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
COMMITS
20
in the last year
CONTRIBUTORS
2
active
STARS
66167
total
FORKS
0
total
TOP CONTRIBUTORS
V
Varuna Jayasiri V
vpj RECENT COMMITS
V
link to jax transformer
Varuna Jayasiri
V
gepa paper highlighted
Varuna Jayasiri
V
cleanup jax
Varuna Jayasiri
V
jax transformer
Varuna Jayasiri
V
flash attention
Varuna Jayasiri
V
all comments
Varuna Jayasiri
V
all comments
Varuna Jayasiri
V
all comments
Varuna Jayasiri
V
sm scale log2
Varuna Jayasiri
V
backward pass formulas
Varuna Jayasiri
V
backward pass formulas
Varuna Jayasiri
V
flash comments
Varuna Jayasiri
V
flash comments
Varuna Jayasiri
V
Merge remote-tracking branch 'origin/master'
Varuna Jayasiri
V
triton flash wip
Varuna Jayasiri