🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
FFN ready for GLU
V
Varuna Jayasiri committed
0bf8951258456e80b8c2b9181ae2c730db9f0c5d
Parent: 559a5e8