Transformer-Implementation-from-Scratch-PyTorch
PublicCustom Implementation of the famous Transformer Architecture from scratch based on the Seminal Paper Attention is All You Need by Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser and Illia Polosukhin