Intro to Transformer Models
Literature
Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Advances in neural information processing systems, 27.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., … & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
The illustrated transformer
Simple transformer LM
Notebooks - Basics
Notebooks - Applications
Notebooks - FineTuning
Slides - Attention Mechanism
Slides - SBERT
Classification with various vectorization approaches
Resources