logo AAUBS Data Science 2023
  • Info, Schedule & Co
    • Modules
    • Literature & Resources
    • Semester Schedule
    • Semester Project Requirements
  • 1. Applied Data Science and Machine Learning
    • A) Introduction to Data Science (W35-36)
      • - Welcome Students!
      • - Data Handling and Manipulation
      • - Exploratory Data Analysis and Essential Statistics
      • - Data Visualization in Data Science
      • Mathematics (Brushup)
    • B) Rapid Prototyping (W37)
      • Introduction to Streamlit: Building an Employee Attrition Dashboard
      • Streamlit Development & Running Offline
      • - Real World Data to Online Dashboard
    • C) Intro to Unsupervised Machine Learning (W38)
      • - Introduction to Unsupervised ML
      • - Recommendation and Similarity Search
      • - Introduction to Clustering: K-means and Hierarchical Approaches
    • D) Intro to Supervised Machine Learning (W39)
      • - Introduction to Supervised ML
      • - SML - Further topics
      • - Time Series Forecasts
  • 2. Network Analysis & NLP
    • Natural Language Processing
      • Basics of NLP
      • NLP Applications Chatbot
    • Network Analysis
      • Basics Network Analysis
      • 2 Mode Networks
      • NW Exercises
      • NW Cases
  • 3. Applied Deep Learning and Artificial Intelligence
    • Intro to Traditional Deep Learning
    • Group Assignment 1
    • Intro to Transformer Models
    • Group assignment 2
    • Intro to GPT Models
    • Group Assignment 3
    • Intro to Graph Neural Networks
    • Group Assignment 4
  • 4. Data Engineering and Machine Learning Operations in Business
    • Lecture 1 Introduction to Serverless ML and Databases
    • Lecture 2 Refactoring & First Serverless App
    • Lecture 3: Credit Card Prediction Service Project
    • Lecture 5 Feature Selection, Batch Inference Pipelines, Model Registry
    • Lecture 6 Feature Selection, Batch Inference Pipelines, Model Registry
    • Bonus Workshop: Using LLMs in your applications

  • Clear History

Built with from Grav and Hugo

Social / Business Data Science 2023 > Applied Deep Learning and Artificial Intelligence > Intro to Transformer Models

Intro to Transformer Models

Literature

Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Advances in neural information processing systems, 27.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., … & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.

The illustrated transformer

Simple transformer LM

Notebooks - Basics

  • Transformer Models - Basics

Notebooks - Applications

  • TM Applications - SBERT
  • TM Applications - HF
  • Simple transformer LM
  • SBERT for Patent Search using PatentSBERTa in PyTorch

Notebooks - FineTuning

  • TM FineTuning - SimpleTransformers
  • TM FineTuning - SBERT
  • TM FineTuning - HF
  • SetFit Hatespeech vs bert and distilroberta
  • Seq2Seq - Neural Machine Translation

Slides - Attention Mechanism

Slides - SBERT

Classification with various vectorization approaches

  • TF-IDF and W2V Multi-Class Text Classification
  • BERT Multi-Class Text Classification
  • Implementing Multi-Class Text Classification LSTMs using PyTorch

Resources

  • OG SBERT-Paper Reimers, N., & Gurevych, I. (2019). Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084.
  • SBERT Docu
  • NLP with SBERT - an ebook/course on the use of dense vectors (with SBERT for business applications)
  • SBERT-Training Tutorial
  • BERTopic - a framework for topic modelling with SBERT embeddings
  • Milvus - Vector database