Lecture

Pretraining Sequence-to-Sequence Models: BART and T5

Related lectures (30)

Natural Language Processing: Understanding Transformers and Tokenization

Provides an overview of Natural Language Processing, focusing on transformers, tokenization, and self-attention mechanisms for effective language analysis and synthesis.

Transformers: Pretraining and Decoding Techniques

Covers advanced transformer concepts, focusing on pretraining and decoding techniques in NLP.

Deep Learning for NLP

Introduces deep learning concepts for NLP, covering word embeddings, RNNs, and Transformers, emphasizing self-attention and multi-headed attention.

Sequence to Sequence Models: Overview and Applications

Covers sequence to sequence models, their architecture, applications, and the role of attention mechanisms in improving performance.

Introduction to Modern Natural Language Processing

Introduces the course on Modern Natural Language Processing, covering its significance, applications, challenges, and advancements in technology.

Pretraining: Transformers & Models

Explores pretraining models like BERT, T5, and GPT, discussing their training objectives and applications in natural language processing.

Transformers in Vision: Applications and Architectures

Covers the impact of transformers in computer vision, discussing their architecture, applications, and advancements in various tasks.

Language Models: From Theory to Computation

Explores the mathematics of language models, covering architecture design, pre-training, and fine-tuning, and emphasizing the importance of both training stages for various tasks.

Machine Translation: Attention Mechanism

Explores the attention mechanism in machine translation, showing how it addresses the bottleneck problem and significantly improves neural machine translation (NMT) performance.

Multilingual NLP: Challenges and Innovations

Covers the importance of multilingual NLP and the challenges in scaling language models.

Contextual Representations: ELMo and BERT Overview

Covers contextual representations in NLP, focusing on the ELMo and BERT architectures and their applications in various tasks.

Transformers: Revolutionizing Attention Mechanisms in NLP

Covers the development of transformers and their impact on attention mechanisms in NLP.

Data Annotation: Collection and Biases in NLP

Addresses data collection, annotation processes, and biases in natural language processing.

Neural Word Embeddings: Learning Representations for Natural Language

Covers neural word embeddings and methods for learning word representations in natural language processing.

Pre-Training: BiLSTM and Transformer

Delves into pre-training BiLSTM and Transformer models for NLP tasks, showcasing their effectiveness and applications.

Transformer: Pre-Training

Explores the Transformer model, from recurrent models to attention-based NLP, highlighting its key components and significant results in machine translation and document generation.

Modern NLP and Ethics in NLP

Delves into advancements and challenges in NLP, along with ethical considerations and potential harms.

BERT: Pretraining and Applications

Delves into BERT pretraining for transformers, discussing its applications in NLP tasks.

Modern NLP: Introduction

Introduces Natural Language Processing and its challenges, advancements in neural models, and course goals. By Antoine Bosselut.

Neural Networks for NLP

Covers modern neural network approaches to NLP, focusing on word embeddings, neural networks for NLP tasks, and future transfer learning techniques.