Provides an overview of Natural Language Processing, focusing on tokenization, transformers, and the self-attention mechanism that underpins modern language understanding and generation.
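The self-attention mechanism mentioned above can be sketched in a few lines of NumPy; this is a minimal single-head, unbatched illustration, not the full multi-head transformer layer, and the matrix names and toy sizes are assumptions for the example.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence of token vectors."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # pairwise attention logits
    # Row-wise softmax turns logits into attention weights
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V  # each output is a weighted mix of value vectors

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))  # 4 tokens, 8-dim embeddings (toy sizes)
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # (4, 8): one contextualized vector per token
```

Each output row depends on every input token, which is what lets transformers model long-range context in a single layer.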
Discusses the Dirichlet distribution, Bayesian inference, posterior mean and variance, conjugate priors, and predictive distribution in the Dirichlet-Multinomial model.
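The conjugacy described above makes the Dirichlet-Multinomial posterior available in closed form; a minimal sketch follows, where the prior pseudo-counts `alpha` and the observed `counts` are hypothetical values chosen for illustration.

```python
import numpy as np

alpha = np.array([2.0, 3.0, 5.0])  # Dirichlet prior pseudo-counts (hypothetical)
counts = np.array([10, 4, 6])      # observed multinomial counts (hypothetical)

# Conjugacy: posterior is Dirichlet(alpha + counts)
alpha_post = alpha + counts
a0 = alpha_post.sum()

# Posterior mean and marginal variance of each category probability
post_mean = alpha_post / a0
post_var = alpha_post * (a0 - alpha_post) / (a0**2 * (a0 + 1))

# Posterior predictive for the next single draw equals the posterior mean
predictive = alpha_post / a0

print(post_mean)    # e.g. [0.4, 0.2333..., 0.3666...]
print(post_var)
print(predictive)
```

Note how the prior acts as pseudo-counts: the posterior mean smoothly interpolates between the prior mean and the empirical frequencies as the data grow.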