Lecture

Training Strategies for Transformers

Related lectures (23)

Introduces deep learning concepts for NLP, covering word embeddings, RNNs, and Transformers, emphasizing self-attention and multi-headed attention.

Deep Learning: Convolutional Neural Networks

Covers Convolutional Neural Networks, standard architectures, training techniques, and adversarial examples in deep learning.

Deep Learning: Convolutional Neural Networks and Training Techniques

Discusses convolutional neural networks, their architecture, training techniques, and challenges like adversarial examples in deep learning.

Neural Networks: Training and Activation

Explores neural networks, activation functions, backpropagation, and PyTorch implementation.

Transformers in Vision: Applications and Architectures

Covers the impact of transformers in computer vision, discussing their architecture, applications, and advancements in various tasks.

Computer Vision History Recap

Offers a historical overview of computer vision, exploring key developments and influential figures in the field.

Scaling Language Models: Efficiency and Deployment

Covers the scaling of language models, focusing on training efficiency and deployment considerations.

Neural Networks for NLP

Covers modern Neural Network approaches to NLP, focusing on word embeddings, Neural Networks for NLP tasks, and future Transfer Learning techniques.

Neural Networks: Two Layers Neural Network

Covers the basics of neural networks, focusing on the development from two layers neural networks to deep neural networks.

Deep Learning for Question Answering

Explores deep learning for question answering, analyzing neural networks and model robustness to noise.

Transformers: Unifying Machine Learning Communities

Covers the role of Transformers in unifying various machine learning fields.

Neural Taskonomy and Historical Perspectives in Visual Intelligence

Covers Neural Taskonomy, the evolution of neural networks, and historical perspectives in visual intelligence.

Convolutional Neural Networks

Covers Convolutional Neural Networks, including layers, training strategies, standard architectures, tasks like semantic segmentation, and deep learning tricks.

Vision-Language-Action Models: Training and Applications

Delves into training and applications of Vision-Language-Action models, emphasizing large language models' role in robotic control and the transfer of web knowledge. Results from experiments and future research directions are highlighted.

Recurrent Neural Networks: Training and Challenges

Discusses recurrent neural networks, their training challenges, and solutions like LSTMs and GRUs.

Transformers in Vision

Explores the evolution of visual intelligence models, focusing on Transformers and their applications in computer vision and natural language processing.

Neural Networks: Regression and Classification

Explores neural networks for regression and classification tasks, covering training, regularization, and practical examples.

Deep Visual Recognition: Interpretability

Explores deep visual recognition, interpretability, CNN architectures, visual dictionaries, and attention mechanisms.

Deep Learning for NLP

Delves into Deep Learning for Natural Language Processing, exploring Neural Word Embeddings, Recurrent Neural Networks, and Attentive Neural Modeling with Transformers.

Deep Learning: Edge Detection and Neural Networks

Discusses edge detection techniques and the evolution of deep learning in neural networks.