Explores the evolution of visual intelligence models, focusing on Transformers and their applications in computer vision and natural language processing.
Covers convolutional neural networks (CNNs): their layers, training strategies, standard architectures, tasks such as semantic segmentation, and practical deep learning tricks.
Introduces the fundamentals of deep learning, covering neural networks, CNNs, special layers, weight initialization, data preprocessing, and regularization.