Introduces Natural Language Processing (NLP) and its applications, covering tokenization, machine learning, sentiment analysis, and Swiss NLP applications.
Covers feature extraction, clustering, and classification methods for high-dimensional datasets and behavioral analysis using PCA, t-SNE, k-means, GMM, and various classification algorithms.
Explores methods for information extraction, including traditional and embedding-based approaches, supervised learning, distant supervision, and taxonomy induction.
Introduces the Naive Bayes classifier, covering independence assumptions, conditional probabilities, and applications in document classification and medical diagnosis.
Explores the nearest neighbor classifier method, discussing its limitations in high-dimensional spaces and the importance of spatial correlation for effective predictions.