Introduction to Data Stream ProcessingCovers the fundamentals of data stream processing, including tools like Apache Storm and Kafka, key concepts like event time and window operations, and the challenges of stream processing.
Deanonymization ExerciseExplores deanonymization using public datasets from Netflix, focusing on matching users and evaluating films based on ratings.
Water Consumption in GenevaExplores water consumption data in Geneva, including charts on consumption and losses, available datasets, and data processing phases.
Data Wrangling and AnalysisCovers a homework assignment on data wrangling and analysis using Python's pandas library for real-world datasets.
Data Wrangling with HadoopCovers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.