Indexing in Database SystemsExplores indexing in database systems, covering storage, files, and efficient data retrieval techniques using various types of indexes.
File Organization and IndexingExplores file organization, indexing methods, and database storage design, including record formats, page formats, and index classification.
Data Wrangling with HadoopCovers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.
Water Consumption in GenevaExplores water consumption data in Geneva, including charts on consumption and losses, available datasets, and data processing phases.
Hashing and SortingCovers hashing, sorting, extendible hashing, linear hashing, and external sorting.
Introduction to Data ScienceIntroduces the basics of data science, covering decision trees, machine learning advancements, and deep reinforcement learning.
Time Series ClusteringCovers clustering time series data using dynamic time warping, string metrics, and Markov models.
General Introduction to Big DataCovers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.
Query Operators: Part 1Explores query processing steps, physical plans, pipelined execution, and hashing for projections and joins.