Explores the use of fast interconnects for scalable co-processing with GPUs in databases, emphasizing the importance of overcoming the transfer bottleneck and reevaluating assumptions for performance improvements.
Covers relational and spatial databases, including storage, management systems, ACID properties, historical typologies, primary and foreign keys, and spatial functions.
Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.