Covers data science tools, Hadoop, Spark, data lake ecosystems, the CAP theorem, batch vs. stream processing, HDFS, Hive, the Parquet and ORC file formats, and the MapReduce architecture.
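As a rough illustration of the MapReduce architecture named above, the following single-process sketch walks a word count through the three classic phases (map, shuffle, reduce). It is not Hadoop code: the function names `map_phase`, `shuffle_phase`, `reduce_phase`, and `word_count` and the sample input are assumptions chosen for the example, and a real cluster would run the map and reduce steps in parallel across nodes.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key, as the framework does
    between the map and reduce stages."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the list of values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run the three phases in sequence on an in-memory list of documents."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input: two tiny "documents".
counts = word_count(["spark and hadoop", "hadoop and hive"])
print(counts)
```

Keeping the shuffle as its own function makes the data movement between map and reduce explicit, which is the step a distributed framework handles on the user's behalf.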
Covers interactions between raster and vector data layers in GIS, including extracting raster values at specific point locations and calculating summary statistics (zonal statistics) within polygon zones.
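The two operations described above can be sketched with NumPy alone. This is a minimal illustration under simplifying assumptions, not a GIS library workflow: the raster is a hypothetical 4x4 grid whose cell at (row, col) covers x in [col, col+1) and y in [row, row+1), so no geotransform or coordinate reference system is involved, and a cell counts as inside a zone when its center falls inside the polygon. The names `sample_at_points`, `point_in_polygon`, and `zonal_stats` are made up for the example.

```python
import numpy as np

# Hypothetical raster: values 0..15 laid out row by row.
raster = np.arange(16, dtype=float).reshape(4, 4)


def sample_at_points(raster, points):
    """Return the raster value under each (x, y) point, NaN if outside the grid."""
    values = []
    for x, y in points:
        col, row = int(np.floor(x)), int(np.floor(y))
        inside = 0 <= row < raster.shape[0] and 0 <= col < raster.shape[1]
        values.append(raster[row, col] if inside else np.nan)
    return np.array(values)


def point_in_polygon(x, y, polygon):
    """Ray-casting test: count polygon edges crossed by a ray going in +x."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge spans the ray's y level
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def zonal_stats(raster, polygon):
    """Summarize the cells whose centers fall inside the polygon."""
    rows, cols = raster.shape
    mask = np.zeros(raster.shape, dtype=bool)
    for r in range(rows):
        for c in range(cols):
            mask[r, c] = point_in_polygon(c + 0.5, r + 0.5, polygon)
    cells = raster[mask]
    if cells.size == 0:
        return {"count": 0, "mean": None, "min": None, "max": None}
    return {
        "count": int(cells.size),
        "mean": float(cells.mean()),
        "min": float(cells.min()),
        "max": float(cells.max()),
    }


# Point extraction: the third point lies outside the grid.
sampled = sample_at_points(raster, [(0.5, 0.5), (2.2, 1.7), (5.0, 1.0)])

# Zonal statistics for a square zone covering the four central cells.
zone = [(1, 1), (3, 1), (3, 3), (1, 3)]
stats = zonal_stats(raster, zone)
print(sampled, stats)
```

In practice a dedicated GIS library would handle the geotransform, nodata values, and partially covered cells; the cell-center rule used here is only one of several common conventions.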