Data Wrangling with HadoopCovers data wrangling techniques using Hadoop, focusing on row versus column-oriented databases, popular storage formats, and HBase-Hive integration.
Introduction to Data ScienceIntroduces the basics of data science, covering decision trees, machine learning advancements, and deep reinforcement learning.
General Introduction to Data ScienceOffers a comprehensive introduction to Data Science, covering Python, Numpy, Pandas, Matplotlib, and Scikit-learn, with a focus on practical exercises and collaborative work.
Collaborative Data ScienceCovers collaborative data science tools, big data concepts, Spark, and data stream processing, with tips for the final project.
Statistical Signal ProcessingCovers Gaussian Mixture Models, Denoising, Data Classification, and Spike Sorting using Principal Component Analysis.
Spark Data FramesCovers Spark Data Frames, distributed collections of data organized into named columns, and the benefits of using them over RDDs.