Covers data science tools, Hadoop, Spark, data lake ecosystems, CAP theorem, batch vs. stream processing, HDFS, Hive, Parquet, ORC, and MapReduce architecture.
Explores data privacy challenges and perspectives in eHealth research, focusing on GDPR compliance, sensitive health data management, and decentralized agents.