Data Issues in ResearchExplores challenges in data assumptions, biases, and more in research, including incomplete write-ups and frustrations of newcomers.
Water Consumption in GenevaExplores water consumption data in Geneva, including charts on consumption and losses, available datasets, and data processing phases.
Streaming AlgorithmsCovers streaming algorithms, power of two choices, Misra-Gries estimator, and AMS sketch for frequency estimation.
Bots: Wikipedia WikificationDelves into the role of bots in Wikipedia, their wikification of public domain content, and the controversies surrounding their use.
Deanonymization ExerciseExplores deanonymization using public datasets from Netflix, focusing on matching users and evaluating films based on ratings.
Data ProcessingCovers the processing of data from a chemical experiment using Excel.