Christian Hollinger
Software Engineering, GNU/Linux, Data, GIS, and other things I like
Home
About
Tags & Stats
Tag: Data Engineering
28
Feb 2021
Bad Data and Data Engineering: Dissecting Google Play Music Takeout Data using Beam, go, Python, and SQL
On the joy of inheriting a rather bad dataset - dissecting ~120GB of terrible Google Takeout data to make it usable, using Dataflow/Beam, go, Python, and SQL.