Bad Data and Data Engineering: Dissecting Google Play Music Takeout Data using Beam, go, Python, and SQL 2021-02-28: On the joy of inheriting a rather bad dataset - dissecting ~120GB of terrible Google Takeout data to make it usable, using Dataflow/Beam, go, Python, and SQL.
data engineering
linux
bash
go
python
dataflow
beam
big data