Data Ingestion Pipelines Without Headaches: 8 simple steps
Data, like wine and cheese, becomes more valuable when combined. However, to combine, you must first retrieve the data and a reliable and scalable manner. This post covers the 8 steps of a data ingestion pipeline and 3 overarching topics to ensure reliability and quality over time.
Continue reading...
Own your web analytics pipeline for €0.02 per day: Snowplow, Terraform, dbt, BigQuery and Docker
Running Snowplow for your (web) analytics pipeline to expensive? Here's a €0.02/day minimal, serverless version of Snowplow open source that you can deploy for your blog or website with Terraform (on GCP/BigQuery) in 5 minutes giving you full ownership of a web and app analytics pipeline from data collection to custom data models (👋 goodbye Google Analytics).
Continue reading...
Automatically Lint and Publish your Snowplow Schemas with Github Actions
Snowplow schemas are a great way to codify expected data in JSON format. Using Github actions you can make them eevn more powerful by automatically checking for typos, validity, and other errors as well as directly publishing them to your production environment with no manual action.
Continue reading...
Why web analytics is still a mess in 2023
Web analytics still feels 'messy' in 2023. Why is it so hard to solve the problem of web analytics? Let's dive into some of the misconceptions that fuel the mess, like the ideas that websites are easy, are visited by people, that web analytics is about tracking poeple, that we have all the tools we need, and that web analytics is actually important.
Continue reading...