We’re hard at work on the SQL Query Enrichment, which will let Snowplow users dimension-widen their events with the results of arbitrary SQL queries.
After the response to the Clearbit tutorial for the API Request Enrichment, we’d love to do a similar tutorial for the SQL Query Enrichment. Although many use cases for the SQL Query Enrichment will involve internal data (e.g. customer records or product databases), there are no doubt some interesting public datasets that would be fun to join into a Snowplow event stream.
We’d love to get your suggestions for our tutorial here! We are looking for an interesting public dataset which:
- Is already available in MySQL/Postgres format, or is easily converted to one of them (e.g. is published as CSV or maybe JSON)
- Has a key that joins easily onto a fairly standard Snowplow event stream
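
To make the two criteria a bit more concrete, here is a rough Python sketch of the kind of lookup we have in mind. It is not the enrichment itself: it uses an in-memory SQLite database as a stand-in for MySQL/Postgres, a made-up three-row country CSV as the "public dataset", and it assumes the event carries a `geo_country` field to join on. The `load_csv` and `enrich` helpers are hypothetical names for illustration only.

```python
import csv
import io
import sqlite3

# Hypothetical public dataset published as CSV (made-up sample rows).
CSV_DATA = """country_code,country_name,region
GB,United Kingdom,Europe
US,United States,Americas
JP,Japan,Asia
"""


def load_csv(conn, csv_text):
    """Load the CSV into a SQL table keyed on country_code."""
    conn.execute(
        "CREATE TABLE countries ("
        "country_code TEXT PRIMARY KEY, country_name TEXT, region TEXT)"
    )
    rows = [
        (r["country_code"], r["country_name"], r["region"])
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    conn.executemany("INSERT INTO countries VALUES (?, ?, ?)", rows)


def enrich(conn, event):
    """Return a copy of the event widened with the matching row, if any."""
    row = conn.execute(
        "SELECT country_name, region FROM countries WHERE country_code = ?",
        (event.get("geo_country"),),
    ).fetchone()
    enriched = dict(event)
    if row is not None:
        enriched["country_name"], enriched["region"] = row
    return enriched


# SQLite stands in for MySQL/Postgres here (assumption for the sketch).
conn = sqlite3.connect(":memory:")
load_csv(conn, CSV_DATA)
print(enrich(conn, {"event_id": "e1", "geo_country": "GB"}))
```

A dataset fits well when this lookup is a single keyed query against a field most event streams already contain; events with no matching row simply pass through unchanged.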
Suggestions in this thread please! I’ll get the ball rolling with an idea we had internally.