Over the last couple of months we’ve been updating and extending our wiki documentation. Below are a few recently added articles you might find useful for gaining a better understanding of how the Snowplow pipeline works.
- Collector Logging Format - What do the log files (raw events) produced by each of our collectors look like?
- The Enrichment Process - Get a deeper understanding of what happens during the enrichment process.
- The StorageLoader - An insight into the StorageLoader's role in the ETL process. How do we load the data for user-specific events and contexts into Redshift?
- Batch Pipeline Steps - How do you resume a failed job the right way?

Let us know if there are any other topics you would like us to cover in the wiki.