Hello, guys
I am trying to use Snowplow with Python and React Native trackers, a Scala Stream Collector and a Stream enrich. The result is then loaded via Firehose into S3. The data in S3 is then accessed by Athena to be queried.
The problem is that the data that comes from my Enrichment process does not fit my atomic.events table. It has 2 or 3 columns more than it should have.
I got the atomic.events create table statement from https://github.com/snowplow/snowplow/blob/master/4-storage/redshift-storage/sql/atomic-def.sql
Is there something wrong with the version of the enricher or collector that I’m using? Or am I using an old version for atomic.events?
So ATOMIC.EVENTS is supposed to receive shredded events instead of enriched events?
How do I do that if I’m using Stream Enrich instead of EmrEtlRunner?
@Alexandre_Rayes, it depends on what ATOMIC.EVENTS you are talking about. When it comes to Redshift then yes, events should be stripped off those 3 fields as the data in them is loaded into separate tables.
To work with enriched data, we offer Snowplow Analytics SDK and you should take into consideration the enriched event’s structure as per the link I provided earlier.