Redshift tables

bryce · September 14, 2017, 6:35pm

Hello again all,

Is there a definitive source outlining which tables should be created in Redshift?

I was doing some load testing and the load failed because the com_snowplowanalytics_snowplow_mobile_context_1 table wasn’t present. Presumably Avalanche uses a tracker which sets this context?

I found this post with some information, which is helpful.

Also, has there been any discussion around a more graceful way to fail when a Redshift table isn’t present?

Thanks again!

ihor · September 14, 2017, 6:54pm

@bryce,

We use COMMIT when loading data. This means that we deliberately load all the tables in Redshift using a single transaction. This is important because it means that either the complete load succeeds or fails. In the event of failure, recovery is straightforward - if a load was part successful it would be complicated to recover without the risk of introducing duplicates.

The post you found is the only “definitive source” available for now. Do let us know if something is missing from that mapping.

bryce · September 14, 2017, 7:06pm

Thanks @ihor, makes sense.

I was just surprised to see that the Avalanche events didn’t load. Do you happen to know if it needs any additional tables beyond the one I mentioned above?

mike · September 15, 2017, 4:01am

Looks like Avalanche sends through page view and structured events and both of these contain the mobile_context and the client_session context so you’ll want to make sure you’ve got com_snowplowanalytics_snowplow_mobile_context_1 as well as com_snowplowanalytics_snowplow_client_session_1 tables.

bryce · September 15, 2017, 11:02pm

Thanks @mike!

Topic		Replies	Views
Loading Redshift - can't find missing tables in snowplow/snowplow repo Storage targets	3	2253	November 10, 2016
Missing a table for data modelling step For engineers	13	1835	June 14, 2019
Why is the Redshift table definition for a schema not the latest version? Storage targets	14	2973	October 11, 2016
StorageLoader error Storage targets	3	1690	December 13, 2016
Redshift + Snowplow Mini version 4 Snowplow Mini	5	2477	January 23, 2019

Redshift tables

Related topics