RDB loader fails after load

Hello,

I’m using the EmrEtlRunner tool to load events into Redshift on a nightly schedule. Enrichment and shredding have been working fine, but lately the RDB loading step has begun failing with what looks like a Hadoop exception, causing the EMR cluster to hang.

The exception happens after shredding (so the shredded/good directory contains data), specifically on the “[rdb_load] Load AWS Redshift enriched events storage Storage Target” step. The logs from this step show that the RDB loader completes the consistency check, finishes loading the detected shredded/good data, reports “VACUUM queries skipped” and “ANALYZE transaction executed”, and only then hits the exception (intermittently - sometimes the job works!). Traceback:

Exception in thread "main" java.util.concurrent.TimeoutException: Futures timed out after [5 seconds]
	at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:223)
	at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:157)
	at scala.concurrent.Await$$anonfun$ready$1.apply(package.scala:169)
	at scala.concurrent.Await$$anonfun$ready$1.apply(package.scala:169)
	at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)
	at scala.concurrent.Await$.ready(package.scala:169)
	at com.snowplowanalytics.snowplow.scalatracker.emitters.id.RequestProcessor.sendSync(RequestProcessor.scala:162)
	at com.snowplowanalytics.snowplow.scalatracker.emitters.id.SyncEmitter.send(SyncEmitter.scala:39)
	at com.snowplowanalytics.snowplow.scalatracker.emitters.id.SyncEmitter.send(SyncEmitter.scala:31)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker$$anonfun$send$1.apply(Tracker.scala:64)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker$$anonfun$send$1.apply(Tracker.scala:64)
	at cats.data.NonEmptyList.traverse(NonEmptyList.scala:231)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker.send(Tracker.scala:64)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker.com$snowplowanalytics$snowplow$scalatracker$Tracker$$track(Tracker.scala:54)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker$$anonfun$trackSelfDescribingEvent$1.apply(Tracker.scala:137)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker$$anonfun$trackSelfDescribingEvent$1.apply(Tracker.scala:137)
	at cats.package$$anon$1.flatMap(package.scala:41)
	at cats.FlatMap$Ops$class.flatMap(FlatMap.scala:21)
	at cats.FlatMap$ToFlatMapOps$$anon$2.flatMap(FlatMap.scala:21)
	at com.snowplowanalytics.snowplow.scalatracker.Tracker.trackSelfDescribingEvent(Tracker.scala:137)
	at com.snowplowanalytics.snowplow.rdbloader.interpreters.implementations.TrackerInterpreter$.trackSuccess(TrackerInterpreter.scala:109)
	at com.snowplowanalytics.snowplow.rdbloader.interpreters.RealWorldInterpreter$$anon$1.apply(RealWorldInterpreter.scala:197)
	at com.snowplowanalytics.snowplow.rdbloader.interpreters.RealWorldInterpreter$$anon$1.apply(RealWorldInterpreter.scala:116)
	at cats.free.Free$$anonfun$foldMap$1.apply(Free.scala:155)
	at cats.free.Free$$anonfun$foldMap$1.apply(Free.scala:153)
	at cats.package$$anon$1.tailRecM(package.scala:43)
	at cats.free.Free.foldMap(Free.scala:153)
	at cats.free.Free$$anonfun$foldMap$1.apply(Free.scala:156)
	at cats.free.Free$$anonfun$foldMap$1.apply(Free.scala:153)
	at cats.package$$anon$1.tailRecM(package.scala:43)
	at cats.free.Free.foldMap(Free.scala:153)
	at com.snowplowanalytics.snowplow.rdbloader.Main$.run(Main.scala:69)
	at com.snowplowanalytics.snowplow.rdbloader.Main$.main(Main.scala:36)
	at com.snowplowanalytics.snowplow.rdbloader.Main.main(Main.scala)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:498)
	at org.apache.hadoop.util.RunJar.run(RunJar.java:239)
	at org.apache.hadoop.util.RunJar.main(RunJar.java:153)

I naively copy/pasted this error into a search and, as suggested on Stack Overflow, increased the spark.sql.broadcastTimeout setting to 300, but this hasn’t fixed the issue.

Totally appreciate any tips/insights, thank you!

Hey @Frank_Fineis!

It looks like RDB Loader “fails” on what is essentially dogfooding of monitoring details back to Snowplow. When the Loader finishes its work, it reports back to the Snowplow collector you have configured in the monitoring.snowplow section of config.yml, and it seems that collector is not responding in time. Unless you specifically need this, it’s completely optional and doesn’t carry much useful information - my guess is that the monitoring.snowplow section was configured by mistake.
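For reference, that section of config.yml looks roughly like this (quoting from memory of the sample config, so the exact keys can differ between releases; the app_id and collector values below are placeholders). Removing or commenting out the snowplow subsection disables this reporting:

    monitoring:
      tags: {}
      logging:
        level: DEBUG
      snowplow:
        method: get
        protocol: http
        port: 80
        app_id: snowplow            # placeholder
        collector: <collector URL>  # the collector the Loader reports its own events to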

What confuses me the most is that it surfaces as an error, while it clearly should be just a warning.

I’ll investigate whether this is still the case in the latest 1.0.0 version, but very likely it is not. There have been a lot of changes since the version you’re using (presumably R32). An important detail is that RDB Loader no longer runs on the EMR cluster (and it has never been a Spark job, by the way, so setting that timeout wouldn’t help anyway). I think you might want to consider upgrading to the latest version.

Hey @anton, thanks for the reply!

Ah ok, interesting point about the monitoring. On the runs that have completed, I’ve noticed a message that appears precisely at the point where the failing jobs fail:

VACUUM queries skipped
ANALYZE transaction executed
Snowplow Tracker [2021-04-21T18:20:43.325Z]: Cannot deliver event to http://<collector URL>:80. Collector responded with 404
Completed successfully

So yeah… it definitely seems like there’s an issue with the monitoring config! I’ll try to either fix it (maybe point it at port 80 - the monitoring port is currently set for HTTPS) or just remove it altogether.
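Concretely, I think the fix is one of these two tweaks to the monitoring.snowplow block (the values are placeholders for our setup, and I’m not 100% sure of the exact keys our release expects):

    # Option 1: make the tracker match how the collector is actually served
    snowplow:
      method: get
      protocol: http              # or https, whichever the collector really listens on
      port: 80                    # or 443 if https
      app_id: snowplow            # placeholder
      collector: <collector URL>  # placeholder

    # Option 2: comment out / delete the whole snowplow subsection,
    # since the reporting is optional anyway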

And thanks for the clarification on Spark vs EMR - sorry, I treat a lot of the Snowplow ETL as a black box and am trying to understand it better. I’ll talk to our data engineer about moving to R35, thanks so much!