EMR jobflow failing on Hadoop Enrich step after a few seconds

joaocorreia · April 18, 2016, 5:23pm

Hi Snowplowers,

I have a weird problem with EMR ETL runner. I scheduled it to run every day at 4AM, and it does, at least most of the times.

When it doesn’t run it quits very early in the enrichment step and it doesn’t leave any logs, although I have level: DEBUG in my config file.

Any ideas?

This is my crontab:

Snowplow Analytics

00 4 * * * /snowplow/snowplow-runner-and-loader.sh

alex · April 18, 2016, 6:35pm

Hi Joao,

We have had a similar issue, although mostly with the Hadoop Shred job - we raised it with AWS and heard back:

I’ve been looking at the clusters that you sent with another engineer
here and we believe that this is an instance-controller issue. There are
certain rare situations if there’s a long process name, it can break
instance-controller which looks like what’s happening here. A couple of
your command includes a very long argument and we believe that’s what’s
causing the instability. This exists in emr-4.3.0 and older.

We have been upgrading jobs to emr-4.5.0 and so far so good - incidence of this problem seem to have declined. Can you try the same on your side and report back?

joaocorreia · April 19, 2016, 8:20am

Hi Alex,

Upgrading jobs to 4.5.0 solved the problem.

Thank you!

alex · April 29, 2016, 10:10am

Thanks Joao. For others reading this - we have not seen this error again since upgrading jobs to 4.5.0.

dweitzenfeld · April 29, 2016, 12:45pm

How do I upgrade jobs to 4.5.0?

ihor · April 29, 2016, 5:14pm

Hi @dweitzenfeld,

Change the ami_version to 4.5.0 in config.yml:

  emr:
    ami_version: 4.5.0

Regards,
Ihor

Topic		Replies	Views
Snowplow EMR jobflow error Troubleshooting	6	1675	January 9, 2018
Shred problems using Batch Troubleshooting	1	949	December 5, 2020
Recommended/Supported EMR Versions? Enrichment	3	1197	March 31, 2021
Snowplow ETL on AWS EMR hangs AWS batch pipeline (Legacy)	1	1936	October 26, 2018
EMR failing : Enriched HDFS -> S3: FAILED Troubleshooting	4	2007	April 11, 2017

EMR jobflow failing on Hadoop Enrich step after a few seconds

Snowplow Analytics

Related topics