Trouble with s3distcp in EMR

neelam_bagnial · July 7, 2020, 10:57pm

We are running Snowplow version 112 with Stream Enrich, and for the last two weeks we have been having trouble with S3DistCp.

Sometimes it fails while copying shredded data from HDFS to S3 or while archiving the data.
Most of the time it archives the data but still reports a failure to the EMR job.
In another scenario, while copying data from HDFS to S3 using distcp, the job fails at the reduce step, retries 3 or 4 times, and creates multiple versions of the data in S3.
On another day, EMR failed at the loader step because it was unable to locate one of the JSONPath files, but it worked again on retry.

Is anyone else encountering a similar issue with S3DistCp? Any solutions or recommendations would be highly appreciated, as this issue is impacting our production environment. Thanks in advance!

ihor · July 7, 2020, 11:37pm

Hey @neelam_bagnial, yes, we encounter this issue from time to time as well and I believe our developers are looking into a possible solution. I’m afraid there’s not much that can be done at the moment.

neelam_bagnial · July 8, 2020, 5:55pm

Thanks @ihor for your reply. Any suggestions on how you recover a failed EMR job in such cases? Do you have a recovery plan?

ihor · July 9, 2020, 3:40am

@neelam_bagnial, we follow the recovery strategy as per https://github.com/snowplow/snowplow/wiki/Batch-pipeline-steps.
