D, [2016-05-13T10:32:23.605000 #4125] DEBUG -- : Initializing EMR jobflow
F, [2016-05-13T10:32:27.299000 #4125] FATAL -- :
Press ENTER or type command to continue
ArgumentError (AWS EMR API Error (ValidationException): 1 validation error detected: Value null at 'instances.placement.availabilityZone' failed to satisfy constraint: Member must not be null):
/Users/..mypath../snowplow-emr/snowplow-emr-etl-runner!/gems/elasticity-6.0.7/lib/elasticity/aws_session.rb:33:
in `submit'
any ideas? let me know if more information is needed.
The AWS availability zone is sometimes confused with the AWS region.
Amazon EC2 is hosted in multiple locations world-wide. These locations are composed of regions and Availability Zones. Each region is a separate geographic area. Each region has multiple, isolated locations known as Availability Zones.
– Regions and Zones - Amazon Elastic Compute Cloud
The last character like a, b, c is added to the region as an indication of the availability zone of that region.
While s3:region and emr:region keys are self-explanatory, the emr:placement: requires an availability zone (as opposed to region) and it complements the emr:region.
Additionally, only one key has to be used: either placement: or ec2_subnet_id: but not both. It depends on whether the app is running in VPC or not.