AWS Elasticsearch Service Request Signing

aspensmonster · December 15, 2016, 6:12pm

Howdy everyone. I’m currently in the process of setting up a real-time component for a snowplow pipeline. Presently, the elasticsearch cluster is provided by AWS ElasticSearch Service, rather than an independent ES stack in EC2 (or elsewhere). Stream enriching, and sinking to this cluster, work with my barebones/ad-hoc deployment (single stream enrich instance, single elasticsearch-sink instance) after whitelisting the IP of the sink instance in the AWS ES Service’s access policy.

However, future plans are of course to have apps in appropriate ASGs, with automated deployments. I’ve thought of a couple different approaches for handling automated access to the cluster, such as:

make a static proxy instance with EIP, whitelist this instance in the AWS ES Service, and have any elasticsearch sink app proxy traffic through this instance
Utilize a NAT gateway with an EIP, whitelist that EIP, and ensure sink instances utilize that NAT gateway
Bite the bullet and build out our own ES cluster

None of these are optimal, as each adds maintenance overhead. It’d be much more straightforward if the sink apps were able to sign requests to the AWS ES Service endpoint via an iam role (like how reading/writing the kinesis streams is set up already). With that being said…

Is there any capability for the elasticsearch sink app to utilize request signing when sinking to AWS ElasticSearch Service endpoints? Is there a setting I’ve just missed? Sniffing the traffic just shows raw POSTs to the endpoint with no signature headers.

Is there perhaps some other approach that I haven’t though of?

josh · December 16, 2016, 8:02am

Hi @aspensmonster,

Currently no there is no support for request signing in the Elasticsearch Sink. It would be a much cleaner approach to sinking data to the service in lieu of the fact that we cannot put it into a VPC.

I did find an approach that appears to work with our current Elasticsearch Client library (Jest) with signing on this thread:

I have created a new ticket to track this as I think it would be a great feature to have included! Especially if AWS eventually adds something like the VPC -> S3 endpoint so we can stop traversing the public network to sink events.

The current approach has been the one you have mentioned to use NAT gateways and to whitelist them. This has worked quite well for us at high volumes.

I guess from a security standpoint we have always looked to have our micro-services nested in private subnets and thus hidden from the public internet as much as possible - meaning that we would need to traverse the NAT irrespective of request signing to actually get data to the Elasticsearch Cluster.

nitin_k · May 29, 2017, 6:27am

Hi, We are looking for a similar process of signing Elasticsearch requests (for AWS ElasticSearch) as well. I can see from the github tracker that the item is still open. Is there any progress/milestone decided?

Also, as mentioned,

Can you please describe this alternative approach in detail, so that I can follow it temporarily.

nitin_k · July 26, 2017, 6:05am

Support is now added for AWS ElasticSearch request signing.
snowplow-elasticsearch-loader - version 0.9.0

https://github.com/snowplow/snowplow-elasticsearch-loader/issues/12

(Just posting here for people who are looking for it!)

Topic		Replies	Views
ElasticSearch HTTP Sink failing with AWS ES Signing AWS real-time pipeline	1	2844	March 26, 2020
Can the batch Elasticsearch target sign requests? AWS batch pipeline (Legacy)	2	1210	August 14, 2017
Snowplow with AWS Elastic Search AWS real-time pipeline	2	1477	June 7, 2019
Unable to receive Snowplow data into Elasticsearch Data store sources	14	3424	January 17, 2018
Elasticsearch Loader idle AWS real-time pipeline	1	2213	March 5, 2018

AWS Elasticsearch Service Request Signing

Related topics