Snowbridge - Kinesis shard doesn't exist

Rob_Ellison · November 14, 2024, 4:55pm

Hi,

We ran into issues with Snowbridge reporting the following error:

Failed to pull next Kinesis record from Kinsumer client: shard error (shardId-n) in getShardIterator: ResourceNotFoundException: Shard shardId-n in stream snowplow-analytics-enriched-good under account 123456 does not exist

I think this was caused by the updates to the DynamoDB tables failing and potentially kinesis shard scaling occurring.

The solution for this was to delete the shards in the metadata table that no longer exist.

Is there a cleaner way to do this? Shouldn’t service check for shards as part of startup?

Thanks in advance!
Rob

Colm · November 14, 2024, 7:59pm

Hey @Rob_Ellison,

I have only encountered this error during testing when I’ve made some configuration mistake - for example when I’ve changed the stream I’m reading from, but forgot to purge the DDB table - then I’ll see this error.

The client does check for shards - the way it works is that there is one ‘leader’, which polls the stream for shards at an interval, and updates that table. The other clients (which will be one per pod/snowbridge instance) read from DDB and operate from there.

So I guess you can reach this state if the leader wasn’t able to update DDB correctly, and another client booted up before it did so, attempted to read an old shard ID, and that shard didn’t exist.

But what I’m confused about is how a shard ID already in there returned this error - I don’t know enough about the internals of kinesis but I would have expected to be able to call a method on a shard ID even after that shard ID has been merged to another in a scaling action.

Is it possible that things weren’t operational for long enough for all of the data in the shard to have expired completely?

Topic		Replies	Views
Snowbridge - # Open Connections resulting in Failed to pull next Kinesis record from Kinsumer client: connection reset by peer For engineers	6	125	December 2, 2024
Scala Stream Collector and Kinesis Shards AWS real-time pipeline	1	1800	September 13, 2017
Enrich app stopped processing events For engineers	1	412	February 29, 2024
"Sleeping" kinesis stream shards with latest ES loaders AWS real-time pipeline	8	1348	September 18, 2022
Elasticsearch sink - Application processRecords() threw an exception when processing shard Storage targets	10	2762	July 18, 2017

Snowbridge - Kinesis shard doesn't exist

Related topics