Hello,
I am struggling to get enrichments working. Currently, it seems like they are just being ignored by the process. Its been broken for a while - but it was working when I first set this up. I dont see anything strange in the logs. Any help gratefully received.
I have a feeling its all the enrichments not working. Not just this one.
Thanks,
Andrew
root@enrich-server-899fb7548-r6jjh:/snowplow/enrichments# cat /snowplow/enrichments/snowplow_enrichment_geo
{
"schema": "iglu:com.snowplowanalytics.snowplow/ip_lookups/jsonschema/2-0-0",
"data": {
"name": "ip_lookups",
"vendor": "com.snowplowanalytics.snowplow",
"enabled": true,
"parameters": {
"geo": {
"database": "GeoLite2-City.mmdb",
"uri": "http://snowplow-hosted-assets.s3.amazonaws.com/third-party/maxmind"
}
}
}
}
root@enrich-server-899fb7548-r6jjh:/snowplow/enrichments# cat /snowplow/config/enrich.hocon
{
"input": {
"type": "Kinesis"
"appName": "snowplow-prod-enrich-kinesis"
"streamName": "snowplow-prod-raw-stream"
"region": eu-west-1
"initialPosition": {
"type": "TRIM_HORIZON"
}
"retrievalMode": {
"type": "Polling"
"maxRecords": 10000
}
"bufferSize": 3
"checkpointBackoff": {
"minBackoff": 100 milliseconds
"maxBackoff": 10 seconds
"maxRetries": 10
}
}
"output": {
"good": {
"type": "Kinesis"
"streamName": "snowplow-prod-enriched-stream"
"region": eu-west-1
"backoffPolicy": {
"minBackoff": 100 milliseconds
"maxBackoff": 10 seconds
"maxRetries": 10
}
"throttledBackoffPolicy": {
"minBackoff": 100 milliseconds
"maxBackoff": 1 second
}
"recordLimit": 500
"byteLimit": 5242880
}
"bad": {
"type": "Kinesis"
"streamName": "snowplow-prod-bad-1-stream"
"region": eu-west-1
"backoffPolicy": {
"minBackoff": 100 milliseconds
"maxBackoff": 10 seconds
"maxRetries": 10
}
"throttledBackoffPolicy": {
"minBackoff": 100 milliseconds
"maxBackoff": 1 second
}
"recordLimit": 500
"byteLimit": 5242880
}
}
"telemetry": {
"disable": true
}
"monitoring": {
"cloudwatch": false
}
}
production>>>andrew@ROG:~$ kubectl logs enrich-server-899fb7548-r6jjh -n snowplow-prod
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.io.FileSystem - Files found in /snowplow/enrichments:
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.config.ParsedConfigs - Parsed config file: ConfigFile(Kinesis(snowplow-prod-enrich-kinesis,snowplow-prod-raw-stream,Some(eu-west-1),TrimHorizon,Polling(10000),3,BackoffPolicy(100 milliseconds,10 seconds,Some(10)),None,None,None),Outputs(Kinesis(snowplow-prod-enriched-stream,Some(eu-west-1),None,BackoffPolicy(100 milliseconds,10 seconds,Some(10)),BackoffPolicy(100 milliseconds,1 second,None),500,5242880,None),None,Kinesis(snowplow-prod-bad-1-stream,Some(eu-west-1),None,BackoffPolicy(100 milliseconds,10 seconds,Some(10)),BackoffPolicy(100 milliseconds,1 second,None),500,5242880,None)),Concurrency(256,1),None,RemoteAdapterConfigs(10 seconds,45 seconds,10,List()),Monitoring(None,MetricsReporters(None,None,true)),Telemetry(true,15 minutes,POST,collector-g.snowplowanalytics.com,443,true,None,None,None,None,None),FeatureFlags(false,false,false),None)
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.config.ParsedConfigs - Parsed Iglu Client with following registries: Iglu Central
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.config.ParsedConfigs - Parsed following enrichments:
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.Run - Initialising resources for snowplow-enrich-kinesis 3.8.0
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.Environment - Running Enrich
[pool-1-thread-1] INFO com.snowplowanalytics.snowplow.enrich.common.fs2.Assets - Initializing (downloading) enrichments assets
[pool-1-thread-1] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseCoordinator - With failover time 10000 ms and epsilon 25 ms, LeaseCoordinator will renew leases every 3308 ms, takeleases every 20050 ms, process maximum of 2147483647 leases and steal 1 lease(s) at a time.
[pool-1-thread-1] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseCoordinator - With failover time 10000 ms and epsilon 25 ms, LeaseCoordinator will renew leases every 3308 ms, takeleases every 20050 ms, process maximum of 2147483647 leases and steal 1 lease(s) at a time.
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Initialization attempt 1
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Initializing LeaseCoordinator
[cats-effect-blocker-0] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Starting lease cleanup thread.
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Starting LeaseCoordinator
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Scheduling periodicShardSync
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Initialization complete. Starting worker loop.
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DeterministicShuffleShardSyncLeaderDecider - Elected leaders: enrich-server-899fb7548-tjccw:43a00eba-aacf-44a9-9c16-6ba217bfbeeb
[LeaseCoordinator-0000] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseTaker - Worker enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 needed 2 leases but none were expired, so it will steal lease shardId-000000000002 from enrich-server-899fb7548-tjccw:43a00eba-aacf-44a9-9c16-6ba217bfbeeb
[LeaseCoordinator-0000] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseTaker - Worker enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 saw 4 total leases, 0 available leases, 2 workers. Target is 2 leases, I have 0 leases, I will take 1 leases
[LeaseCoordinator-0000] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseTaker - Worker enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 successfully took 1 leases: shardId-000000000002
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Created new shardConsumer for : ShardInfo(streamIdentifierSerOpt=Optional.empty, shardId=shardId-000000000002, concurrencyToken=fd16cff0-8f50-4ba0-9cfc-de1bb93d08ea, parentShardIds=[], checkpoint={SequenceNumber: 49638344096864091378147350375438462427874226596766285858,SubsequenceNumber: 0})
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.lifecycle.BlockOnParentShardTask - No need to block on parents [] of shard shardId-000000000002
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.retrieval.polling.PrefetchRecordsPublisher - snowplow-prod-raw-stream:shardId-000000000002 : Starting Prefetching thread and initializing publisher session.
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.retrieval.polling.KinesisDataFetcher - Initializing shard snowplow-prod-raw-stream:shardId-000000000002 with 49638344096864091378147350375438462427874226596766285858
[ShardRecordProcessor-0001] INFO software.amazon.kinesis.leases.KinesisShardDetector - Stream snowplow-prod-raw-stream: listing shards with list shards request ListShardsRequest(StreamName=snowplow-prod-raw-stream)
[LeaseCoordinator-0000] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseTaker - Worker enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 saw 4 total leases, 3 available leases, 1 workers. Target is 4 leases, I have 1 leases, I will take 3 leases
[LeaseCoordinator-0000] INFO software.amazon.kinesis.leases.dynamodb.DynamoDBLeaseTaker - Worker enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 successfully took 3 leases: shardId-000000000003, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Created new shardConsumer for : ShardInfo(streamIdentifierSerOpt=Optional.empty, shardId=shardId-000000000003, concurrencyToken=5311b162-24fa-4569-9831-cbbe3f71c24d, parentShardIds=[], checkpoint={SequenceNumber: 49638344096886392123345881002331294964411069562263437362,SubsequenceNumber: 0})
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.lifecycle.BlockOnParentShardTask - No need to block on parents [] of shard shardId-000000000003
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Created new shardConsumer for : ShardInfo(streamIdentifierSerOpt=Optional.empty, shardId=shardId-000000000001, concurrencyToken=4f911f94-3052-4a2c-bf35-0c75cad3d0fa, parentShardIds=[], checkpoint={SequenceNumber: 49638344096841790632948819756647850734394628978625216530,SubsequenceNumber: 0})
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Created new shardConsumer for : ShardInfo(streamIdentifierSerOpt=Optional.empty, shardId=shardId-000000000000, concurrencyToken=de8c519d-9da8-4358-ad90-12963c3a215f, parentShardIds=[], checkpoint={SequenceNumber: 49638344102528480658574128657990915464399804099708583938,SubsequenceNumber: 0})
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.lifecycle.BlockOnParentShardTask - No need to block on parents [] of shard shardId-000000000001
[ShardRecordProcessor-0001] INFO software.amazon.kinesis.lifecycle.BlockOnParentShardTask - No need to block on parents [] of shard shardId-000000000000
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.retrieval.polling.PrefetchRecordsPublisher - snowplow-prod-raw-stream:shardId-000000000001 : Starting Prefetching thread and initializing publisher session.
[ShardRecordProcessor-0000] INFO software.amazon.kinesis.retrieval.polling.KinesisDataFetcher - Initializing shard snowplow-prod-raw-stream:shardId-000000000001 with 49638344096841790632948819756647850734394628978625216530
[ShardRecordProcessor-0001] INFO software.amazon.kinesis.retrieval.polling.PrefetchRecordsPublisher - snowplow-prod-raw-stream:shardId-000000000003 : Starting Prefetching thread and initializing publisher session.
[ShardRecordProcessor-0001] INFO software.amazon.kinesis.retrieval.polling.KinesisDataFetcher - Initializing shard snowplow-prod-raw-stream:shardId-000000000003 with 49638344096886392123345881002331294964411069562263437362
[ShardRecordProcessor-0002] INFO software.amazon.kinesis.retrieval.polling.PrefetchRecordsPublisher - snowplow-prod-raw-stream:shardId-000000000000 : Starting Prefetching thread and initializing publisher session.
[ShardRecordProcessor-0002] INFO software.amazon.kinesis.retrieval.polling.KinesisDataFetcher - Initializing shard snowplow-prod-raw-stream:shardId-000000000000 with 49638344102528480658574128657990915464399804099708583938
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Current stream shard assignments: shardId-000000000003, shardId-000000000002, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Sleeping ...
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[pool-16-thread-1] INFO software.amazon.kinesis.coordinator.DeterministicShuffleShardSyncLeaderDecider - Elected leaders: enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Current stream shard assignments: shardId-000000000003, shardId-000000000002, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Sleeping ...
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[pool-17-thread-1] INFO software.amazon.kinesis.coordinator.PeriodicShardSyncManager - WorkerId enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 is leader, running the periodic shard sync task
[pool-17-thread-1] INFO software.amazon.kinesis.coordinator.PeriodicShardSyncManager - Skipping shard sync for snowplow-prod-raw-stream due to the reason - Hash Ranges are complete for snowplow-prod-raw-stream
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Current stream shard assignments: shardId-000000000003, shardId-000000000002, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Sleeping ...
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Current stream shard assignments: shardId-000000000003, shardId-000000000002, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Sleeping ...
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[pool-17-thread-1] INFO software.amazon.kinesis.coordinator.PeriodicShardSyncManager - WorkerId enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0 is leader, running the periodic shard sync task
[pool-17-thread-1] INFO software.amazon.kinesis.coordinator.PeriodicShardSyncManager - Skipping shard sync for snowplow-prod-raw-stream due to the reason - Hash Ranges are complete for snowplow-prod-raw-stream
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Current stream shard assignments: shardId-000000000003, shardId-000000000002, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Sleeping ...
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)
[pool-18-thread-1] INFO software.amazon.kinesis.leases.LeaseCleanupManager - Number of pending leases to clean before the scan : 0
[pool-16-thread-1] INFO software.amazon.kinesis.coordinator.DeterministicShuffleShardSyncLeaderDecider - Elected leaders: enrich-server-899fb7548-r6jjh:c0f81ecc-4331-493e-974f-5738fd36bee0
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Current stream shard assignments: shardId-000000000003, shardId-000000000002, shardId-000000000001, shardId-000000000000
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.Scheduler - Sleeping ...
[cats-effect-blocker-0] INFO software.amazon.kinesis.coordinator.DiagnosticEventLogger - Current thread pool executor state: ExecutorStateEvent(executorName=SchedulerThreadPoolExecutor, currentQueueSize=0, activeThreads=0, coreThreads=0, leasesOwned=4, largestPoolSize=5, maximumPoolSize=2147483647)