Hi,
Our Enrich PubSub deployment (v3.2.3) on Kubernetes is not running properly. When a new pod starts, CPU usage climbs to its limit and then drops to a low level after a few minutes. A few minutes later the cycle repeats: CPU ramps up again and then falls back.
Looking at the PubSub input and output message rates, there are sharp spikes when events are read from the collector-good PubSub. The rate of messages written to the enriched-good PubSub, on the other hand, stays constant, even while CPU is low. Overall, the message rate processed in this configuration is quite low, which is probably related to the low average CPU usage.
collector-good PubSub / read:
enriched-good PubSub / write:
We run Enrich PubSub with the default config:
{
  "input": {
    "type": "PubSub"
    "subscription": "projects/xxx/subscriptions/sp-coll-good-sub"
  }
  "output": {
    "good": {
      "type": "PubSub"
      "topic": "projects/xxx/topics/sp-enriched-good"
    }
    "bad": {
      "type": "PubSub"
      "topic": "projects/xxx/topics/sp-enriched-bad"
    }
  }
  "telemetry": {
    "disable": true
  }
  "monitoring": {
    "metrics": {
      "stdout": {
        "period": "1 minute"
        "prefix": "snowplow.enrich."
      }
    }
  }
}
Any idea how to fix this? We suspect it is related to our config, but since we are running with the defaults we are not sure which parameters to tweak.
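Our guess is that the relevant knobs are the input pull settings and the concurrency block, roughly as sketched below. The key names (parallelPullCount, maxQueueSize, concurrency.enrich, concurrency.sink) are our assumption from the Enrich PubSub configuration reference and we have not verified them against 3.2.3. The values are arbitrary placeholders, not tested settings or recommendations:

```
{
  "input": {
    "type": "PubSub"
    "subscription": "projects/xxx/subscriptions/sp-coll-good-sub"

    # Assumed key: number of parallel pulls from the subscription (placeholder value)
    "parallelPullCount": 2
    # Assumed key: max number of outstanding, not yet acked messages (placeholder value)
    "maxQueueSize": 1000
  }

  # Assumed block: how many events are enriched / batches are sunk concurrently
  "concurrency": {
    "enrich": 64   # placeholder value
    "sink": 3      # placeholder value
  }
}
```

If these are the wrong settings to look at, a pointer to the right ones would already help a lot.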
Thanks in advance!