Replies: 2 comments
-
At some point, Kafka changed how offset synchronization works. By default, it now synchronizes offsets only every 100 or so records. I think that makes it behave better in some situations, but it gives you a bigger offset latency. I do not remember which version introduced this change, but maybe that is what you are seeing? I think there is an option to configure it: https://github.com/strimzi/strimzi-kafka-operator/blob/main/examples/mirror-maker/kafka-mirror-maker-2-sync-groups.yaml#L30 ... but I am not sure whether changing it is recommended.
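As a rough sketch of where such a setting might go in a Strimzi `KafkaMirrorMaker2` resource: this assumes the option in question is the source connector's `offset.lag.max` (default 100), which may not be exactly what the linked line shows. The cluster aliases and the value `10` are placeholders for illustration, not a recommendation. The two checkpoint intervals are the ones mentioned in the question.

```yaml
# Hypothetical fragment of a KafkaMirrorMaker2 spec; not taken from the linked example.
spec:
  mirrors:
    - sourceCluster: "source"   # placeholder alias
      targetCluster: "target"   # placeholder alias
      sourceConnector:
        config:
          # Assumption: a value below the default of 100 makes offset syncs
          # more frequent, at the cost of more writes to the offset-syncs topic.
          offset.lag.max: 10
      checkpointConnector:
        config:
          sync.group.offsets.enabled: "true"
          sync.group.offsets.interval.seconds: 30
          refresh.groups.interval.seconds: 90
```

Whether lowering the value is advisable on a high-traffic cluster is a separate question, since each extra sync is an extra record written.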
-
For others who might run into this: I asked about it on the Kafka mailing list and got this reply.
-
We've been running MM2 for years now in active/passive mode, and it has generally been pretty stable. When we moved from 3.4.0 to 3.5.1, we started getting lag alerts on the passive cluster. We moved back to 3.4.0 and MM2 started working again as before. We then tried 3.6.1 and hit the same issue. It didn't seem to matter whether it was the low-traffic dev clusters or the high-traffic prod clusters.
Here's the snippet of config I think is most applicable.
We tried adjusting sync.group.offsets.interval.seconds: 30 and refresh.groups.interval.seconds: 90, but that didn't seem to move the needle. We had considered offset.lag.max, but we already had groups with a lag of more than 100 on the passive side. Is there something else we could be missing config-wise? Something that was deprecated or needed to be changed when moving from 3.4.0? Is this something that has been fixed in 3.6.2 or 3.7.1?