feature request: kafka input zone awareness
danielmotaleite opened this issue · 0 comments
Open this to the logstash github, but later found that the kafka-plugin have their own issue tracking, so moving the issue there:
Describe the enhancement:
Right now logstash kafka input reads from all kafka nodes and partitions, even if in a different zone.
With a new option, kafka input should look to it own zone and find kafkas in the same zone and use then, only fallback to the other zone kafkas if the first one is unavailable/inexistent or if the local kafka partitions are already fully processed, jumping to unconsumed remote zone partitions
This will save costs, having the zone logstash talking to their own zone kafka node will reduce cross-zone traffic. Kafka will rebalance their partitions and replicas between zones and filebeat could use a similar feature to help reduce cross-zone traffic costs
Check this example:
https://amplitude.engineering/reducing-costs-with-az-awareness-efc92bc7113a