Open file descriptor leak
Closed this issue · 3 comments
stephen-soltesz commented
Looking at the Prometheus metrics for process_open_fds
for traceroute, it appears that there is an fd leak. See image below:
After a rollout on the 16th, the fd count has increased steadily on each machine until it reboots.
Only lga0* nodes are shown for convenience, but the pattern is global. It originally started around Nov 8th in staging and Nov 14th in production.
sum by(machine, container, deployment) (process_open_fds{machine=~".*", container=~"traceroute"})
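For anyone who wants to reproduce this kind of growth locally, here is a rough sketch in Python. It is not taken from the traceroute code. It assumes that process_open_fds is derived on Linux by counting the entries under /proc/<pid>/fd, which is my understanding of the standard Prometheus process collector. The names count_open_fds and leak_fds are hypothetical helpers made up for this illustration.

```python
import os


def count_open_fds():
    """Count this process's open file descriptors.

    Assumption: Linux exposes them under /proc/self/fd; /dev/fd is tried
    as a fallback for other Unix-like systems.
    """
    for path in ("/proc/self/fd", "/dev/fd"):
        if os.path.isdir(path):
            return len(os.listdir(path))
    raise OSError("no fd directory available on this platform")


def leak_fds(n):
    """Open n descriptors without closing them, simulating a leak.

    The descriptors are returned so the caller can clean up afterwards.
    """
    return [os.open(os.devnull, os.O_RDONLY) for _ in range(n)]


before = count_open_fds()
leaked = leak_fds(10)
after = count_open_fds()
print("fds opened and never closed:", after - before)

# Closing the descriptors brings the count back to the baseline.
for fd in leaked:
    os.close(fd)
print("back to baseline:", count_open_fds() == before)
```

Sampling a counter like this before and after a single unit of work may help narrow down which code path forgets to close its descriptors.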
stephen-soltesz commented
pboothe commented
Well, dang.