prometheus/cloudwatch_exporter

[request]: Cached /metrics result

victoramsantos opened this issue · 3 comments

Use case. Why is this important?

I'm working at a company where we are already hitting the AWS API quota limits for CloudWatch. We are considering ways to reduce these calls without hurting the user experience, such as removing metrics or greatly increasing period_seconds for all metrics.

I want to discuss whether it would be worthwhile to add a caching option to cloudwatch-exporter: a TTL such that additional requests to /metrics are answered from the cache until the TTL expires, at which point the exporter queries CloudWatch again, caches the new answer, and repeats the process.
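Roughly, the idea would be something like the following wrapper around a Prometheus Java client Collector. This is only a hypothetical sketch to illustrate the behavior; the class name and TTL handling don't exist in the exporter today:

```java
import io.prometheus.client.Collector;
import java.util.List;

// Hypothetical sketch: wraps an existing collector and re-serves its last
// result until the TTL expires, so extra /metrics scrapes don't hit CloudWatch.
public class TtlCachingCollector extends Collector {
  private final Collector delegate;      // e.g. the CloudWatch collector
  private final long ttlMillis;
  private List<MetricFamilySamples> cached;
  private long lastCollectedMillis;

  public TtlCachingCollector(Collector delegate, long ttlMillis) {
    this.delegate = delegate;
    this.ttlMillis = ttlMillis;
  }

  @Override
  public synchronized List<MetricFamilySamples> collect() {
    long now = System.currentTimeMillis();
    if (cached == null || now - lastCollectedMillis >= ttlMillis) {
      cached = delegate.collect();       // only this path calls the CloudWatch API
      lastCollectedMillis = now;
    }
    return cached;                       // every other scrape gets the cached answer
  }
}
```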

This could cut our requests in half, since we run 2 Prometheus replicas that both scrape the exporter.

Is this a desirable feature that we could spend some time on?

SuperQ commented

Caching of /metrics would more likely be implemented as caching of specific CloudWatch API calls. For example, ListMetrics caching was added in #453.

Something similar could be done for the actual metric fetching calls as well.
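As a rough sketch of that direction (hypothetical class and names, not what #453 actually implements), each metric-fetching request could be memoized with its own TTL:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Hypothetical sketch: a TTL cache keyed per request, so each metric's
// CloudWatch call is reused across scrapes until its entry goes stale.
public class PerCallTtlCache<K, V> {
  private static final class Entry<V> {
    final V value;
    final long fetchedAtMillis;
    Entry(V value, long fetchedAtMillis) {
      this.value = value;
      this.fetchedAtMillis = fetchedAtMillis;
    }
  }

  private final Map<K, Entry<V>> entries = new ConcurrentHashMap<>();
  private final long ttlMillis;

  public PerCallTtlCache(long ttlMillis) {
    this.ttlMillis = ttlMillis;
  }

  public V get(K key, Function<K, V> fetch) {
    long now = System.currentTimeMillis();
    // Reuse a fresh entry; refresh it in place once the TTL has passed.
    Entry<V> entry = entries.compute(key, (k, existing) ->
        existing != null && now - existing.fetchedAtMillis < ttlMillis
            ? existing
            : new Entry<>(fetch.apply(k), now));
    return entry.value;
  }
}
```

A scrape would then look up each GetMetricStatistics/GetMetricData request through the cache and only go to CloudWatch when that request's entry is stale.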

As a workaround, it's already possible to implement this with any caching reverse proxy. For example, it's pretty easy to do with an EnvoyProxy sidecar. This is what we do in production.

@matthiasr I was thinking TTLs could be configured with more granularity, caching some metric data longer than other data.
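For example, a per-metric option (the cache_ttl_seconds key below is hypothetical, purely to illustrate the idea) could let slow-moving metrics be cached much longer than fast-moving ones:

```yaml
metrics:
  - aws_namespace: AWS/ELB
    aws_metric_name: RequestCount
    period_seconds: 60
    cache_ttl_seconds: 60      # hypothetical: refresh on roughly every scrape
  - aws_namespace: AWS/S3
    aws_metric_name: BucketSizeBytes
    period_seconds: 86400
    cache_ttl_seconds: 3600    # hypothetical: daily metric, safe to cache for an hour
```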