MystenLabs/fastcrypto

Improve continuous benchmarking with Bencher

epompeii opened this issue · 3 comments

Hey fastcrypto team!
I came across your white paper, and I think you all have built a pretty nice continuous benchmarking site.

I just wanted to reach out because I'm the maintainer of an open source continuous benchmarking tool called Bencher: https://github.com/bencherdev/bencher

It looks like you all currently only benchmark releases, though I may be missing something.
Bencher would allow you to track your benchmarks over time, compare the performance of pull requests, and catch performance regressions before they get merged.

I would be more than happy to answer any questions that you all may have!

Thanks for reaching out, @epompeii! Yea, we don't run the benchmarks on all PRs because it takes a long time to run them. But better reporting and comparison over time does sound very interesting.

I'm curious to hear what your experience is with running benchmarks as part of CI? For us, performance varies quite a lot, which makes it a bit difficult to detect small changes.

Yea, we don't run the benchmarks on all PRs because it takes a long time to run them.

Yeah, this can definitely be a blocker. I think the most common thing I've seen is only running a subset of benchmarks on PRs to at least cover the critical path.
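
For example, here's a minimal sketch assuming a criterion-style harness (which I believe your existing benches already use); the operation and benchmark names are hypothetical. The idea is to keep the critical-path benchmarks under a shared name prefix so the PR job can run just that subset with `cargo bench -- critical_path`, while the full suite still runs on releases.

```rust
use criterion::{black_box, criterion_group, criterion_main, Criterion};

// Hypothetical stand-in for a real fastcrypto operation on the critical path.
fn hot_path_op(input: &[u8]) -> u64 {
    input.iter().map(|&b| b as u64).sum()
}

// All critical-path benchmarks share the "critical_path/" name prefix, so a
// PR job can filter on it: `cargo bench -- critical_path`.
fn critical_path(c: &mut Criterion) {
    let msg = vec![0u8; 1024];
    c.bench_function("critical_path/hot_path_op_1kib", |b| {
        b.iter(|| hot_path_op(black_box(&msg)))
    });
}

criterion_group!(benches, critical_path);
criterion_main!(benches);
```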

I'm curious to hear what your experience is with running benchmarks as part of CI?

There are a few ways to handle this. In order of most to least effective:

  1. Use a bare-metal runner ($100+/month)
  2. Use an instruction-count-based benchmarking harness, in addition to a wall-clock-based one (see the sketch after this list)
  3. Use statistical continuous benchmarking on shared CI runners
  4. Use relative continuous benchmarking on shared CI runners
  5. Run a nightly benchmarking job that then does a git bisect to find performance regressions
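
To illustrate option 2, here's a minimal sketch using the `iai` crate, which runs each benchmark once under Valgrind's Cachegrind and reports instruction counts instead of wall-clock time, so results are much less sensitive to noisy shared runners. The workload below is a hypothetical placeholder rather than a real fastcrypto call.

```rust
use iai::black_box;

// Hypothetical placeholder workload; in a real setup this would call into
// fastcrypto (e.g. hashing or signing a fixed message).
fn placeholder_workload() -> u64 {
    (0..1_000u64).fold(0, |acc, x| acc.wrapping_add(black_box(x)))
}

// `iai` counts instructions via Cachegrind rather than measuring elapsed
// time, which makes runs on shared CI hardware far more deterministic.
iai::main!(placeholder_workload);
```

The main caveat is that the runner needs Valgrind installed, and instruction counts are a proxy for performance rather than a direct wall-clock measurement.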

Thanks! That's great advice.