Some correctness bugs are not found in CI
awf opened this issue · 0 comments
awf commented
Following on from #1034: some correctness bugs are not caught in CI, because some of our benchmarks are expected to give incorrect results (the embedded_INCORRECT benchmarks).
Maybe we should disable those benchmarks in CI so that we can fail the build if another example breaks?
Or we should test correctness for every benchmark unless it is labelled as "incorrect".
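
A rough sketch of the second option, in case it helps the discussion. Everything here is hypothetical: `BenchmarkResult`, `unexpected_failures` and `ci_exit_code` are illustrative names, not existing code in this repo, and it assumes the expected-incorrect benchmarks can be recognised by `INCORRECT` appearing in their name (as in embedded_INCORRECT).

```python
from dataclasses import dataclass

# Assumption: benchmarks that are expected to be wrong carry this marker
# in their name, as the embedded_INCORRECT benchmarks do.
INCORRECT_MARKER = "INCORRECT"


@dataclass
class BenchmarkResult:
    name: str      # hypothetical benchmark identifier
    correct: bool  # True if the output matched the reference result


def is_expected_incorrect(name: str) -> bool:
    """True if the benchmark is labelled as expected to give wrong results."""
    return INCORRECT_MARKER in name


def unexpected_failures(results):
    """Names of benchmarks that were wrong without being labelled incorrect."""
    return [
        r.name
        for r in results
        if not r.correct and not is_expected_incorrect(r.name)
    ]


def ci_exit_code(results) -> int:
    """Non-zero (failing the build) only when an unlabelled benchmark is wrong."""
    return 1 if unexpected_failures(results) else 0
```

With this shape, the labelled benchmarks still run in CI but no longer mask a failure in any other example.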