uwplse/szalinski

fully implement old evaluation

mwillsey opened this issue · 2 comments

Benchmarks that fail:

  1. card_org

I will open a new issue to investigate the failing tests.