Fix regression tests

Question

Opened this issue 2 years ago · 0 comments

On the Nightly build 405 we had 320 failed regression tests. To improve our workflow we should avoid failures of regression tests in the future.

QvasrAbstractionJoinTest: Flaky test, sometimes works and sometimes not.
FastUprTest: Some tests because the result is not equivalent to the expected verdict or we fail to show the equivalence.
MonniauxMapEliminatorTest: Expected verdict broken?
AuxVarInCall.bpl and Overapproximation.bpl contain overapproximation flags and therefore the expected result cannot be safe.

ParallelCompositionGeneratorTest: JAXBException
RundefinitionTest: JAXBException
Tests with toolchain MonniauxAutomizerBpl.xml: Did not find plugin with id "de.uni_freiburg.informatik.ultimate.plugins.icfgtransformation"
MonitoredProcessTest.testProcessToolchainTimeout: Process is still running (works sometimes locally)
SystemTest (from SmtInterpol): Currently not working because of some path issues, was fixed on this branch, this reveals some real issues (ArrayIndexOutOfBoundsException).

MixedAcceptingStates*.ats (NWA DelayedSimulation): Result of operation minimizeNwaPmaxSatDelayedBi is wrong (according to its checkResult method)
MinimizeNwaPmaxSat_TestSuiteUsage05.ats: Operand contains transition twice
MinimzeNwaPmaxSatDelayedTest_simpleExamples.ats: Missing variable
ConceptualProblemDelayed02-NiceMountain.ats: Assertion failed
BothSyncMethods.ats (petri net): Assertion failed
BugBlackAndWhiteProduct.ats and difference-Egypt.ats (petri net): Wrong difference
GlobalWeekForFuture01.ats (petri net): Dummy action is synchronized and therefore leads to a NullPointerException
Kerala-TwoThreadOneRessource-difference.ats (petri net): "Must not add place twice". It looks like it is not handled correctly in the difference, if one place occurs in the minuend and the subtrahend.
minimize_hopcroft_01.ats (tree automata): Result of operation minimizeNftaHopcroft is wrong

rtinconsistency_test5.req: SmtInterpol ignored the timeout there
rtinconsistency_test112.req: Timeout during simplification with SmtInterpol
bug_465_ex03.req: Rt-inconsistent requirements different
~~ex-527-ex01.req, ex_527_ex02.req~~, ex-530-ex01.req, ~~ex-rtinconsistent-04.req~~: Vacuous requirements different
vacuity_test14.req, vacuity_test25.req, vacuity_test30.req: Timeout with Z3 on simplification

SyntaxSupportMixedIntReal2.bpl: PolynomialRelation does not support mixed sorts. The problem is that we do not really support ranking functions for real variables, since most constants and variables created in the termination analysis have the sort int. Therefore we often create terms with inconsistent sorts.
multipleCallSuccessors-01.bpl: State has more than one call successor
C2BoogieRegressionTestSuite: All the failing tests are currently unsupported cases, do we want to keep those as regression tests?
Some tests of the RegressionTestSuite (e.g this) also fail, because we don't support a specific feature.
ConstArray.bpl: SMTLIBException: Const is only supported for infinite index sort (with three settings)
InParamRenaming.c: Issue with renaming parameters that are used in ACSL
NonterminatingForLoopSafe.c: Some settings don't support non-linear arithmetic, we could however just simplify this example, the intention would remain the same.

Several SMT tests are too hard for the underlying solver and should not run in regression tests.
Most of the other tests (e.g. from RegressionTestSuite) simply always fail, because we say unknown or have a timeout (at least for some of the settings/toolchains). We should try to fix them and otherwise find a way to ignore these failures in the test result.