This is something to look into, although it may not be a TestCoCa issue.
As reported in [Zulip Thread](#SV-Benchmarks > Execution depending on garbage @ 💬):
I am looking at https://test-comp.sosy-lab.org/2026/results/sv-benchmarks/c/termination-crafted-lit/cstrcspn.c for the first time.
But I do not see how it is possible to generate inputs that force a certain path, because the only inputs are the 2 lengths of the string: the actual execution path will depend on the garbage that the memory block contains.
I have no doubt many similar examples exist.
Am I missing something?
Does this not make the results unreproducible?
Does this not benefit test input generation tools that generate 1000s of random inputs?
Would it not be better if the benchmark initialised the strings by reading characters up to its input length as verifier inputs in a loop?
Has this been discussed before?
The only tool that achieves 100% branch coverage is KLEE: although only TestCoCa reports 100%, Testcov x 4 versions all report 37.5%, which is suspicious.
How can I access the tests generated by KLEE: https://test-comp.sosy-lab.org/2026/results/results-verified/klee.2026-01-06_13-12-15.results.Test-Comp26_coverage-branches.C.coverage-branches.Termination-MainControlFlow.xml.bz2.fixed.xml.bz2.table.html#/table?filter=id_any(value(cstrcspn)) for curiosity?
This is something to look into, although it may not be a TestCoCa issue.
As reported in [Zulip Thread](#SV-Benchmarks > Execution depending on garbage @ 💬):
I am looking at https://test-comp.sosy-lab.org/2026/results/sv-benchmarks/c/termination-crafted-lit/cstrcspn.c for the first time.
But I do not see how it is possible to generate inputs that force a certain path, because the only inputs are the 2 lengths of the string: the actual execution path will depend on the garbage that the memory block contains.
I have no doubt many similar examples exist.
Am I missing something?
Does this not make the results unreproducible?
Does this not benefit test input generation tools that generate 1000s of random inputs?
Would it not be better if the benchmark initialised the strings by reading characters up to its input length as verifier inputs in a loop?
Has this been discussed before?
The only tool that achieves 100% branch coverage is KLEE: although only TestCoCa reports 100%, Testcov x 4 versions all report 37.5%, which is suspicious.
How can I access the tests generated by KLEE: https://test-comp.sosy-lab.org/2026/results/results-verified/klee.2026-01-06_13-12-15.results.Test-Comp26_coverage-branches.C.coverage-branches.Termination-MainControlFlow.xml.bz2.fixed.xml.bz2.table.html#/table?filter=id_any(value(cstrcspn)) for curiosity?