proposal for regression testing by HansVRP · Pull Request #490 · ESA-APEx/apex_algorithms

HansVRP · 2026-04-30T14:33:16Z

Idea for starting to include regression benchmarks.

@JanssenBrm I would also need info on how best to expose it, so that we can keep a log in the service catalogue.

algorithm-services-catalogue · 2026-04-30T14:36:33Z

🔍 Catalogue's Preview Site Deployed

Your changes have been deployed to the preview site:

🔗 Preview URL: https://esa-apex.github.io/apex-algorithms-catalogue-web/pr-preview/pr-490/

This preview will be updated automatically when you push new changes to your PR.


HansVRP · 2026-05-13T07:19:50Z

@JanssenBrm @VictorVerhaert ready to check. I have opted for a more adaptive benchmark where we look at the average and the standard deviation. Depending on the number of successful runs, the benchmark becomes more deterministic.
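
A minimal sketch of the adaptive idea described in this comment, assuming the pass/fail threshold is the mean plus `k` standard deviations and that `k` shrinks as more successful runs accumulate. The function names and the `k` schedule are hypothetical and are not taken from the PR. The commit list also shows a later shift from the mean to the median.

```python
import math
import statistics


def adaptive_k(n: int) -> float:
    """Hypothetical schedule: wide tolerance with few runs, tighter with more."""
    if n < 3:
        return math.inf  # too little history to judge a regression
    if n < 5:
        return 4.0
    if n < 10:
        return 3.0
    return 2.0


def mean_std_threshold(values: list[float]) -> float:
    """Upper bound for a new measurement, given previous successful runs."""
    k = adaptive_k(len(values))
    if math.isinf(k):
        return math.inf  # never fail while the history is too short
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation
    return mean + k * std
```

With three runs costing 8, 10 and 12, the mean is 10 and the sample standard deviation is 2, so under this assumed schedule the threshold would be 10 + 4 * 2 = 18.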

HansVRP · 2026-06-02T07:52:31Z

@JanssenBrm @JeroenVerstraelen @VictorVerhaert all feedback is welcome

VictorVerhaert

Two small optional comments aimed at preventing false failures. One more question: could you try running it with GitHub Actions to see how it behaves in practice?
Otherwise the PR looks clean.

VictorVerhaert · 2026-06-05T12:30:25Z

+        scaled_mad = 1.4826 * _median([abs(v - median) for v in values])
+
+        k = _adaptive_k(min(n, 10))
+        threshold = median + k * scaled_mad


Consider adding an absolute buffer for the cost metric here. I wouldn't let a benchmark fail if it suddenly costs 9 instead of 8.
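
One way the suggested absolute buffer could look, as a sketch only. It assumes the threshold from the diff above (median plus `k` times the scaled MAD) and takes the larger of that and `median + abs_buffer`. The `abs_buffer` parameter and the function name are hypothetical, and `statistics.median` stands in for the PR's `_median` helper, whose implementation is not shown here.

```python
import statistics


def regression_threshold(values, k, abs_buffer=0.0):
    """Upper bound for a metric: robust spread, with an absolute floor on the margin."""
    median = statistics.median(values)
    # 1.4826 scales the MAD so it estimates the standard deviation for normal data.
    scaled_mad = 1.4826 * statistics.median([abs(v - median) for v in values])
    statistical = median + k * scaled_mad
    # Assumption: the buffer acts as a minimum margin, not an extra addition.
    return max(statistical, median + abs_buffer)
```

With a perfectly stable history such as `[8, 8, 8, 8, 8]` the MAD is zero, so without a buffer a cost of 9 would exceed the threshold of 8. With `abs_buffer=2.0` the threshold becomes 10 and a cost of 9 passes.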

VictorVerhaert · 2026-06-05T12:43:18Z

+    )
+
+
+def load_scenario_history(


Consider adding a date cutoff field here, which should equal the `updated` field in the record. That way, when a benchmark gets updated, its performance test history is reset.
Not needed, and it could overcomplicate things, but otherwise we might get a lot of false failures.
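
A sketch of what such a cutoff might look like, assuming each historical run carries an ISO 8601 `timestamp` key and that the record's `updated` value is passed in as a string. The run layout and the function name are hypothetical; only the `updated` field comes from the comment above.

```python
from datetime import datetime


def filter_history_since(runs, updated):
    """Keep only runs recorded at or after the record's `updated` timestamp."""
    cutoff = datetime.fromisoformat(updated)
    return [
        run for run in runs
        if datetime.fromisoformat(run["timestamp"]) >= cutoff
    ]


history = [
    {"timestamp": "2026-05-01T00:00:00+00:00", "cost": 8},
    {"timestamp": "2026-06-01T00:00:00+00:00", "cost": 9},
]
recent = filter_history_since(history, "2026-05-15T00:00:00+00:00")
```

Here only the June run survives the cutoff, so statistics would be computed from runs made after the benchmark definition last changed. The timestamps use explicit `+00:00` offsets because `datetime.fromisoformat` accepts a trailing `Z` only from Python 3.11 onward.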

HansVRP requested review from JanssenBrm and VictorVerhaert April 30, 2026 14:33

VictorVerhaert self-assigned this May 4, 2026

JeroenVerstraelen mentioned this pull request May 5, 2026

[EPIC] APEx openEO UDP benchmark & regression workflow #1

Open

5 tasks

HansVRP force-pushed the hv_regresion_benchmark branch 2 times, most recently from 44710b2 to 16c9681 Compare May 7, 2026 07:58

HansVRP added 7 commits May 13, 2026 09:16

proposal for regression testing

d458989

drop usage

9d1d14f

unit tests

365bc87

fix tests

8f48e4c

from offline discussion'move to rolling average and std and fixed rule for all benchmark runs

7948cca

update

528d261

small fixes

be6e8ed

HansVRP force-pushed the hv_regresion_benchmark branch from 42a8863 to be6e8ed Compare May 13, 2026 07:17

HansVRP requested a review from JeroenVerstraelen May 13, 2026 09:35

HansVRP added 6 commits May 27, 2026 14:16

improve robustness by shifting to median

6acae03

median fix

fd8a4fb

simplify

b2827d3

clean up after discussion

0577b91

fix

f051f4b

clean up tolerance reference

04ba2a5

VictorVerhaert reviewed Jun 5, 2026


HansVRP wants to merge 13 commits into main from hv_regresion_benchmark


2 participants
