The following color codes are used to show diffenences between the latest results and the results before:
failed → success success → failed screenshot changed better runtime (<10%%) worse runtime (>10%%)


result table:
success warning failed

Summary of all successful experiments