full fua testing for `simplify_network()` #41

jGaboardi · 2024-10-14T16:05:50Z

This MR:

resolves full FUA(s) for full algo run #38
xref start filling up testing suite & docstrings #20
xref start testing + refactor [stub issue] #21

jGaboardi · 2024-10-14T16:47:43Z

Failures we are seeing are unfortunately not due to small differences in testing precision, but actual _status difference. For example, in Aleppo:

>       assert_series_equal(known._status, observed._status)

sgeop/tests/test_simplify.py:44: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
testing.pyx:55: in pandas._libs.testing.assert_almost_equal
    ???
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

>   ???
E   AssertionError: Series.index are different
E   
E   Series.index values are different (0.02026 %)
E   [left]:  Index([    0.0,     1.0,     2.0,     3.0,     4.0,     5.0,     6.0,     7.0,
E              8.0,     9.0,
E          ...
E          39658.0, 39659.0, 39660.0, 39666.0, 39668.0, 39669.0, 39670.0, 39676.0,
E          39678.0, 39679.0],
E         dtype='float64', length=39489)
E   [right]: Index([    0.0,     1.0,     2.0,     3.0,     4.0,     5.0,     6.0,     7.0,
E              8.0,     9.0,
E          ...
E          39658.0, 39659.0, 39660.0, 39666.0, 39668.0, 39669.0, 39670.0, 39676.0,
E          39678.0, 39679.0],
E         dtype='float64', length=39489)
E   At positional index 2488, first diff: 2496.0 != 2494.0

testing.pyx:173: AssertionError

I think there is still a good chance that cross-version and -platform installs are contributing to this within sgeop, but it will be much more difficult to solve than simply bumping up the tolerance in testing...

martinfleis · 2024-10-14T17:01:39Z

There seems to be some difference in how something gets computed on Apple Silicon.

jGaboardi · 2024-10-14T17:26:13Z

Yeah Apple Silicon sticks out, but there seem to be differences across all CI environments.

jGaboardi · 2024-10-15T01:50:16Z

@martinfleis after some blood, sweat, & tears I've set up our action to curate the observed simplified network data as artifacts so at least we can easily compare between OS & package versions for results.

martinfleis · 2024-10-15T07:33:38Z

Good job! I think that there won't be a ton of what we could do here as this is most likely due to some precision differences in underlying C code (in Qhull, GEOS...) But good to have the ability to debug.

martinfleis · 2024-10-15T08:21:07Z

I looked into the differences. Vast majority is precision thing. But there are some visible differences here and there. Not sure how much they matter though and if there's any chance we could prevent those.

My proposal is to check the equality of simplified stuff only on one OS in CI.

jGaboardi · 2024-10-15T13:06:29Z

My proposal is to check the equality of simplified stuff only on one OS in CI.

Sounds good. I am thinking latest & Apple Silicon since that's what I have, but I am open to another version/OS combo.

As for non-equality testing of the full FUA simplification, is there anything else we should/could check for?

jGaboardi · 2024-10-15T13:08:03Z

note to self: open another issue to include some documentation about this:

... due to some precision differences in underlying C code (in Qhull, GEOS...)
... is precision thing. But there are some visible differences here and there.

martinfleis · 2024-10-15T13:10:07Z

Sounds good. I am thinking latest & Apple Silicon since that's what I have, but I am open to another version/OS combo.

I'd probably do ubuntu as that is what anyone can get in a container. Apple Silicon is a bit exclusionary.

jGaboardi · 2024-10-15T13:42:29Z

As for non-equality testing of the full FUA simplification, is there anything else we should/could check for?

@martinfleis Do you have anything opinions/ideas for this? I suppose we a try testing .shape but even that might be different. I will try that here in another commit.

martinfleis · 2024-10-15T14:05:52Z

Even shape is different in some cases... I don't know what to check. Rough sum of distances?

jGaboardi · 2024-10-15T15:39:27Z

hmmmm. Let's think more on this.

jGaboardi · 2024-10-15T16:53:21Z

Rough sum of distances?

I'll push this up in a commit and swap out the current "known" simplified for that produced by Ubuntu latest. See where that gets us.

jGaboardi · 2024-10-15T17:28:12Z

We've got 5/7 passing envs by testing against the known results from Ubuntu Python 3.12 latest

jGaboardi · 2024-10-15T18:02:54Z

I want to get the conftest.py started in #27 merged, then build the logic here to only do equality tests from certain CI environments.

jGaboardi added 2 commits October 14, 2024 08:57

curate full FUA testing data

94b5244

add full fua simplify_network() tests

f025a24

jGaboardi added the tests/CI label Oct 14, 2024

jGaboardi self-assigned this Oct 14, 2024

bump 4/5 tolerance to 0.2 – test_simplify_network_full_fua()

a94bc73

jGaboardi added 13 commits October 14, 2024 17:29

try uploading osberved artifacts

1226ca0

try uploading osberved artifacts [2]

34d6a93

try uploading osberved artifacts [3]

8ccc201

try uploading osberved artifacts [4]

5f5f9c5

try uploading osberved artifacts [5]

e2efdaf

try uploading osberved artifacts [6]

0e3568d

try uploading osberved artifacts [7]

343adf4

try uploading osberved artifacts [8]

2118387

try uploading osberved artifacts [9]

1ef22f5

try uploading osberved artifacts [10]

df7af2b

upload osberved artifacts

4a8e5e5

try uploading osberved artifacts [11]

9c40160

try uploading osberved artifacts [12]

3b2c220

known simplified from ci_artifacts-ubuntu-latest-py312_sgeop-latest

08f8d2d

jGaboardi mentioned this pull request Oct 15, 2024

start filling up testing suite & docstrings #20

Open

5 tasks

jGaboardi added 2 commits October 15, 2024 16:22

merge origin/main

aad6672

run equality tests only on Ubuntu + latest/dev CI envs

0593f74

jGaboardi requested a review from martinfleis October 15, 2024 21:06

codecov

be2a3ee

martinfleis approved these changes Oct 16, 2024

View reviewed changes

jGaboardi merged commit 5c033a4 into main Oct 16, 2024
8 checks passed

jGaboardi deleted the GH38_full_fua_testing branch October 16, 2024 13:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

full fua testing for `simplify_network()` #41

full fua testing for `simplify_network()` #41

jGaboardi commented Oct 14, 2024 •

edited

Loading

jGaboardi commented Oct 14, 2024

martinfleis commented Oct 14, 2024

jGaboardi commented Oct 14, 2024

jGaboardi commented Oct 15, 2024

martinfleis commented Oct 15, 2024

martinfleis commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

martinfleis commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

martinfleis commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

full fua testing for simplify_network() #41

full fua testing for simplify_network() #41

Conversation

jGaboardi commented Oct 14, 2024 • edited Loading

jGaboardi commented Oct 14, 2024

martinfleis commented Oct 14, 2024

jGaboardi commented Oct 14, 2024

jGaboardi commented Oct 15, 2024

martinfleis commented Oct 15, 2024

martinfleis commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

martinfleis commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

martinfleis commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

jGaboardi commented Oct 15, 2024

full fua testing for `simplify_network()` #41

full fua testing for `simplify_network()` #41

jGaboardi commented Oct 14, 2024 •

edited

Loading