Probabalistic metric space #87

redhog · 2021-04-15T08:00:33Z

Point pairs can be sampled randomly. Two random subsets of all points are chosen, and the distance matrix is calculated only between these two subsets. The size of each subset is set by samples: if < 1, it specifies a fraction of all points; if >= 1, it specifies the number of points in each subset.
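To make the sampling rule above concrete, here is a minimal standard-library sketch. It is not the code in this PR: the names `subset_size` and `sampled_distances`, the `seed` argument, the rounding of fractional sizes, and the independent drawing of the two subsets without replacement are all assumptions made for illustration.

```python
import math
import random


def subset_size(samples, n_points):
    """Translate `samples` into a number of points per subset (hypothetical helper)."""
    if samples <= 0:
        raise ValueError("samples must be positive")
    # Values below 1 are read as a fraction of all points,
    # values of 1 or more as an absolute count.
    size = int(round(samples * n_points)) if samples < 1 else int(samples)
    if size > n_points:
        raise ValueError("samples exceeds the number of points")
    return max(size, 1)


def sampled_distances(coords, samples, seed=None):
    """Return (left_idx, right_idx, dists) for two random subsets of coords."""
    rng = random.Random(seed)
    k = subset_size(samples, len(coords))
    # Assumption: each subset is drawn independently, without replacement.
    left = rng.sample(range(len(coords)), k)
    right = rng.sample(range(len(coords)), k)
    # Distances are computed only between the two subsets (k x k values),
    # not between all pairs of points.
    dists = [[math.dist(coords[i], coords[j]) for j in right] for i in left]
    return left, right, dists
```

With 100 points, `sampled_distances(coords, 0.1)` would compare two subsets of 10 points each, and `sampled_distances(coords, 25)` two subsets of 25 points each, so the cost scales with the subset size rather than with the full point count.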

Closes #48

Only look at this after merging #84

mmaelicke · 2021-04-15T08:10:44Z

wow it's going really fast here...
I'll try to keep up with your pace :)

redhog · 2021-04-15T08:43:18Z

This is all stuff I'm building for a single feature in our software. For a bit of background: We have built (and are using) https://github.com/emerald-geomodelling/EmeraldTriangles to manage triangulated datasets. Among those is topography, and while that is usually delivered as a raster (GeoTIFF) and is easy to sample to the TIN (which is typically built from a grid + all measurement points), sometimes it comes as an existing TIN that we just add the new (grid + measurement) points to. In that case I wanted to use kriging on topo too, not just measurements. But for such a topo (DTM) TIN, the number of points is very large, and I ran into all the things I've been addressing in these two PRs.

redhog · 2021-04-15T08:45:30Z

For now the scikit-gstat kriging is done outside of EmeraldTriangles, but my goal is to have a utility function in that library that uses vertex points with a non-NaN value in some column as observations and kriges to all rows with NaN values in the same column.

codecov · 2021-04-20T08:02:00Z

Codecov Report

Merging #87 (7b63fb9) into master (6d972e9) will increase coverage by 0.08%.
The diff coverage is 93.84%.

@@            Coverage Diff             @@
##           master      #87      +/-   ##
==========================================
+ Coverage   90.92%   91.01%   +0.08%     
==========================================
  Files          15       15              
  Lines        1785     1847      +62     
==========================================
+ Hits         1623     1681      +58     
- Misses        162      166       +4

| Impacted Files | Coverage Δ |
|---|---|
| skgstat/MetricSpace.py | `87.75% <93.33%> (+3.84%)` ⬆️ |
| skgstat/Variogram.py | `97.05% <100.00%> (+0.01%)` ⬆️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

redhog · 2021-04-26T09:28:52Z

@mmaelicke This is ready for merging now whenever you have time :)

mmaelicke

@redhog, looks good to me, nice extension.
I think with two unittests we can bring the coverage back. Both could use the Variogram to instantiate a ProbabilisticMetricSpace with two different sample sizes. That should cover all the new code.
If you want me to do that, just go ahead and assign me to the PR, I think I will find some time towards the end of the week.

redhog · 2021-04-27T12:59:22Z

Btw, @mmaelicke I saw you added me to the citation! Thanks! I made an account with ORCID, so maybe you could update the author list on Zenodo to point to my ID https://orcid.org/0000-0002-8254-1163?

mmaelicke

@redhog, thanks for adding all the stuff, I think everything is addressed. There is one more thing that came to my mind; it should be an easy fix.
Best

mmaelicke

Thanks! Great PR, I really like it.

mmaelicke · 2021-04-29T09:06:13Z

You can merge whenever you want. I will add the changelog and release the new version to PyPI this afternoon.

Use a random subsample of points

67101e7

redhog marked this pull request as draft April 15, 2021 08:00

Egil added 2 commits April 16, 2021 13:21

Bugfix

205bc70

Bugfix

5c0d84b

redhog changed the base branch from master to crosskriging April 20, 2021 07:39

redhog changed the base branch from crosskriging to master April 20, 2021 07:39

redhog marked this pull request as ready for review April 20, 2021 07:39

Egil and others added 2 commits April 20, 2021 10:07

Merge branch 'master' of github.com:mmaelicke/scikit-gstat into probabalistic-metric-space

5395566

Merge branch 'master' into probabalistic-metric-space

7b60c0a

mmaelicke requested changes Apr 27, 2021

Egil added 5 commits April 27, 2021 11:30

Serious bugfix

5a9cdce

Unit test for sampling

cd00ebb

Merge branch 'probabalistic-metric-space' of github.com:mmaelicke/scikit-gstat into probabalistic-metric-space

59e496a

Some more docs

3431860

Cleaner max_dist handling

dcd3c18

ValueError instead of assertion

e0e43bd

mmaelicke requested changes Apr 28, 2021

Better random state handling

7b63fb9

mmaelicke approved these changes Apr 29, 2021

redhog merged commit 2320748 into master Apr 29, 2021

mmaelicke deleted the probabalistic-metric-space branch February 10, 2022 07:34
