implement the nugget on the diagonal to prevent exact interpolation #30

kvanlombeek · 2016-12-20T08:55:59Z

As suggested already in the code, implement the option that the variance is taken into account when you krige.

rth · 2016-12-20T09:53:58Z

@kvanlombeek Could you please explain a bit more what you mean by "taken into account", and in which section it is suggested in the code? I couldn't find any reference to it in the code (apart the fact that variance is calculated)... Thanks!

Edit: missed the description in the title again ) Could you explain a bit more "implement the sill on the diagonal to prevent exact interpolation"?

kvanlombeek · 2016-12-21T07:42:30Z

This is what I found in the code, and is exactly what I would like to implement:

    In the future, the code may include an extra 'exact_values' boolean flag that can be
    adjusted to specify whether to treat the measurements as 'exact'. Setting the flag
    to false would indicate that the variogram should not be forced to be zero at zero distance
    (i.e., when evaluated at data points). Instead, the uncertainty in the point will be
    equal to the nugget. This would mean that the diagonal of the kriging matrix would be set to
    the nugget instead of to zero.

Realized I made some mistakes in the description, sorry about that.

basaks · 2016-12-21T09:19:12Z

This is actually useful and is analogous to adding measurement noise to the observations.

bsmurphy · 2016-12-21T22:52:40Z

Yeah, this is one of my long-term goals... shouldn't be that hard to implement. I think it would basically just require (1) adding a new internal variable into the init function(s) and (2) adding one statement to set the diagonal of the kriging matrix to the nugget. Might be another sneaky addition that would need to be made, but I don't think it would require much more than that. @kvanlombeek, would you be willing to take out a pull request for this? Otherwise I can work on it in the coming weeks...

kvanlombeek · 2016-12-22T10:49:17Z

Yes I can give this a shot. I actually implemented this already here, but would like to do it all over again as it was a bit messy.

I personally have never understood why it is not standard, the data I krige (mostly houseprices) is always full of noise.

basaks · 2016-12-22T13:50:46Z

I think this should be implemented.

I personally have never understood why it is not standard, the data I krige (mostly houseprices) is always full of noise.

In the Gaussial Processes literature adding (white)noise to the Kernel functions is actually very common as demonstrated in the examples. Otherwise with enough basis functions you can fit your training data exactly (this will compare to zero nugget), and that can introduce overfitting.

kvanlombeek · 2016-12-23T10:45:16Z

I am having difficulties implementing this.

Do @rth @basaks @basaks you know why you fill in the negative of the variogram function, around line 381 in ok.py:

a[:n, :n] = - self.variogram_function(self.variogram_model_parameters, d)

If I replace the diagonal with the absolute values of the diagonal it works, but I can't get why.

rth · 2016-12-23T10:48:46Z

@bsmurphy would know more about this I think...

bsmurphy · 2016-12-24T01:23:30Z

That is a good question... I can't remember exactly right off the top of my head (and I don't have my reference texts handy at the moment), but I think it has to do with the sign convention that I used in this formulation of the kriging system...
What are you trying to put on the diagonal of the matrix currently? The negative of the nugget, or the actual (positive) nugget? In my comment above I think I mentioned just the diagonal of the kriging matrix; I think you also have to modify the formulation of the RHS of the matrix system (vector b) as well to add the nugget.

kvanlombeek · 2016-12-28T17:45:42Z

Hi all,

I have investigated a bit more, and came to the following:

The sign convention is a bit odd and hard to debug. I believe it is cleaner if matrix a would be filled with positive values. That implies changing the signs here and there in the code.
I have been looking at _exec_loop, as this is the easiest to read. I believer there is a mistake in the code, the calculation of sigmasq around line 470 is wrong, shouldn't it be sigmasq[j] = np.sum(x[:n, 0] * b[:n, 0]) + b[-1,0] instead of sigmasq[j] = np.sum(x[:, 0] * b[:, 0])

For the rest it looks pretty straight forward. Do you guys know a text book example that we can use to be sure our calculations are correct? Something with maybe only 5 values?

bsmurphy · 2017-01-10T04:32:03Z

Changing the sign convention could unleash all sorts of unanticipated problems, but with @rth's refactoring idea this would be the time to mess around with it. I'll need to look through my references again to make sure I'm remembering the sign business correctly... Regarding the calculation of the squared residuals, you're probably right -- I noticed that the kriging statistics calculations often produce incorrect (or even NaN) results, so something is definitely wrong with that whole business at some point. Again, I'll need to check the references... The Kitanidis text (the reference I use most often for all the kriging stuff) has an example I think, I'll try to pull it out.

lewisjared · 2017-04-21T03:34:26Z

Just wondering if this (being allowed to have uncertainties on the measurements for the kriging procedure) has been implemented yet? Is there a branch which contains the WIP?

kvanlombeek changed the title ~~implement the sill on the diagonal to prevent exact interpolation~~ implement the nugget on the diagonal to prevent exact interpolation Dec 21, 2016

bsmurphy mentioned this issue Feb 5, 2018

[Refactoring] N-dimenstional Kriging #31

Closed

9 tasks

bsmurphy mentioned this issue May 4, 2018

Measurement uncertaininty question #95

Open

MuellerSeb self-assigned this Jan 27, 2020

MuellerSeb added enhancement new feature Refactoring labels Jan 27, 2020

MuellerSeb added this to the v2.0 milestone Apr 5, 2020

nannau mentioned this issue May 22, 2020

Feature/exact values option #153

Merged

MuellerSeb linked a pull request Jun 24, 2020 that will close this issue

Feature/exact values option #153

Merged

MuellerSeb closed this as completed in #153 Jun 30, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

implement the nugget on the diagonal to prevent exact interpolation #30

implement the nugget on the diagonal to prevent exact interpolation #30

kvanlombeek commented Dec 20, 2016

rth commented Dec 20, 2016 •

edited

Loading

kvanlombeek commented Dec 21, 2016

basaks commented Dec 21, 2016

bsmurphy commented Dec 21, 2016

kvanlombeek commented Dec 22, 2016

basaks commented Dec 22, 2016 •

edited

Loading

kvanlombeek commented Dec 23, 2016

rth commented Dec 23, 2016

bsmurphy commented Dec 24, 2016

kvanlombeek commented Dec 28, 2016

bsmurphy commented Jan 10, 2017

lewisjared commented Apr 21, 2017

implement the nugget on the diagonal to prevent exact interpolation #30

implement the nugget on the diagonal to prevent exact interpolation #30

Comments

kvanlombeek commented Dec 20, 2016

rth commented Dec 20, 2016 • edited Loading

kvanlombeek commented Dec 21, 2016

basaks commented Dec 21, 2016

bsmurphy commented Dec 21, 2016

kvanlombeek commented Dec 22, 2016

basaks commented Dec 22, 2016 • edited Loading

kvanlombeek commented Dec 23, 2016

rth commented Dec 23, 2016

bsmurphy commented Dec 24, 2016

kvanlombeek commented Dec 28, 2016

bsmurphy commented Jan 10, 2017

lewisjared commented Apr 21, 2017

rth commented Dec 20, 2016 •

edited

Loading

basaks commented Dec 22, 2016 •

edited

Loading