-
Notifications
You must be signed in to change notification settings - Fork 0
pixels are being dropped during estimation #389
Comments
Nice issue dude. Also I like the red arrows. So big. On Fri, Jul 19, 2013 at 6:24 PM, Robin Kraft [email protected]:
|
Turns out these pixels are dropped because their trends values are bad - the ["500" 9 6 38 1592 827
[-61.55873192436046 -61.55873192436046 -61.55873192436046 -61.55873192436046 -61.55873192436046
....]
[-9999.0 -9999.0 -9999.0 -9999.0 -9999.0 ...]
[-9999.0 -9999.0 -9999.0 -9999.0 -9999.0 ...]
[0.7689173428291797 0.807434980881081 0.8212616584246215 0.8566016818004791 0.8321855724693694 ...]] Anything with a Those The Since a singular matrix is quite rare, it's strange that these pixels are all clustered near each other. It makes me think that the raw NDVI values are screwed up somehow. |
So it's actually an issue with rain - the rain values are all -999, the nodata value for that dataset. That seems unlikely given the location of the pixels (why no data on the Indian coast?!?) so I'm running the rain preprocessing to see how far back the issue actually goes in the workflow. |
I'll be examining pixel |
Turns out that the rain data is actually all -999s for that location. We're investigating whether this is an issue in the raw data or an artifact of our workflow. |
We now have a workaround (#393) that ignores constant cofactors in the long-trends function that was causing this issue. #391 is still open so that we will also address any reprojection issues that could be a more fundamental problem. |
Summary
About 300k pixels are being dropped during estimation. Here's the map (it's the "missing pixels" visualization in CartoDB):
http://cdb.io/1aybQpl
Initial analysis
It almost looks like specific tiles are being dropped - the edges of these groups of pixels are way too clean, and don't correspond to the edges of ecoregions or gadm admin areas.
But I don't think that's what's actually happening - these pixels show up as gaps in the data. The hole in India is surrounded by non-missing FORMA data from the same tile, for example.
Missing pixels
Gaps in GFW site
Indian gap only affects part of tile
Test data/info
Missing tiles
Here are the missing tiles I uncovered (there could be more):
Debugging
There's about 1.6gb of data for the pixels that get dropped. To run the estimation step for these pixels, do this from a cluster:
Conclusion
We really need to fix this since it's a long-standing bug (since late last year at least!) and so our coverage is incomplete! But we can still do updates and whatnot until that happens - no one has noticed the holes so far, and hopefully we can fix this before they do.
The text was updated successfully, but these errors were encountered: