Add the EnviroAtlas dataset #364
Note my |
This dataset has the same problems as ChesapeakeCVPR. If we find a way to improve sampling speed for RasterDataset, these datasets won't benefit. Also, I'm not sure if these datasets can be combined with any other GeoDataset, which violates a lot of the fundamental assumptions we make about GeoDataset instances and nullifies the whole point of using GeoDataset instead of VisionDataset. I'm fine with merging this as is because we already have a similarly broken dataset in TorchGeo, but we need to seriously consider how to handle these kinds of datasets in the future.
This dataset extends
Why not? Does it not correctly extend
Do you have anything else in mind besides the above points? |
If I'm reading things correctly, this dataset doesn't warp things into a common CRS, correct? This is incompatible with other GeoDatasets. GeoDataset makes the assumption that the index and the data returned by |
Inconsistencies between GeoDataset and ChesapeakeCVPR/EnviroAtlas, and refactoring so that RasterDataset code can be shared with these datasets, can be handled in a follow-up PR. I vote we merge this if everything else looks good.
The index is in a single CRS (EPSG:3857), but the problem is that The assumption that this (and ChesapeakeCVPR) breaks is that |
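To make the CRS concern in this thread concrete, here is a minimal, library-free sketch. It is not TorchGeo code: `Tile`, `sample_crs`, and `is_consistent` are hypothetical names, and the per-tile CRS values are made up for illustration. It only shows the invariant being discussed, namely that data returned for a query should be in the same CRS as the index (EPSG:3857 here), which holds when tiles are warped and fails when they are returned in their native CRS.

```python
from dataclasses import dataclass
from typing import Iterable

# CRS of the spatial index, as stated in this thread.
INDEX_CRS = "EPSG:3857"


@dataclass(frozen=True)
class Tile:
    """Hypothetical stand-in for one raster file in the dataset."""

    name: str
    native_crs: str  # CRS the pixels are stored in on disk (made-up values below)


def sample_crs(tile: Tile, warp: bool) -> str:
    """Return the CRS of the data a query would yield for this tile.

    With warp=True the tile is assumed to be reprojected to the index CRS;
    with warp=False it is returned as stored.
    """
    return INDEX_CRS if warp else tile.native_crs


def is_consistent(tiles: Iterable[Tile], warp: bool) -> bool:
    """Check that every returned sample shares the CRS of the index."""
    return all(sample_crs(tile, warp) == INDEX_CRS for tile in tiles)


# Two illustrative tiles stored in different (assumed) UTM zones.
tiles = [Tile("tile_a", "EPSG:26917"), Tile("tile_b", "EPSG:26918")]

print(is_consistent(tiles, warp=False))
print(is_consistent(tiles, warp=True))
```

Under these assumptions, only the warped variant could safely be combined with another dataset indexed in the same CRS.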
Any thoughts on how to clean up data.py / do it differently (or do we not care too much about those)? |
I'm not worried about |
* Add dataset
* Add dataset to docs
* Tests for enviroatlas
* Test coverage
* Added numpy type
* Added plotting
* Code review changes
* Propagating code review comments to Chesapeake
Adds one of the datasets used in https://openreview.net/forum?id=AEa_UepnMDX