Update VHR-10 dataset plotting #2092

robmarkcole · 2024-05-28T08:10:36Z

Fix plotting using percentiles

Note: I have uploaded the dataset at https://huggingface.co/datasets/satellite-image-deep-learning/VHR-10 - can transfer to torchgeo org if you want to use it here

ashnair1 · 2024-05-28T12:04:08Z

I get the normalization fix but why is the README code snippet being changed? Once 0.6.0 is released, the snippet should work. Also as I understand it, the goal of the snippet is to show how the dataloader works not how to train using datamodules

robmarkcole · 2024-05-28T12:21:51Z

I just assumed the docs predate the datamodules but we would like to showcase this feature - personally I always use datamodules now but perhaps am in the minority there. Suggest a second opinion before I change the PR

robmarkcole · 2024-05-28T14:02:13Z

There is some fishy behaviour still - in the load_target the image id has one subtracted here - why not use as below?

annot = self.coco.loadAnns(self.coco.getAnnIds(imgIds=id_))

torchgeo/datasets/vhr10.py

robmarkcole · 2024-05-28T16:19:52Z

Applied black as per the guide, and this has changed ALL of the strings

ashnair1 · 2024-05-28T16:23:20Z

Applied black as per the guide, and this has changed ALL of the strings

torchgeo no longer uses black, isort etc and has recently switched to ruff for linting and code formatting.

ashnair1 · 2024-05-28T17:26:25Z

There is some fishy behaviour still - in the load_target the image id has one subtracted here - why not use as below?
annot = self.coco.loadAnns(self.coco.getAnnIds(imgIds=id_))

Refer to #847 (comment)

robmarkcole · 2024-05-29T08:09:55Z

@ashnair1 I checked the approach I suggested (imgIds=id_)) and this results in erroneous masks - strange as the API appears to support this approach, suggesting the implementation has some quirks

The approach I am using elsewhere:

Get list of image ids: self.ids = list(sorted(self.coco.imgs.keys())) and use in getitem: id_ = self.ids[index]
Access filename using id_: self.coco.imgs[id_]['file_name']
Access annotations using API: annot = self.coco.loadAnns(self.coco.getAnnIds(imgIds=id_))

adamjstewart · 2024-05-29T11:21:04Z

Regarding the README

The README has a few different sections. The section you're editing is designed to show off our benchmark datasets. There is a separate section below specifically for PyTorch Lightning, including data modules. I would prefer to keep using a dataset instead of a data module here.

Actually, I think we're better off choosing a different dataset (maybe EuroSAT?). VHR-10 is too complicated and makes it look like TorchGeo is hard to use. Maybe we could move VHR-10 to the Lightning section of the README since it has a pretty visualization.

Regarding plotting

Yes, this is a common issue with plotting via dataset vs. plotting via data module. I agree with your solution, and proposed basically the same thing for all datasets in #1263. This issue is not specific to VHR-10, but affects almost all datasets/data modules. I haven't had the time/energy to fix this for all datasets, but contributions are definitely welcome here, even if it's only 1 dataset at a time.

Can we separate the README and plotting changes into two separate PRs?

adamjstewart · 2024-05-29T11:23:01Z

Note: I have uploaded the dataset at https://huggingface.co/datasets/satellite-image-deep-learning/VHR-10 - can transfer to torchgeo org if you want to use it here

It's easier to download tar/zip/rar files instead of raw data. But I'm happy to change the download to your org or ours. Not sure if we should rehost all rar datasets as tar/zip to make them easier to extract?

robmarkcole · 2024-05-29T12:02:39Z

@adamjstewart this PR now just addresses the plotting.

RE dataset choice, I agree EuroSAT would be a nice choice - it also supports selecting subsets of bands so shows immediately what is possible with torchgeo

robmarkcole added 2 commits May 28, 2024 08:06

Update VHR10 dataset visualization method

af7b7ab

Update VHR10 info in readme

dee5199

github-actions bot added documentation Improvements or additions to documentation datasets Geospatial or benchmark datasets labels May 28, 2024

isort

087223c

ashnair1 changed the title ~~Update VHR-10 dataset plotting and README into, address #2069~~ Update VHR-10 dataset plotting and README code snippet May 28, 2024

Missing annotations handling in plotting

5a9776b

ashnair1 reviewed May 28, 2024

View reviewed changes

torchgeo/datasets/vhr10.py Outdated Show resolved Hide resolved

robmarkcole added 4 commits May 28, 2024 15:38

Update VHR10 dataset handling and visualization

cab99c6

remove whitespace

a0e17f0

Fix whitespace issue in VHR10 dataset handling

1baac80

black format

44cb52a

ruff format

8cff678

Merge branch 'main' into update-VHR-10

759904c

robmarkcole mentioned this pull request May 29, 2024

Add plot method for VHR10 dataset #847

Merged

adamjstewart modified the milestones: 0.5.3, 0.6.0 May 29, 2024

revert readme changes

36f796e

github-actions bot removed the documentation Improvements or additions to documentation label May 29, 2024

robmarkcole changed the title ~~Update VHR-10 dataset plotting and README code snippet~~ Update VHR-10 dataset plotting May 29, 2024

adamjstewart approved these changes May 29, 2024

View reviewed changes

adamjstewart merged commit 6a9cd53 into microsoft:main May 29, 2024
18 of 19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update VHR-10 dataset plotting #2092

Update VHR-10 dataset plotting #2092

robmarkcole commented May 28, 2024 •

edited

Loading

ashnair1 commented May 28, 2024 •

edited

Loading

robmarkcole commented May 28, 2024

robmarkcole commented May 28, 2024

robmarkcole commented May 28, 2024

ashnair1 commented May 28, 2024

ashnair1 commented May 28, 2024

robmarkcole commented May 29, 2024 •

edited

Loading

adamjstewart commented May 29, 2024

adamjstewart commented May 29, 2024

robmarkcole commented May 29, 2024

Update VHR-10 dataset plotting #2092

Update VHR-10 dataset plotting #2092

Conversation

robmarkcole commented May 28, 2024 • edited Loading

ashnair1 commented May 28, 2024 • edited Loading

robmarkcole commented May 28, 2024

robmarkcole commented May 28, 2024

robmarkcole commented May 28, 2024

ashnair1 commented May 28, 2024

ashnair1 commented May 28, 2024

robmarkcole commented May 29, 2024 • edited Loading

adamjstewart commented May 29, 2024

Regarding the README

Regarding plotting

adamjstewart commented May 29, 2024

robmarkcole commented May 29, 2024

robmarkcole commented May 28, 2024 •

edited

Loading

ashnair1 commented May 28, 2024 •

edited

Loading

robmarkcole commented May 29, 2024 •

edited

Loading