[ICLR 2024] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Harvard-Ophthalmology-AI-Lab/FairSeg

FairSeg

[ICLR 24] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

by Yu Tian*, Min Shi*, Yan Luo*, Ava Kouhana, Tobias Elze, and Mengyu Wang.

Download Harvard-FairSeg Dataset

  • The Harvard-FairSeg dataset can be accessed via this link. If you have any questions, please email [email protected].

  • If you cannot download the Harvard-FairSeg dataset directly, please request access through the Google Drive link above. We will grant access within 3-5 days.

  • For the FairSeg implementations based on SAMed and TransUNet, please refer to their respective folders.

  • Note that the modifier word “Harvard” only indicates that our dataset is from the Department of Ophthalmology of Harvard Medical School. It does not imply an endorsement, sponsorship, or assumption of responsibility by either Harvard University or Harvard Medical School as a legal identity.

  • CVer explainer (in Chinese)

Dataset Description

This dataset may be used only for non-commercial research purposes. At no time shall the dataset be used for clinical decisions or patient care. The data use license is CC BY-NC-ND 4.0.

The dataset covers 10,000 patients and includes 10,000 scanning laser ophthalmoscopy (SLO) fundus images. Disc and cup masks are included, along with each patient's age, gender, race, ethnicity, language, and marital status. Under the folder "ReadMe", the file "data_summary.csv" provides an overview of our data.

The 10,000 SLO fundus images with pixel-wise disc and cup masks are stored in the Google Drive folder as: data_00001.npz, data_00002.npz, ..., data_10000.npz

NPZ files have the following keys:

slo_fundus: Scanning laser ophthalmoscopy (SLO) fundus image
disc_cup_mask: disc and cup masks for the corresponding SLO fundus image
age: patient's age
gender: 0 - Female, 1 - Male
race: 0 - Asian, 1 - Black, 2 - White
ethnicity: 0 - Non-Hispanic, 1 - Hispanic, -1 - Unknown
language: 0 - English, 1 - Spanish, 2 - Others, -1 - Unknown
maritalstatus: 0 - Married or Partnered, 1 - Single, 2 - Divorced, 3 - Widowed, 4 - Legally Separated, -1 - Unknown
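
The keys above can be read with NumPy's `np.load`. The following is a minimal sketch, not the repository's own loader: the helper names `sample_filename` and `load_sample` are hypothetical, and the synthetic 8x8 arrays stand in for a real file, since the actual image shape, dtype, and mask encoding are not stated here. Only the key names and label codes are taken from the list above.

```python
import os
import tempfile

import numpy as np

# Label codes copied from the key descriptions above.
GENDER = {0: "Female", 1: "Male"}
RACE = {0: "Asian", 1: "Black", 2: "White"}
ETHNICITY = {0: "Non-Hispanic", 1: "Hispanic", -1: "Unknown"}
LANGUAGE = {0: "English", 1: "Spanish", 2: "Others", -1: "Unknown"}
MARITAL_STATUS = {
    0: "Married or Partnered",
    1: "Single",
    2: "Divorced",
    3: "Widowed",
    4: "Legally Separated",
    -1: "Unknown",
}


def sample_filename(index):
    """Hypothetical helper: build a name such as data_00001.npz."""
    return f"data_{index:05d}.npz"


def _code(value):
    """Convert a size-1 NumPy array (0-d or 1-element) to a Python int."""
    return int(np.asarray(value).item())


def load_sample(path):
    """Hypothetical helper: load one NPZ file and decode its attribute codes."""
    with np.load(path) as npz:
        return {
            "slo_fundus": npz["slo_fundus"],
            "disc_cup_mask": npz["disc_cup_mask"],
            "age": float(np.asarray(npz["age"]).item()),
            "gender": GENDER[_code(npz["gender"])],
            "race": RACE[_code(npz["race"])],
            "ethnicity": ETHNICITY[_code(npz["ethnicity"])],
            "language": LANGUAGE[_code(npz["language"])],
            "maritalstatus": MARITAL_STATUS[_code(npz["maritalstatus"])],
        }


# Synthetic stand-in for a real file; shapes and dtypes are assumptions.
demo_path = os.path.join(tempfile.mkdtemp(), sample_filename(1))
np.savez(
    demo_path,
    slo_fundus=np.zeros((8, 8), dtype=np.uint8),
    disc_cup_mask=np.zeros((8, 8), dtype=np.int8),
    age=63.0,
    gender=0,
    race=2,
    ethnicity=-1,
    language=0,
    maritalstatus=3,
)

sample = load_sample(demo_path)
print(sample["gender"], sample["race"], sample["ethnicity"], sample["maritalstatus"])
```

To use this with the downloaded data, replace `demo_path` with the path to one of the dataset files. Inspecting `sample["slo_fundus"].shape` and `np.unique(sample["disc_cup_mask"])` on a real file is a reasonable first check, because the mask label values are not documented in this section.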

Acknowledgement and Citation

If you find this repository useful for your research, please consider citing our paper:

@inproceedings{tianfairseg,
  title={FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling},
  author={Tian, Yu and Shi, Min and Luo, Yan and Kouhana, Ava and Elze, Tobias and Wang, Mengyu},
  booktitle={The Twelfth International Conference on Learning Representations}
}
