The actual datasets are hosted at https://huggingface.co/datasets/medviz/bike-datasets
Due to the large size of data files, please install git-lfs
and enable it first before downloading the data files.
Install git-lfs
:
brew install git-lfs
git lfs install
Add your SSH key to HuggingFace https://huggingface.co/settings/keys.
Then you can download the dataset from HuggingFace.
git clone [email protected]:datasets/medviz/bike-datasets
After downloading this repo, check the file size in the data/medviz
folder.
The file size of those .tsv
files should be quite large. If not, usually it's caused by missing of git-lfs
TBD
All datasets should follow the following naming convention.
- File name all upper case.
- File type and extension is lower case.
To ensure the loading by frontend, the .tsv
file must be in the correct format:
pid, title, abstract, journal, year, mesh_terms, color, x, y, size
The mesh_terms
column should be a list of keywords seperated by ;
, for example: Infections;Kidney;Kidney Diseases;Microscopy, Electron;Poultry Diseases;Specific Pathogen-Free Organisms