Kumar Ayush*, Burak Uzkent*, Chenlin Meng*, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon.
Stanford University
In ICCV, 2021.
This is a PyTorch implementation of Geography-Aware Self-Supervised Learning. We use the the official implementation of MoCo-v2 for developing our methods.
Functional Map of the Dataset can be downloaded from their website/repo.
You can create csvs similar to the ones in the csvs/
folder.
Map showing distribution of the fMoW dataset.
Install PyTorch and download the fMoW dataset.
Similar to official implementation of MoCo-v2, this implementation only supports multi-gpu, DistributedDataParallel training, which is faster and simpler; single-gpu or DataParallel training is not supported.
To do self-supervised pre-training of a ResNet-50 model on fmow using our MoCo-v2+Geo+TP model in an 4-gpu machine, run:
python moco_fmow/main_moco_geo+tp.py \
-a resnet50 \
--lr 0.03 \
--dist-url 'tcp://localhost:14653' --multiprocessing-distributed --moco-t 0.02 --world-size 1 --rank 0 --mlp -j 4 \
--loss cpc --epochs 200 --batch-size 256 --moco-dim 128 --aug-plus --cos \
--save-dir ${PT_DIR} \
--data fmow
To do self-supervised pre-training of a ResNet-50 model on fmow using our MoCo-v2+TP model in an 4-gpu machine, run:
python moco_fmow/main_moco_tp.py \
-a resnet50 \
--lr 0.03 \
--dist-url 'tcp://localhost:14653' --multiprocessing-distributed --moco-t 0.02 --world-size 1 --rank 0 --mlp -j 4 \
--loss cpc --epochs 200 --batch-size 256 --moco-dim 128 --aug-plus --cos \
--save-dir ${PT_DIR} \
--data fmow
With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 4-gpu machine, run:
python moco_fmow/main_lincls.py \
-a resnet50 \
--lr 1 \
--dist-url 'tcp://localhost:14653' --multiprocessing-distributed --world-size 1 --rank 0 -j 4 \
--pretrained=${PT_DIR} \
--save-dir ${PTDIR}/lincls \
--data fmow --batch-size 256
Our pre-trained ResNet-50 models can be downloaded as following:
epochs | model | |
---|---|---|
MoCo-v2 | 200 | download |
MoCo-v2-Geo | 200 | download |
MoCo-v2-TP | 200 | download |
MoCo-v2+Geo+TP | 200 | download |
Download the GeoImageNet - The instructions to download GeoImageNet set are given here. Using this repository, we can download in the order of 2M images together with their coordinates. In the paper, we use 540k images for the GeoImageNet. The download process should download the images into their representative class folder. We recommend parallelizing the download process for efficiency.
Clustering - Once, we download the GeoImageNet dataset, we can use a clustering algorithm to cluster the images using their geo-coordinates. In the paper, we use K-means clustering to cluster 540k images into 100 clusters, however, any clustering algorithm can be used. After K-means clustering, we need to create a csv file similar to ones in the ./csvs/ folder.
Perform Self-Supervised Learning - After downloading the GeoImageNet and clustering the images, we can perform self-supervised learning. To do it, you can execute the following command :
python moco_fmow/main_moco_geo+tp.py \
-a resnet50 \
--lr 0.03 \
--dist-url 'tcp://localhost:14653' --multiprocessing-distributed --moco-t 0.02 --world-size 1 --rank 0 --mlp -j 4 \
--loss cpc --epochs 200 --batch-size 256 --moco-dim 128 --aug-plus --cos \
--save-dir ${PT_DIR}
Linear Classification - After learning the representations with MoCo-v2-geo, we can train the linear layer to classify GeoImageNet images. With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 4-gpu machine, run:
python moco_fmow/main_lincls.py \
-a resnet50 \
--lr 1 \
--dist-url 'tcp://localhost:14653' --multiprocessing-distributed --world-size 1 --rank 0 -j 4 \
--pretrained=${PT_DIR} \
--save-dir ${PTDIR}/lincls \
--batch-size 256
We use Retina-Net implementation from this repository for object detection experiments on xView. We use PSANet implementation from this repository for semantic segmentation experiments on SpaceNet.
If you find our work useful, please consider citing:
@article{ayush2021geography,
title={Geography-Aware Self-Supervised Learning},
author={Ayush, Kumar and Uzkent, Burak and Meng, Chenlin and Tanmay, Kumar and Burke, Marshall and Lobell, David and Ermon, Stefano},
journal={ICCV},
year={2021}
}