Skip to content

Latest commit

 

History

History
69 lines (60 loc) · 2.31 KB

INSTALL.md

File metadata and controls

69 lines (60 loc) · 2.31 KB

Installation

  1. Clone and enter this repository:
    git clone [email protected]:acaelles97/DeVIS.git
    cd DeVIS
    
  2. Install packages for Python 3.8:
    1. Install PyTorch 1.11.0 and torchvision 0.12.0 from here. The tested CUDA version is 11.3.0
    2. pip3 install -r requirements.txt
    3. Install youtube-vis api
    pip install git+https://github.com/youtubevos/cocoapi.git#"egg=pycocotools&subdirectory=PythonAPI
    
    1. Install MultiScaleDeformableAttention package:
    cd src/models/ops/
    python setup.py build_ext install
    

Check this if you experience problems installing youtube-vis api

Dataset preparation

First step is to download and extract each dataset: COCO, YT-19, YT-21 & OVIS User must set DATASETS.DATA_PATH to the root data path. We refer to src/datasets/coco.py & src/datasets/vis.py to modify the expected format for COCO and VIS datasets respectively. We expect the following organization:

cfg.DATASETS.DATA_PATH/
└── COCO/
  ├── train2017/
  ├── val2017/
  └── annotations/
      ├── instances_train2017.json
      └── instances_val2017.json
 
└── Youtube_VIS-2019/
  ├── train/
      ├── JPEGImages
      └── train.json 
  └── valid/
      ├── JPEGImages
      └── valid.json 

└── Youtube_VIS-2021/
  ├── train/
      ├── JPEGImages
      └── instances.json 
  └── valid/
      ├── JPEGImages
      └── instances.json

└── OVIS/
  ├── train/
  ├── annotations_train.json/
  ├── valid/     
  └── annotations_valid.json/

Download pre-trained weights

We provide pre-trained weights for the Deformable Mask Head training, as well as DeVIS (including ablations). We expect them to be downloaded and unpacked under the weights directory

cd weights
wget https://vision.in.tum.de/webshare/u/meinhard/cad/ablation_pre-trained_weights.zip
unzip ablation_pre-trained_weights.zip