til-23-cv

DSTA BrainHack TIL 2023 Qualifiers Computer Vision Task Code

Notes for Judges

Object Detection

YOLOv5u6 models were chosen because 1280p pretraining, see https://docs.ultralytics.com/models/yolov5.
- Model download: https://github.com/ultralytics/assets/releases/tag/v0.0.0.
- Model is auto-downloaded by ultralytics, no explicit download required.
- yolov5m6u.pt was chosen based on vram limitations and effective batch size.
For modifications made to ultralytics, see https://github.com/Interpause/ultralytics/compare/f23a035...main.
- Notably, ultralytics hardcodes Albumentations.
- I wasn't going to wait for a PR so I just modified their hardcoded stuff.

Suspect Recognition

DINOv2 model was used as encoder/backbone via timm.
- Model download: https://huggingface.co/timm/vit_small_patch14_dinov2.lvd142m.
- Model is auto-downloaded by timm, no explicit download required.
- DINOv2 (ViT-S/14) chosen for similar size to ResNet50 and State of the Art Object-Centric Representations via Self-Supervised Learning.
  - Update: DINOv2 ViT-B/14 was used instead. Download from: https://huggingface.co/timm/vit_base_patch14_dinov2.lvd142m.
ArcFace loss was used to reshape model's latent space into a hypersphere for cosine similarity: https://arxiv.org/abs/1801.07698.
sklearn Silhouette Score was used for cluster evaluation.
Training code for Suspect Recognition is custom based on PyTorch Lightning.
- Especially Lightning CLI.

Others

If coming here from Colab, ignore all subsequent instructions as they are already included in the notebook.
For the competition platform, %%sh ./setup.sh is included in the notebooks
- Remember to regularly "Write changes to dataset"!
- ./data is symlinked to the "data" dataset to store datasets.
- ./models is symlinked to the "models" dataset to store final models.
- ./runs is symlinked to the "Storage" dataset to store logs and training checkpoints.
"I have code OCD. Poetry is love, Poetry is life. And GG users of Conda."
- As one said in a group chat by the great engineer, @Interpause (John-Henry Lim).
The real reason I chose DINOv2 was PTSD.
- Behold! My internship project "Self-Supervised Learning of Video Object Segmentation using DINOSAUR and SAVi": Interpause/dinosavi.

Installation

Only Python 3.9 is supported at the moment to speed up dependency resolution.

# Ignore submodules; Will be installed from source in requirements.txt.
git clone https://github.com/Interpause/til-23-cv.git
pip install -r requirements.txt

# Or...
# TODO: Support below; Key requirement is to code with module invocation in mind
pip install git+https://github.com/Interpause/til-23-cv.git

Data Preparation

Download & extract the following files from https://zindi.africa/competitions/brainhack-til-23-advanced-cv/data:
- Train.zip -> data/til23plush/images/train/*.png
- Validation.zip -> data/til23plush/images/val/*.png
- Test.zip -> data/til23plush/images/test/*.png
- train_labels.zip -> data/til23plush/labels/train/*.txt
- val_labels.zip -> data/til23plush/labels/val/*.txt
- suspects.zip -> data/til23plush/suspects/*.png

Write data/til23plush/dataset.yaml:

train: images/train
val: images/val
test: images/test
nc: 200
names: ['#0','#1','#2','#3','#4','#5','#6','#7','#8','#9','#10','#11','#12','#13','#14','#15','#16','#17','#18','#19','#20','#21','#22','#23','#24','#25','#26','#27','#28','#29','#30','#31','#32','#33','#34','#35','#36','#37','#38','#39','#40','#41','#42','#43','#44','#45','#46','#47','#48','#49','#50','#51','#52','#53','#54','#55','#56','#57','#58','#59','#60','#61','#62','#63','#64','#65','#66','#67','#68','#69','#70','#71','#72','#73','#74','#75','#76','#77','#78','#79','#80','#81','#82','#83','#84','#85','#86','#87','#88','#89','#90','#91','#92','#93','#94','#95','#96','#97','#98','#99','#100','#101','#102','#103','#104','#105','#106','#107','#108','#109','#110','#111','#112','#113','#114','#115','#116','#117','#118','#119','#120','#121','#122','#123','#124','#125','#126','#127','#128','#129','#130','#131','#132','#133','#134','#135','#136','#137','#138','#139','#140','#141','#142','#143','#144','#145','#146','#147','#148','#149','#150','#151','#152','#153','#154','#155','#156','#157','#158','#159','#160','#161','#162','#163','#164','#165','#166','#167','#168','#169','#170','#171','#172','#173','#174','#175','#176','#177','#178','#179','#180','#181','#182','#183','#184','#185','#186','#187','#188','#189','#190','#191','#192','#193','#194','#195','#196','#197','#198','#199']

Object Detection

See notebooks/data.ipynb for converting til23plush into til23plushonly dataset, where all classes are relabelled to plushie.

Suspect Recognition

See notebooks/data.ipynb for converting til23plush into til23reid dataset, which uses the typical image classification folder structure.

Training

Object Detection

See notebooks/yolo.ipynb or use the command below:

yolo detect train cfg=cfg/custom.yaml model=yolov5m6u.pt data=data/til23plushonly/dataset.yaml workers=8 batch=8

Suspect Recognition

See notebooks/reid.ipynb or use the command below:

python -m til23cv.reid fit --config cfg/reid.yaml

Refer to the aforementioned notebook for subsequent export of model to torchscript.

Inference

See notebooks/infer.ipynb.

Contribution Guide

We use Black for formatting, isort for import sorting, and Google-style docstrings.
To generate requirements.txt, poetry export -f requirements.txt -o requirements.txt --without-hashes.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.vscode		.vscode
cfg		cfg
notebooks		notebooks
til_23_cv		til_23_cv
ultralytics @ 5dfa731		ultralytics @ 5dfa731
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

til-23-cv

Notes for Judges

Object Detection

Suspect Recognition

Others

Installation

Data Preparation

Object Detection

Suspect Recognition

Training

Object Detection

Suspect Recognition

Inference

Contribution Guide

About

Releases 2

Packages

Languages

BRIANWACK/til-23-cv

Folders and files

Latest commit

History

Repository files navigation

til-23-cv

Notes for Judges

Object Detection

Suspect Recognition

Others

Installation

Data Preparation

Object Detection

Suspect Recognition

Training

Object Detection

Suspect Recognition

Inference

Contribution Guide

About

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages