This repository contains the second place solution to the Vesuvius First Letters challenge.
The approach uses an I3D architecture to detect ink from within the scrolls
I provide an image of my environment in the requirements.txt, I believe only the following packages need to be installed
pytorch-lightning
typed-argument-parser
segmentation_models_pytorch
albumentations
warmup_scheduler
wandb
Download the data and appropriate segments (instructions here).
Unzip the labels folder and place each {segmentid}_inklabel.png in its appropriate segment folder.
Adjust the CFG class with your compute in 64x64_256stride_i3d.py, these are the typical configs used in the kaggle competition.
python 64x64_256stride_i3d.py
the inference script runs a trained model , for more info about the arguments check the InferenceArgumentParser class in the inference script. a pretrained checkpoint is available here
e.g: python inference.py --segment_id 123 --model_path 'model.ckpt'