The vdetlib
python library serves to detection objects in videos. It was originally developed for the ImageNet VID challenge introduced in ILSVRC2015. It contains components such as region proposal, still-image object detection, generic object tracking, spatial max-pooling and temporal convolution.
The T-CNN framework contains many tools that utilizes vdetlib
. Please checkout that repository if you are interested.
If you find vdetlib useful in your research and related project, please consider citing the following work accepted in CVPR 2016.
@inproceedings{kang2016object,
Title = {Object Detection from Video Tubelets with Convolutional Neural Networks},
Author = {Kang, Kai and Ouyang, Wanli and Li, Hongsheng and Wang, Xiaogang},
Booktitle = {CVPR},
Year = {2016}
}
This project is released under the MIT License.
- caffe with
Python layer
andpycaffe
- FCN tracker
Matlab
with python engine
-
Clone the repository
$ git clone https://github.com/myfavouritekk/vdetlib.git
-
Compilation
$ cd vdetlib $ make
There are some basic protocol types for using this library. All of them are defined as python dictionaries and are saved as JSON
files. The definitions are written in the protocol.py
.
- detailed documentation
- demo script