GitHub - SJTUwxz/LoCoNet_ASD: code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection

LoCoNet: Long-Short Context Network for Active Speaker Detection

Dependencies

Start from building the environment

conda env create -f requirements.yml
conda activate loconet

export PYTHONPATH=project_dir/dlhammer:$PYTHONPATH and replace project_dir with your code base location

Data preparation

We follow TalkNet's data preparation script to download and prepare the AVA dataset.

python train.py --dataPathAVA AVADataPath --download

AVADataPath is the folder you want to save the AVA dataset and its preprocessing outputs, the details can be found in here . Please read them carefully.

After AVA dataset is downloaded, please change the DATA.dataPathAVA entry in the config file.

Training script

python -W ignore::UserWarning train.py --cfg configs/multi.yaml OUTPUT_DIR <output directory>

Pretrained model

Please download the LoCoNet trained weights on AVA dataset here.

python -W ignore::UserWarning test_multicard.py --cfg configs/multi.yaml  RESUME_PATH {model download path}

Citation

Please cite the following if our paper or code is helpful to your research.

@article{wang2023loconet,
  title={LoCoNet: Long-Short Context Network for Active Speaker Detection},
  author={Wang, Xizi and Cheng, Feng and Bertasius, Gedas and Crandall, David},
  journal={arXiv preprint arXiv:2301.08237},
  year={2023}
}

Acknowledge

The code base of this project is studied from TalkNet which is a very easy-to-use ASD pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
TalkSet		TalkSet
__pycache__		__pycache__
configs		configs
dlhammer		dlhammer
legacy		legacy
metrics		metrics
model		model
scripts		scripts
torchvggish		torchvggish
utils		utils
videoloaders		videoloaders
.DS_Store		.DS_Store
README.md		README.md
builder.py		builder.py
dataLoaderTalkSet.py		dataLoaderTalkSet.py
dataLoader_multiperson.py		dataLoader_multiperson.py
environment.yml		environment.yml
loconet.py		loconet.py
loss_multi.py		loss_multi.py
test_multicard.py		test_multicard.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LoCoNet: Long-Short Context Network for Active Speaker Detection

Dependencies

Data preparation

Training script

Pretrained model

Citation

Acknowledge

About

Releases

Packages

Languages

SJTUwxz/LoCoNet_ASD

Folders and files

Latest commit

History

Repository files navigation

LoCoNet: Long-Short Context Network for Active Speaker Detection

Dependencies

Data preparation

Training script

Pretrained model

Citation

Acknowledge

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages