[CVPR2022]Learning Optical Flow with Kernel Patch Attention

Ao Luo¹, Fan Fang², Xin Li², Shuaicheng Liu^3,1

1. Megvii Research, 2. Group 42

3. University of Electronic Science and Technology of China

This project provides the official implementation of 'Learning Optical Flow with Kernel Patch Attention'. (CVPR-2022)

Abstract

Optical flow is a fundamental method used for quantitative motion estimation on the image plane. In the deep learning era, most works treat it as a task of ‘matching of features’, learning to pull matched pixels as close as possible in feature space and vice versa. However, spatial affinity (smoothness constraint), another important component for motion understanding, has been largely overlooked. In this paper, we introduce a novel approach, called kernel patch attention (KPA), to better resolve the ambiguity in dense matching by explicitly taking the local context relations into consideration. Our KPA operates on each local patch, and learns to mine the context affinities for better inferring the flow fields. It can be plugged into contemporary optical flow architecture and empower the model to conduct comprehensive motion analysis with both feature similarities and spatial relations. On Sintel dataset, the proposed KPA-Flow achieves the best performance with EPE of 1.35 on clean pass and 2.36 on final pass, and it sets a new record of 4.60% in F1-all on KITTI-15 benchmark.

Presentation Video

[Youtube], [Bilibili]

Overview

We propose a novel approach, called kernel patch attention (KPA), to better resolve ambiguity in dense matching by explicitly taking the local context relations into consideration. Our KPA operates on each local patch, and learns to mine the context affinities for better inferring the flow fields. It can be plugged into contemporary optical flow architecture and empower the model to conduct comprehensive motion analysis with both feature similarities and spatial relations.

Requirements

Python 3.6 with following packages

pytorch==1.6.0
torchvision==0.7.0
matplotlib
scipy
opencv-python
tensorboard

(The code has been tested on Cuda 10.0.)

Usage

The trained weights are available on GoogleDrive. Put *.pth files into folder ./weights.
Download Sintel and KITTI dataset, and set the root path of each class in ./core/datasets.py.
Evaluation on Sintel

./eval_sintel.sh

Evaluation on KITTI

./eval_kitti.sh

We extend our approach by appending the simplified KPA module to the encoder network. Experimental results demonstrate that the extended version achieves better performance than the original one.

Results

Citation

If you think this work is helpful, please cite

@inproceedings{luo2022learning,
  title={Learning Optical Flow With Kernel Patch Attention},
  author={Luo, Ao and Yang, Fan and Li, Xin and Liu, Shuaicheng},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={8906--8915},
  year={2022}
}

If you have any questions, please contact me at ([email protected]).

Acknowledgement

The main framework is adapted from RAFT. We thank the authors for the contribution.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
alt_cuda_corr		alt_cuda_corr
core		core
README.md		README.md
chairs_split.txt		chairs_split.txt
eval_kitti.sh		eval_kitti.sh
eval_sintel.sh		eval_sintel.sh
evaluate.py		evaluate.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[CVPR2022]Learning Optical Flow with Kernel Patch Attention

Ao Luo¹, Fan Fang², Xin Li², Shuaicheng Liu^3,1

1. Megvii Research, 2. Group 42

3. University of Electronic Science and Technology of China

Abstract

Presentation Video

Overview

Requirements

Usage

Results

Citation

Acknowledgement

About

Releases

Packages

Contributors 2

Languages

megvii-research/KPAFlow

Folders and files

Latest commit

History

Repository files navigation

[CVPR2022]Learning Optical Flow with Kernel Patch Attention

Ao Luo1, Fan Fang2, Xin Li2, Shuaicheng Liu3,1

1. Megvii Research, 2. Group 42

3. University of Electronic Science and Technology of China

Abstract

Presentation Video

Overview

Requirements

Usage

Results

Citation

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Ao Luo¹, Fan Fang², Xin Li², Shuaicheng Liu^3,1

Packages