VIEWING WRITING AS VIDEO: OPTICAL FLOW BASED MULTI-MODAL HANDWRITTEN MATHEMATICAL EXPRESSION RECOGNITION [ICASSP 2024]

Official implementation of the Optical Flow Aware Network (OFAN).

In this work, we treat the writing process as a video and introduce the Aggregated Optical Flow Map (AOFM) to represent the online modality in a form that is more compatible with the offline modality. We also propose the Optical Flow Aware Network (OFAN), which automatically extracts, aligns, and fuses features across the online and offline modalities. Experiments show that our method can be applied seamlessly to multiple existing offline handwritten mathematical expression recognition (HMER) models, yielding stable and substantial improvements on the CROHME 2014, 2016, and 2019 datasets.
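To make the idea of a flow map for the online modality more concrete, here is a minimal sketch. It is an assumption-laden illustration, not the repository's AOFM code: the function name `aggregate_flow_map`, the stroke format (lists of `(x, y)` points in pixel coordinates), and the rule of writing each pen displacement to the pixel where the movement starts are all hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Stroke = List[Point]


def aggregate_flow_map(strokes: List[Stroke], height: int, width: int):
    """Accumulate pen displacements into a 2-channel (dx, dy) map.

    Hypothetical illustration only: each movement between consecutive
    sampled points is treated as a motion vector and added to the pixel
    where the movement starts. This is NOT the repository's AOFM code.
    """
    # flow[c][row][col]: channel 0 holds dx, channel 1 holds dy.
    flow = [[[0.0] * width for _ in range(height)] for _ in range(2)]
    for stroke in strokes:
        # Pair each point with its successor within the same stroke.
        for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
            col, row = int(round(x0)), int(round(y0))
            if 0 <= row < height and 0 <= col < width:
                flow[0][row][col] += x1 - x0
                flow[1][row][col] += y1 - y0
    return flow


# One stroke: move right by 2 pixels, then down by 3 pixels.
example_flow = aggregate_flow_map([[(0, 0), (2, 0), (2, 3)]], height=4, width=4)
```

The result has the same spatial layout as an offline image of the expression, which is the property the description above highlights; how the actual AOFM is computed should be checked against the paper and the training scripts.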

unzip the dataset (CROHME.zip)

unzip CROHME.zip
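If the `unzip` tool is not available, the same step can be done from Python's standard library. This is a small sketch; the helper name `extract_archive` is hypothetical, and it assumes the archive should be extracted into the current directory, as the command above does.

```python
import zipfile
from pathlib import Path


def extract_archive(archive: str = "CROHME.zip", dest: str = ".") -> list:
    """Extract `archive` into `dest` and return the archive member names."""
    dest_path = Path(dest)
    dest_path.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest_path)
        return zf.namelist()
```

Calling `extract_archive()` with no arguments from the repository root mirrors the `unzip CROHME.zip` command.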

train

training WAP-OFAN:

python train_optical.py

training WAP with AOFM only (online modality):

python train_optical_single.py

training original WAP (offline modality):

python train_wap.py

test

testing WAP-OFAN:

bash test_on.sh

testing WAP with AOFM only (online modality):

bash test_on_single.sh

testing original WAP:

bash test.sh
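The train and test commands above pair up by model variant. The sketch below collects them in one lookup table so a variant can be trained and tested by name. The script names come from the commands listed above; the variant labels (`wap-ofan`, `aofm-only`, `wap`) and the helpers `command_for` and `run` are hypothetical and not part of the repository.

```python
import subprocess

# Script names are copied from the commands in this README;
# the dictionary keys are illustrative labels only.
VARIANTS = {
    "wap-ofan": {
        "train": ["python", "train_optical.py"],
        "test": ["bash", "test_on.sh"],
    },
    "aofm-only": {
        "train": ["python", "train_optical_single.py"],
        "test": ["bash", "test_on_single.sh"],
    },
    "wap": {
        "train": ["python", "train_wap.py"],
        "test": ["bash", "test.sh"],
    },
}


def command_for(variant: str, stage: str) -> list:
    """Return the command (as an argument list) for a variant and stage."""
    try:
        return list(VARIANTS[variant][stage])
    except KeyError:
        raise ValueError(f"unknown variant/stage: {variant!r}/{stage!r}") from None


def run(variant: str, stage: str, dry_run: bool = True) -> list:
    """Run the command from the repository root, or only return it if dry_run."""
    cmd = command_for(variant, stage)
    if not dry_run:
        subprocess.run(cmd, check=True)
    return cmd
```

For example, `run("wap-ofan", "train", dry_run=False)` followed by `run("wap-ofan", "test", dry_run=False)` would execute the two WAP-OFAN commands in order; the default `dry_run=True` only returns the command so it can be inspected first.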


Hanbo-Cheng/OFAN

Folders and files

Latest commit

History

Repository files navigation

VIEWING WRITING AS VIDEO: OPTICAL FLOW BASED MULTI-MODAL HANDWRITTEN MATHEMATICAL EXPRESSION RECOGNITION [ICASSP 2024]

train

test

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages