This is the official implementation of our Pattern Recognition (PR) 2024 paper "HTR-VT: Handwritten Text Recognition with Vision Transformer". It is a new and effective baseline for handwritten text recognition that uses only a Vision Transformer and CTC loss.
[Project Page] [Paper] [arXiv] [Google Drive]
Our model can be trained on a single RTX-4090 GPU (24 GB).
conda env create -f environment.yml
conda activate htr
The code was tested on Python 3.9 and PyTorch 1.13.0.
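As a quick sanity check (our own sketch, not part of the repo), you can confirm that your environment matches the tested versions and that the GPU is visible:

```python
import sys
import torch

print(sys.version)        # expect 3.9.x
print(torch.__version__)  # expect 1.13.0
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))  # e.g. an RTX 4090
else:
    print("No CUDA device found")
```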
- We use the IAM, READ2016, and LAM datasets for handwritten text recognition.
IAM
Register on the FKI webpage: https://fki.tic.heia-fr.ch/databases/iam-handwriting-database
Download the dataset from: https://fki.tic.heia-fr.ch/databases/download-the-iam-handwriting-database
READ2016
wget https://zenodo.org/record/1164045/files/{Test-ICFHR-2016.tgz,Train-And-Val-ICFHR-2016.tgz}
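After downloading, extract both archives. A minimal sketch using Python's standard tarfile module; the target path ./data/read2016/ is our assumption, so adjust it to your layout:

```python
import tarfile

# Extract both READ2016 archives (target directory is an assumption;
# match it to your own ./data/ layout).
for archive in ["Test-ICFHR-2016.tgz", "Train-And-Val-ICFHR-2016.tgz"]:
    with tarfile.open(archive, "r:gz") as tar:
        tar.extractall("./data/read2016/")
```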
LAM
Download the dataset from here: https://aimagelab.ing.unimore.it/imagelab/page.asp?IdPage=46
- Download the datasets to ./data/. Take IAM as an example; the directory structure should be as follows (a small loading sketch follows the tree):
./data/iam/
├── train.ln
├── val.ln
├── test.ln
└── lines/
    ├── a01-000u-00.png
    ├── a01-000u-00.txt
    ├── a01-000u-01.png
    ├── a01-000u-01.txt
    └── ...
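To make the layout concrete, here is a minimal loading sketch (ours, not the repo's dataloader). It assumes each .ln split file lists one line ID per line (e.g. a01-000u-00) and that each ID has a matching .png image and .txt transcription under lines/:

```python
from pathlib import Path

def load_split(root="./data/iam", split="train"):
    """Pair each line image with its transcription for the given split.

    Assumes the .ln file contains one line ID per line, as in the
    directory tree above.
    """
    root = Path(root)
    samples = []
    for line_id in (root / f"{split}.ln").read_text().split():
        img_path = root / "lines" / f"{line_id}.png"
        text = (root / "lines" / f"{line_id}.txt").read_text().strip()
        samples.append((img_path, text))
    return samples

samples = load_split(split="train")
print(len(samples), samples[0])
```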
- We provide convenient and comprehensive commands in ./run/ to train and test on the different datasets, to help researchers reproduce the results of the paper.
If our project is helpful for your research, please consider citing:
@article{li2024htr,
  title={HTR-VT: Handwritten text recognition with vision transformer},
  author={Li, Yuting and Chen, Dexiong and Tang, Tinglong and Shen, Xi},
  journal={Pattern Recognition},
  pages={110967},
  year={2024},
  publisher={Elsevier}
}
We appreciate the public code of VAN and OrigamiNet.