TonEKYC: Vietnamese identity card reader

TODOs

Use a graph convolutional neural network for key information extraction (KIE)
Card alignment by deep models
OCR and extract information based on rules
Card alignment based on traditional digital image processing

Instant usage

Prerequisites

Python 3.9 or higher
Ubuntu 18 or higher

To extract information from an image of an identity card, run the command below:

python3 main.py --image [path/to/image]

To also write the results to a JSON or CSV file, add the --savejson or --savecsv argument, respectively.
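As an illustration of what such an export step might look like, here is a minimal sketch using only the standard library. The function name save_results, the field names, and the output file names are assumptions made for this example and are not taken from main.py.

```python
import csv
import json
from pathlib import Path


def save_results(fields, out_dir, stem="result", save_json=False, save_csv=False):
    """Write extracted fields to JSON and/or CSV and return the paths written.

    `fields` is a flat dict such as {"id_number": "...", "full_name": "..."}.
    All names here are hypothetical, chosen only for the example.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    if save_json:
        path = out / f"{stem}.json"
        # ensure_ascii=False keeps Vietnamese diacritics readable in the file.
        path.write_text(
            json.dumps(fields, ensure_ascii=False, indent=2), encoding="utf-8"
        )
        written.append(path)
    if save_csv:
        path = out / f"{stem}.csv"
        # utf-8-sig writes a BOM so spreadsheet tools detect the encoding.
        with path.open("w", newline="", encoding="utf-8-sig") as f:
            writer = csv.DictWriter(f, fieldnames=list(fields))
            writer.writeheader()
            writer.writerow(fields)
        written.append(path)
    return written
```

Calling save_results(fields, "output", save_json=True, save_csv=True) would create output/result.json and output/result.csv with one record each.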

Card alignment

The key information extractor requires an aligned card. Traditional digital image processing methods are used to apply a perspective transform to the raw image. Alignment is already integrated into the pipeline.
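A perspective transform needs the four card corners in a consistent order and a target size for the flattened card. The sketch below shows one common way to prepare both. It assumes the four corners have already been found (for example by an edge detector); the function names are hypothetical and this is not necessarily how card_alignment.py does it.

```python
import math


def order_corners(points):
    """Return four (x, y) corners as top-left, top-right, bottom-right, bottom-left.

    Uses the sum/difference heuristic, which assumes the card is only
    moderately rotated (image coordinates, y grows downward).
    """
    if len(points) != 4:
        raise ValueError("expected exactly four corner points")
    top_left = min(points, key=lambda p: p[0] + p[1])
    bottom_right = max(points, key=lambda p: p[0] + p[1])
    top_right = min(points, key=lambda p: p[1] - p[0])
    bottom_left = max(points, key=lambda p: p[1] - p[0])
    return [top_left, top_right, bottom_right, bottom_left]


def target_size(corners):
    """Width and height of the output rectangle, from the longer opposite edges."""
    tl, tr, br, bl = corners
    width = max(math.dist(tl, tr), math.dist(bl, br))
    height = max(math.dist(tl, bl), math.dist(tr, br))
    return round(width), round(height)
```

With OpenCV, the ordered corners and the target rectangle could then be passed to cv2.getPerspectiveTransform and cv2.warpPerspective to produce the aligned card image.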

Disclaimer: The card alignment process is still fairly naive, because I am still researching deep models for this step. For now it relies on the Dlib and Haar face detection models plus an edge detector. For best results, keep the card's rotation angle below 30 degrees and place the card on a dark background.
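To make the 30-degree guideline concrete, the sketch below estimates the in-plane rotation of a card from two facial landmarks and checks it against that limit. It assumes a face detector has already returned the two eye centers in pixel coordinates; the function names and the use of eye positions are assumptions for illustration, not a description of the repository's code.

```python
import math

MAX_ROTATION_DEG = 30.0  # limit recommended in the disclaimer above


def roll_angle_deg(left_eye, right_eye):
    """Angle of the line through both eyes, in degrees.

    Points are (x, y) in image coordinates (y grows downward), and
    left_eye is the eye that appears on the left side of the image.
    """
    dx = right_eye[0] - left_eye[0]
    dy = right_eye[1] - left_eye[1]
    return math.degrees(math.atan2(dy, dx))


def within_supported_rotation(left_eye, right_eye, limit=MAX_ROTATION_DEG):
    """True if the estimated rotation is below the supported limit."""
    return abs(roll_angle_deg(left_eye, right_eye)) < limit
```

A level portrait gives an angle near 0, so within_supported_rotation((120, 200), (180, 205)) returns True, while eyes on a 45-degree diagonal would be rejected.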

Text detection and OCR

OCR is handled by EasyOCR, a robust OCR library that supports many languages, including Vietnamese.

Due to time constraints, information is currently extracted with a rule-based method. Naturally, it cannot cover every situation, so I am researching methods based on graph neural networks for the KIE problem.
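As a rough idea of what rule-based extraction can look like, the sketch below scans OCR text lines with regular expressions. It is an illustration only: the patterns, the field names, and the assumptions that ID numbers have 9 or 12 digits and that the first date on the card is the date of birth are mine, not rules taken from this repository.

```python
import re

# Assumption: ID numbers are 12 digits (newer cards) or 9 digits (older cards).
ID_PATTERN = re.compile(r"\b\d{12}\b|\b\d{9}\b")
# Dates written as dd/mm/yyyy or dd-mm-yyyy.
DATE_PATTERN = re.compile(r"\b(\d{2})[/-](\d{2})[/-](\d{4})\b")


def extract_fields(lines):
    """Pick the first ID-like number and the first date from OCR text lines."""
    fields = {"id_number": None, "date_of_birth": None}
    for line in lines:
        if fields["id_number"] is None:
            match = ID_PATTERN.search(line)
            if match:
                fields["id_number"] = match.group()
        if fields["date_of_birth"] is None:
            match = DATE_PATTERN.search(line)
            if match:
                # Normalize the separator to "/".
                fields["date_of_birth"] = "/".join(match.groups())
    return fields
```

Rules like these break as soon as OCR splits a number across lines or the layout changes, which is the kind of gap a learned KIE model is meant to close.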
