Extract Text from PDFs and Images Using Tesseract.

pavsorab/text_grab

Text Grabber

Extract text from PDFs and images using Tesseract. Tesseract is an optical character recognition (OCR) engine that runs on various operating systems. It is free software, released under the Apache License, Version 2.0.

Packages

  • Django
  • Tesseract
  • Pdf2Image
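To show how these packages can fit together, here is a minimal sketch of the extraction step. It is not taken from this repository's code. It assumes the `pytesseract` wrapper is used to call Tesseract and that Pillow is available for opening images. The names `load_pages`, `extract_text`, and `IMAGE_SUFFIXES` are hypothetical.

```python
from pathlib import Path

# Hypothetical set of image types to accept; adjust to your needs.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def load_pages(path):
    """Return a list of PIL images: one per PDF page, or a single image."""
    suffix = Path(path).suffix.lower()
    if suffix == ".pdf":
        # pdf2image renders each PDF page to an image (requires Poppler).
        from pdf2image import convert_from_path
        return convert_from_path(str(path))
    if suffix in IMAGE_SUFFIXES:
        from PIL import Image
        return [Image.open(path)]
    raise ValueError(f"unsupported file type: {suffix!r}")


def extract_text(path, ocr=None):
    """Run OCR on every page of the file and join the results.

    `ocr` is any callable that maps an image to a string. By default it
    is pytesseract.image_to_string (an assumption about the wrapper used).
    """
    if ocr is None:
        import pytesseract
        ocr = pytesseract.image_to_string
    return "\n".join(ocr(page) for page in load_pages(path))
```

Calling `extract_text("scan.pdf")` would then return the recognized text of all pages as one string, provided the Tesseract and Poppler binaries are installed. The third-party imports sit inside the functions so that an unsupported file type is rejected before any of them is loaded.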

Usage

Prerequisites:

virtualenv venv

source venv/bin/activate

cd text_grabber

pip install -r requirements.txt
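The Python packages rely on external binaries that `pip` does not install: Tesseract itself, and Poppler's `pdftoppm`, which pdf2image uses to render PDF pages. The sketch below is one way to check that both are on your `PATH` before starting the app. The names `REQUIRED_TOOLS` and `missing_tools` are hypothetical and not part of this repository.

```python
import shutil

# External binaries the packages are expected to call (names are the
# usual executable names; adjust if your installation differs).
REQUIRED_TOOLS = {
    "tesseract": "Tesseract OCR engine",
    "pdftoppm": "Poppler (used by pdf2image)",
}


def missing_tools(which=shutil.which):
    """Return the names of required executables not found on PATH."""
    return [name for name in REQUIRED_TOOLS if which(name) is None]


if __name__ == "__main__":
    for name in missing_tools():
        print(f"missing: {name} ({REQUIRED_TOOLS[name]})")
```

If the script prints nothing, both tools were found.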

Run the Django app:

python3 manage.py runserver 0.0.0.0:8000
