feat(ocr): added support for RapidOCR engine #415

Swaymaw · 2024-11-22T07:42:07Z

Added RapidOCR Model as an OCR engine option.
Added Options for configuring RapidOCR model during document conversion using pipeline options.
Updates documentation, added tests and updated dependencies(extras) to reflect the added engine support.
Updated examples to demonstrate the use of RapidOcrOptions.

This change allows users to seamlessly work with RapidOCR-OnnxRuntime engine which provides higher accuracy and performance in use-cases which require working with complex PDF files.

Checklist:

Commit Message Formatting: Commit titles and messages follow guidelines in the
conventional commits.
Documentation has been updated, if necessary.
Examples have been added, if necessary.
Tests have been added, if necessary.

Signed-off-by: swayam-singhal <[email protected]>

mergify · 2024-11-22T07:42:41Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?:

Signed-off-by: Swaymaw <[email protected]>

dolfim-ibm · 2024-11-25T07:57:59Z

@Swaymaw would you suggest we need both PaddleOCR and RapidOCR in Docling? Or one of the two is enough?

dolfim-ibm · 2024-11-25T08:23:57Z

Please see the test results, can you please address those?

Swaymaw · 2024-11-25T09:27:32Z

@Swaymaw would you suggest we need both PaddleOCR and RapidOCR in Docling? Or one of the two is enough?

I would say that we can choose to only stick with RapidOCR as it is much faster than PaddleOCR with the same accuracy and at the same time much simpler to install and work with. RapidOCR, also makes it easier to train and run inference with custom detection , classification and recognition model paths which will improve the overall usability of the framework with use-case specific models.

Signed-off-by: Swaymaw <[email protected]>

dolfim-ibm · 2024-11-25T16:35:03Z

Ok, let's then focus on getting this PR running. There are still a few installation issue in CI for onnx.

Signed-off-by: Michele Dolfi <[email protected]>

docling/datamodel/pipeline_options.py

…ngle option for all models Signed-off-by: Swaymaw <[email protected]>

Signed-off-by: Swaymaw <[email protected]>

cau-git · 2024-11-27T10:31:02Z

@Swaymaw Thanks for the configuration options enhancements, this is matching what I had in mind.

However, to better align with an in-development global configuration system in docling (see here) without breaking this config interface down the line, we will take the liberty of temporarily hiding all the device-related configuration options to users in RapidOcrOptions and make the AUTO the implicit default. As such, we don't need to delay the merge of this PR and we will revisit how to expose the configuration options short-term.

Signed-off-by: Michele Dolfi <[email protected]>

cau-git

LGTM

swayam-singhal and others added 2 commits November 22, 2024 12:45

adding rapidocr engine for ocr in docling

9bb2e58

Signed-off-by: swayam-singhal <[email protected]>

Merge branch 'main' of https://github.com/DS4SD/docling into rapidocr

1b86a86

fixing styling format

cbaf2b5

Signed-off-by: Swaymaw <[email protected]>

dolfim-ibm self-requested a review November 25, 2024 07:58

dolfim-ibm requested a review from cau-git November 25, 2024 08:42

updating pyproject.toml and poetry.lock to fix ci bugs

ac1faeb

Signed-off-by: Swaymaw <[email protected]>

help poetry pinning for python3.9

686affe

Signed-off-by: Michele Dolfi <[email protected]>

dolfim-ibm reviewed Nov 26, 2024

View reviewed changes

docling/datamodel/pipeline_options.py Show resolved Hide resolved

dolfim-ibm mentioned this pull request Nov 26, 2024

feat(ocr): added support for PaddleOCR engine #393

Closed

4 tasks

Swaymaw added 2 commits November 27, 2024 10:26

simplifying rapidocr options so that device can be changed using a si…

0348cfb

…ngle option for all models Signed-off-by: Swaymaw <[email protected]>

fix styling issues and small bug in rapidOcrOptions

74e005d

Signed-off-by: Swaymaw <[email protected]>

dolfim-ibm added 3 commits November 27, 2024 11:39

use default device until we enable global management

c1b6442

Signed-off-by: Michele Dolfi <[email protected]>

Merge remote-tracking branch 'origin/main' into rapidocr

c228f34

Signed-off-by: Michele Dolfi <[email protected]>

Merge remote-tracking branch 'origin/main' into rapidocr

9f265e9

Signed-off-by: Michele Dolfi <[email protected]>

cau-git reviewed Nov 27, 2024

View reviewed changes

cau-git approved these changes Nov 27, 2024

View reviewed changes

cau-git merged commit 85b2999 into DS4SD:main Nov 27, 2024
7 checks passed

This was referenced Dec 6, 2024

feat(Accelerator): Introduce AI runtime configuration scheme #514

Merged

Adding PaddlePaddleOCR #541

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ocr): added support for RapidOCR engine #415

feat(ocr): added support for RapidOCR engine #415

Swaymaw commented Nov 22, 2024

mergify bot commented Nov 22, 2024

dolfim-ibm commented Nov 25, 2024

dolfim-ibm commented Nov 25, 2024

Swaymaw commented Nov 25, 2024

dolfim-ibm commented Nov 25, 2024

cau-git commented Nov 27, 2024 •

edited

Loading

cau-git left a comment

feat(ocr): added support for RapidOCR engine #415

feat(ocr): added support for RapidOCR engine #415

Conversation

Swaymaw commented Nov 22, 2024

mergify bot commented Nov 22, 2024

Merge Protections

🟢 Enforce conventional commit

dolfim-ibm commented Nov 25, 2024

dolfim-ibm commented Nov 25, 2024

Swaymaw commented Nov 25, 2024

dolfim-ibm commented Nov 25, 2024

cau-git commented Nov 27, 2024 • edited Loading

cau-git left a comment

Choose a reason for hiding this comment

cau-git commented Nov 27, 2024 •

edited

Loading