Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Group relevant information into one block #10500

Closed
paulgromy opened this issue Jul 28, 2023 · 5 comments
Closed

Group relevant information into one block #10500

paulgromy opened this issue Jul 28, 2023 · 5 comments
Assignees
Labels
Code PR is needed This issue could inspire a code PR stale status/close

Comments

@paulgromy
Copy link

I figured out how to get text from a PDF file using this library. But I would like to improve my functionality.
The fact is that the recognized text is not presented in a structured way, and I would like to receive text at the output that can be grouped into blocks.

Perhaps I explained not quite clearly, here you can find more details at this link https://stackoverflow.com/questions/76787641/get-data-in-the-form-of-blocks-when-recognizing-a-file

I would be happy if you tell me how to implement this idea

@paulgromy paulgromy added the Code PR is needed This issue could inspire a code PR label Jul 28, 2023
@ToddBear
Copy link
Collaborator

ToddBear commented Aug 2, 2023

I can generally understand what you mean. But the link you shared seems to be dead, can you re-share it?

@shiyutang
Copy link
Collaborator

shiyutang commented Aug 2, 2023

Hi there, did you try ppstructure?

for example, it gets the right-side result on the left image
image

@ToddBear
Copy link
Collaborator

ToddBear commented Aug 2, 2023

From what I understand, are you trying to group the results of ocr in the appropriate blocks like below ?

Where the image is divided into different areas such as text, table, and figure.
image

And each text area contains multiple lines of text.
image

You may try the layout recovery module function in ppstructure, which can restores the input image to a word or pdf file with the same layout as the original image.

@paulgromy
Copy link
Author

@ToddBearThank you for paying attention to my question. Check the link again

Copy link
Contributor

github-actions bot commented Jan 3, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Code PR is needed This issue could inspire a code PR stale status/close
Projects
None yet
Development

No branches or pull requests

4 participants