bounding box for each list item #6

Open
kailigo opened this issue Oct 9, 2019 · 1 comment

kailigo commented Oct 9, 2019

Could you advise how to get a bounding box for each list item? Currently, a single bounding box covers all list items; I would like a separate bounding box for each item. Thanks.

Contributor

zhxgj commented Oct 10, 2019

Hi @kailigo, bounding boxes for individual list items cannot be obtained directly from the raw PubLayNet data. We treat the whole list as a single basic layout element. To get boxes for the list items, you will need to extract the text inside the list box from the PDF and match it against the list items in the XML.
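The matching step described above could be sketched roughly as follows. This is not code from the PubLayNet maintainers, only one possible approach under stated assumptions: the text lines inside the list box have already been extracted from the PDF with their own boxes (with whatever PDF tool you use), the list item strings have already been read from the XML, and both are in reading order. The names `normalize`, `union`, `item_boxes`, and the `threshold` value are all hypothetical.

```python
import re
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and keep only letters and digits, so bullets, spacing and
    hyphenation differences between PDF and XML text do not matter."""
    return re.sub(r"[^0-9a-z]+", "", text.lower())


def union(boxes):
    """Smallest (x0, y0, x1, y1) box enclosing all the given boxes."""
    x0s, y0s, x1s, y1s = zip(*boxes)
    return (min(x0s), min(y0s), max(x1s), max(y1s))


def item_boxes(lines, items, threshold=0.8):
    """Assign PDF text lines to XML list items and return one box per item.

    lines: [(text, (x0, y0, x1, y1)), ...] lines inside the list box (assumed input).
    items: list item strings from the XML, in the same order (assumed input).
    Returns a list with one box per item, or None if no line matched it.
    """
    targets = [normalize(item) for item in items]
    groups = [[] for _ in items]
    current, consumed = 0, 0  # current item index, characters of it already matched
    for text, box in lines:
        norm = normalize(text)
        if not norm:
            continue
        j, offset = current, consumed
        while j < len(targets):
            # Compare the line with the next unmatched slice of item j.
            expected = targets[j][offset:offset + len(norm)]
            if SequenceMatcher(None, norm, expected).ratio() >= threshold:
                groups[j].append(box)
                current, consumed = j, offset + len(norm)
                break
            j, offset = j + 1, 0  # otherwise try the start of the next item
        # A line that matches no item is left unassigned.
    return [union(group) if group else None for group in groups]


lines = [
    ("• First item that wraps", (10, 100, 200, 112)),
    ("onto a second line", (20, 114, 180, 126)),
    ("• Second item", (10, 130, 120, 142)),
]
items = ["First item that wraps onto a second line", "Second item"]
print(item_boxes(lines, items))
```

Comparing normalized text with a similarity threshold, rather than requiring exact equality, is a design choice meant to tolerate small extraction differences. Numbered lists whose markers appear in the PDF but not in the XML would likely need the markers stripped first.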
