Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: preserve the order of shapes in partition_pptx output #193

Merged
merged 6 commits into from
Feb 3, 2023

Conversation

MthwRobinson
Copy link
Contributor

Summary

Orders the elements extracted from .pptx files based on their position on the page. Proceeds from top to bottom and left to right. Previously the elements were not in the correct order.

Testing

Run the following using the filename Matt provides you. The outputs should go top to bottom and left to right based on the position of the shapes.

from unstructured.partition.auto import partition

partition(filename=filename)

@MthwRobinson MthwRobinson requested a review from qued February 3, 2023 19:42
Copy link
Contributor

@qued qued left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, just 1 typo

CHANGELOG.md Outdated Show resolved Hide resolved
Co-authored-by: qued <[email protected]>
@MthwRobinson MthwRobinson enabled auto-merge (squash) February 3, 2023 22:04
@MthwRobinson MthwRobinson merged commit 014585e into main Feb 3, 2023
@MthwRobinson MthwRobinson deleted the feat/pptx-tweaks branch February 3, 2023 22:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants