-
Notifications
You must be signed in to change notification settings - Fork 970
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Extracting picture data for raster images found in PPTX #349
Conversation
Signed-off-by: Maksym Lysak <[email protected]>
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
🟢 Require two reviewer for test updatesWonderful, this rule succeeded.When test data is updated, we require two reviewers
|
Signed-off-by: Maksym Lysak <[email protected]>
Added tests, ready for re-review |
doc.add_picture(parent=parent_slide, caption=None, prov=prov) | ||
doc.add_picture( | ||
parent=parent_slide, | ||
image=ImageRef.from_pil(image=pil_image, dpi=72), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to hard-code 72 DPI here? I guess we have no better information about the actual DPI?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added another commit, now extracting image DPI from the input file
Signed-off-by: Maksym Lysak <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
This PR populates image data in docling documents by PPTX backend, also introduces basic PPTX tests.
Checklist: