many more Ocropy improvements #17

bertsky · 2019-10-22T19:59:01Z

Note: this depends on OCR-D/core#311 and OCR-D/core#327 (which is already in core:edge but not merged to master and not released yet – except via ocrd/core:edge on Dockerhub). So merging probably does not make sense yet.

- common.remove_noise: in addition to ocrolib.common.remove_noise on the binary image (which removes small foreground components), run on the inverted image (removing small "background components", i.e. white specks in letters) - denoise: increase default maxsize, and re-parameterize to unit pt (points), making it independent of image resolution

…(pad)

- when cropping a clipped segment image from its parent image, do not use the segment's width and height in absolute coordinates (with merely the offset in relative coordinates), but instead determine the segment's bounding box only in relative coordinates - after cropping, if the segment has an orientation angle, then rotate the segment image accordingly (but subtract any higher level angle already present) (Both fixes are required to meet the expectations of the core image API; otherwise relative coordinates cannot be calculated correctly for the generated images.)

- when an AlternativeImage already exists for a TextLine and hence its coordinates must be compared to the region segmentation on the line level instead of the region level, by cropping region labels to the same bounding box, do not use the line's width and height in absolute coordinates (with merely the offset in relative coordinates), but instead determine the line's bounding box only in relative coordinates - when creating a mask array from a polygon, ensure to also fill the outline (not just the interior) - when thresholding the size of a contour's parts, re-use and respect the existing threshold parameter

…d lines)

- filter deskewed images only on the level of operation - respect angle already applied on parent level by * rotating from that actual to the annotated target angle * annotating the sum of both in the result - do not propagate image features deskewed and rotated-X

bertsky · 2019-10-26T00:13:23Z

@finkf I updated the requirement to ocrd>=2.0.0a1, which is the PyPI release line for core:edge as detailed above. So this can be merged now.

Do you want me to release it on PyPI as well? (I don't see any versions there yet, also for the old package name. PyPI deployment is a requirement for many kinds of use cases / users...)

kba · 2019-10-26T19:24:58Z

Do you want me to release it on PyPI as well? (I don't see any versions there yet, also for the old package name. PyPI deployment is a requirement for many kinds of use cases / users...)

I will do that now, as v0.0.5. Happy to turn the package over to you and support you if you're unfamiliar with pypi.

kba · 2019-10-26T19:28:36Z

https://pypi.org/project/cis-ocrd/ 🎉

bertsky · 2019-10-26T22:25:47Z

@finkf same offer from me. Also: I can help you rework the rest of the repo to get ready for core 1.0 / 2.0, and end up with a good master – if you like.

Regarding core, I have seen a significant change for post-correction after ocrd==1.0.0b10: the TextEquiv.conf attribute is no longer a float, but a string again. (Remember, we used to have a problem in core at that point with newer generateDS code not correctly parsing string conf from PAGE strings into the PAGE DOM. The fix included a change on the other side, though: Now it generates the DOM with conf as string as well. So you have to convert to float yourself to get the old behaviour on the receiving end during post-correction. For examples of how I fixed this in my repos, see ocrd_keraslm and cor-asv-ann. The reason behind this is generateDS does not support conversion from simpleType for attributes.)

And in case you do not already know: ocrd 1.0 was a bit premature, because the much needed image API fixes (which have already been around for a while) already break it. So we introduced a new branch edge on core and on ocrd_tesserocr until we know these can be trusted themselves. To remedy installation of these, they got pre-released under ocrd==2.0.0a1 and ocrd_tesserocr==0.5.0 now. I know – lots of complexity, very confusing. Sorry!

finkf · 2019-10-28T09:35:39Z

Yes, help would be appreciated. I think, that I will merge this and add my changes to devel as well.
Then we can check things and update master.

As to the TextEquiv.conf I think that non of our post-correction python code touches the confidence value, so we should be good.

bertsky · 2019-10-28T10:18:00Z

Yes, help would be appreciated. I think, that I will merge this and add my changes to devel as well.
Then we can check things and update master.

Ok, let me know when I should take a look.

As to the TextEquiv.conf I think that non of our post-correction python code touches the confidence value, so we should be good.

Are you quite sure? It seems you are using word confidences the same way I was using glyph confidences. That line is going to fail after 1.0.0b10. You will need something like

conf = min([float(te0(x).conf or "1.") for x in word.region])

finkf · 2019-10-28T10:34:06Z

Yeah. You are right. I will fix this (after the merge). So I will merge this now, and fix TextEquiv.conf afterwards. OK?

bertsky · 2019-10-28T11:22:34Z

Whatever suits you better. (The PR adds the 2.0.0a1 requirement, so it would be "cleaner" to add all other updates here first. But if you already have changes yourself, it's probably not worth the effort, and synchronize afterwards.)

finkf · 2019-10-28T12:20:23Z

Yes. I guess you are right

lgtm-com · 2019-10-29T13:56:00Z

This pull request introduces 2 alerts and fixes 1 when merging 590c2f2 into a0dd028 - view on LGTM.com

new alerts:

1 for Unused import
1 for Wrong number of arguments in a call

fixed alerts:

1 for Syntax error

bertsky and others added 17 commits September 24, 2019 15:29

dewarp: check for bad cropping at top/bottom

c9b1a49

dewarp: when checks fail, pad vertically

7dec9b0

resegment: ignore regions with only 1 line

f5d9f4b

gradmaps do need doubling of estimated scale

6345fb9

ensure hmerge uses the full line

62e6dc5

dewarp: distinguish failures between invalid (ignore) and inadequate …

7dd6a42

…(pad)

dewarp: more robust detection of invalid (badly cropped) lines

010f5fb

deskew: more robust

dbb0968

deskew: use image rotation from core

6eab7b3

binarize: always request raw images for input

9276c08

dewarp: even more robust detection of invalid (badly cropped/segmente…

0f5cff9

…d lines)

clip: do not propagate image features deskewed and rotated-X from parent

25b8ea0

recognize: do not rely on xywh dict from core

ac68fe2

bertsky mentioned this pull request Oct 22, 2019

Fix image API (for rotation) OCR-D/core#327

Merged

requires core 2.0

da18088

Merge branch 'dev' into dev

590c2f2

finkf merged commit 590c2f2 into cisocrgroup:dev Oct 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

many more Ocropy improvements #17

many more Ocropy improvements #17

bertsky commented Oct 22, 2019

bertsky commented Oct 26, 2019

kba commented Oct 26, 2019

kba commented Oct 26, 2019

bertsky commented Oct 26, 2019

finkf commented Oct 28, 2019

bertsky commented Oct 28, 2019

finkf commented Oct 28, 2019

bertsky commented Oct 28, 2019

finkf commented Oct 28, 2019

lgtm-com bot commented Oct 29, 2019

many more Ocropy improvements #17

many more Ocropy improvements #17

Conversation

bertsky commented Oct 22, 2019

bertsky commented Oct 26, 2019

kba commented Oct 26, 2019

kba commented Oct 26, 2019

bertsky commented Oct 26, 2019

finkf commented Oct 28, 2019

bertsky commented Oct 28, 2019

finkf commented Oct 28, 2019

bertsky commented Oct 28, 2019

finkf commented Oct 28, 2019

lgtm-com bot commented Oct 29, 2019