Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

INCOMPATIBLE fix to hOCR line height information - fixes #225. #226

Merged
merged 1 commit into from
Feb 16, 2016

Conversation

tfmorris
Copy link
Contributor

This fixes the duplicate line IDs caused by inserting height information
into the middle of the ID and it moves the line height info into
the title attribute like everything else, rather than using non-standard
HTML attributes (which won't validate).

This change may break consumers of the HTML output, but 3.04 has only
been in the wild for 6 months and the current HTML is invalid, so I
believe the benefit outweighs the cost for the fix.

…r#225.

This fixes the duplicate line IDs caused by inserting height information
into the middle of the ID and it moves the line height info into
the title attribute like everything else, rather than using non-standard
HTML attributes (which won't validate).

This change may break consumers of the HTML output, but 3.04 has only
been in the wild for 6 months and the current HTML is invalid, so I 
believe the benefit outweighs the cost for the fix.
zdenop added a commit that referenced this pull request Feb 16, 2016
INCOMPATIBLE fix to hOCR line height information - fixes #225.
@zdenop zdenop merged commit e028274 into tesseract-ocr:master Feb 16, 2016
@tfmorris tfmorris deleted the issue225 branch February 17, 2016 19:53
zvezdochiot pushed a commit to ImageProcessing-ElectronicPublications/tesseract that referenced this pull request Mar 28, 2021
INCOMPATIBLE fix to hOCR line height information - fixes tesseract-ocr#225.
zvezdochiot pushed a commit to ImageProcessing-ElectronicPublications/tesseract that referenced this pull request Mar 28, 2021
INCOMPATIBLE fix to hOCR line height information - fixes tesseract-ocr#225.
zvezdochiot pushed a commit to ImageProcessing-ElectronicPublications/tesseract that referenced this pull request Mar 28, 2021
INCOMPATIBLE fix to hOCR line height information - fixes tesseract-ocr#225.
zvezdochiot pushed a commit to ImageProcessing-ElectronicPublications/tesseract that referenced this pull request Mar 28, 2021
INCOMPATIBLE fix to hOCR line height information - fixes tesseract-ocr#225.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants