-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow shape-element usage (IMPACT) #22
Comments
Changed xsd:ints to xsd:floats |
Examples of situations where polygons are used: The text line "Politiek behandelt buitenlanders teveel als taboe" that is also the title of the article must be described as a polygon on text line level. This page has photographs that are printed on top of each other. Use a polygon on block level to describe the shape of the photo. First line of the poem has a character 'e' that stretches across two lines. Suggest to make a polygon of the textline and the string. When bounding boxes are used in ghotic text, the bounding box of the lower text line many times includes bits and pieces of the text of the text line above. To avoid this use polygons to describe the shape of the textlines. The text blokcs on this page that are shaped around the picture can be described as polygons. |
Jean Philippe has an excellent description of the issues around languages at https://github.com/altoxml/schema/blob/master/v3/discussion%20of%20ALTO%20language%20support.pdf. |
Accidentally closed the issue. This is the document, about elements for textdirection in different standards, we discussed at the last meeting: https://github.com/altoxml/schema/blob/master/v3/Comparison%20of%20text%20direction%20elements.pdf |
This is a summary of the language support discussion so far. (Also part of the minutes of 2015-04-08) |
The changes necessary to implement issue 22 are added to the 3.1 draft version https://github.com/altoxml/schema/blob/master/v3/alto-3-1.draft.xsd |
On the version 3-1 the differences / changes to original version are not visible. |
As we agreed on in the last meeting, I have added code examples on the documentation page as much as I had time for. |
ACCEPT |
1 similar comment
ACCEPT |
ACCEPT Hi Evelien, For "Headline on more than one row" and "Wrapped text" would be good to have the sample code as well. If you provide the images I could process with docWorks and then adapt the shape into it. |
ACCEPT |
1 similar comment
ACCEPT |
in the meeting of oktober 14 there were no comment on the shape issue. The only concern came from Joachim. He wondered about the inheritance of the rotation characteristic. Is it done conform inheritance of other attributes in the ALTO. So far the two attributes that are most like rotation are LANG and STYLE.
|
ACCEPT |
Accept |
1 similar comment
Accept |
Request for public comment was made on the Impact Centre of Competence website on december 3 2015: http://www.digitisation.eu/blog/input-for-alto-3-1-version-request/ |
This issue is implemented in version 3.1 of the schema, which is now officially released. |
Submitter: IMPACT
Submitted: 2013-02
Use Case
ALTO 2.0 uses four attributes (HEIGHT, WIDTH,HPOS, VPOS) to describe the location and size of a text line. These coordinates do not describe the text line as such, but a bounding box around the text line. This box is always a rectangle.
For analysis purposes the shape need be described more precisely and coordinate information must not just be limited to text lines. The shape should also be recorded for individual characters, words, text lines, blocks and the print space as such.
Implementation
The element should store either polygon, rectangle, ellipse or circle. There must not be a sibling element for the same parent. The element is optional. A element can be added to the following parent elements:
The
<shape>
element can only have a single child element. This child element describes the type of shape with the exact coordinates. All coordinates are expressed as float values. The following shape types are supported by the appropriate elements:<Polygon>
<Circle>
<Ellipse>
For backward compatibility, rectangles will continue to be expressed using the existing HEIGHT, WIDTH, HPOS and VPOS attributes on all blocktypes that can carry the new Shape element. However, their use will be changed to optional, if a Shape element exists. Each Shape child (Polygon, etc.) will contain it's coordinates (required).
Changes
current (ALTO 3.0)
proposed
current
proposed
current
proposed
current
proposed
current
proposed
current
proposed
The text was updated successfully, but these errors were encountered: