Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding dataset GOM_OCR_GT #169

Open
yanishk opened this issue Dec 19, 2024 · 0 comments
Open

Adding dataset GOM_OCR_GT #169

yanishk opened this issue Dec 19, 2024 · 0 comments

Comments

@yanishk
Copy link

yanishk commented Dec 19, 2024

Hello !

Here is our dataset YAML file:

schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: GOM_OCR_GT
url: https://github.com/kodda/GOM_OCR_GT
authors:
 - name: David
   surname: Colliaux
   roles:
     - project-manager
     - quality-control
     - digitization
 - name: Yanis
   surname: Harkouk
   roles:
     - aligner
     - quality-control
institutions: []
description: >-
 19th Century Manuals on Parisian Market Gardeners

 This dataset contains 10 pages (from p.30 to p.39) selected from each of 17
 manuals, for a total of 170 pages, all of which describe the practices of
 Parisian market gardeners in the 19th century.
project-name: GOM - The Good Old Manuals Project
project-website: https://sonycslparis.github.io/gom-webapp/#/project
language:
 - fra
production-software: Kraken
automatically-aligned: false
script:
 - iso: Latn
script-type: only-manuscript
time:
 notBefore: '1800'
 notAfter: '1920'
hands:
 count: more-than-10
 precision: estimated
license:
 name: CC-BY 4.0
 url: https://creativecommons.org/licenses/by/4.0/
format: Image-Text-Pairs
volume:
 - metric: pages
   count: 170
transcription-guidelines: >-
 Chapter titles and page numbers have been excluded from the transcription to
 focus solely on the content.

 Line breaks have also been excluded from the transcription.

 Image captions and footnotes have been included in the transcription, even if
 they appear in the middle of a sentence, to ensure all relevant information is
 captured.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant