You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Could you please add the Sloane Lab HTR Model to the HTR United repository?
Many thanks and best wishes
Marco
Here is our dataset YAML file:
schema: https://htr-united.github.io/schema/2023-06-27/schema.jsontitle: The Sloane Lab HTR Modelurl: https://github.com/sloanelab-org/HTR-Modelauthors:
- name: Marcosurname: Humbelorcid: 0000-0003-1861-162Xroles:
- aligner
- name: 'Andreas 'surname: Vlachidisroles:
- project-manager
- name: 'Julianne 'surname: Nyhanroles:
- project-manager
- name: 'The British Museum 'surname: ''roles:
- digitizationinstitutions:
- name: AEL Data Serviceroles:
- transcriberdescription: > This repository contains Handwritten Text Recognition training data (layout segmentation and transcriptions ) for the Sloane Lab HTR model. The HTR model is trained on the handwriting of Hans Sloane (1660-1753). Funding:
Enlightenment Architectures: Leverhulme Trust Project Grant 2016-21The Sloane Lab: Towards a National Collection – AHRC AH/W003457/1project-name: 'The Sloane Lab: Looking back to build future shared collections'project-website: https://sloanelab.org/language:
- engproduction-software: Transkribusautomatically-aligned: falsescript:
- iso: Latnscript-type: only-manuscripttime:
notBefore: '1680'notAfter: '1750'hands:
count: less-than-11precision: estimatedlicense:
name: CC BY-NC-SA 4.0url: https://creativecommons.org/licenses/by-nc-sa/4.0/deed.enformat: Alto-XMLsources:
- reference: >- Sloan, K., Ortolja-Baird, A., Nyhan, J., Pickering, V., & Fleming, M. (Eds.). (2019). Sir Hans Sloane’s Miscellanea which comprises his catalogues of Miscellanies, Antiquities, Seals, Pictures, Mathematical Instruments, Agate Handles and Agate Cups, Bottles, Spoons (Digital Edition). link: >- https://enlightenmentarchitectures.reconstructingsloane.org/cataloguemiscellanies/index.htmlvolume:
- metric: pagescount: 196citation-file-link: https://github.com/sloanelab-org/HTR-Model/blob/main/Citation_SL_HTR_Model.cff
The text was updated successfully, but these errors were encountered:
Hello Marco, I'm sorry for responding only now, I missed your issue.
Based on the documents available in the dataset repository, I suggest adding the following elements:
transcription-guidelines: >-
Transcription rules can be found alongside the dataset. They include the
following rules:
- Exclusion of overwritten text from training data
- Exclusion of text not identified by the automated layout recognition
- Exclusion of faded text
- Inserted words are treated as separate text lines
- Exclusion of textual features such as dotted lines
- Base line separation for text written apart
I already added them in the pull request I opened. Is that ok?
Hello,
Could you please add the Sloane Lab HTR Model to the HTR United repository?
Many thanks and best wishes
Marco
Here is our dataset YAML file:
The text was updated successfully, but these errors were encountered: