more intuitive ID for output file, #26 #27

Closed
wants to merge 3 commits into from

kba
Member

kba commented Dec 6, 2018

Generate the output ID and filename from the input file ID reduced to its numbers.
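
A minimal sketch of what "reduced to its numbers" could mean, assuming the reduction simply keeps the decimal digits of the input file ID and joins them to the output file group with an underscore. The helper names `digits_only` and `reduced_output_id` are hypothetical and not taken from the patch.

```python
import re


def digits_only(input_id: str) -> str:
    """Return only the ASCII digits of an ID (hypothetical helper)."""
    return re.sub(r"[^0-9]", "", input_id)


def reduced_output_id(output_file_grp: str, input_id: str) -> str:
    """Build an output ID from the output group and the digits of the input ID.

    Assumption: the two parts are joined with an underscore.
    """
    return "%s_%s" % (output_file_grp, digits_only(input_id))


print(reduced_output_id("OCR-D-OCR-TESS", "OCR-D-IMG-BIN-TESS_1234"))
```

Under these assumptions the input ID OCR-D-IMG-BIN-TESS_1234 is reduced to 1234 before the output file group is prepended.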

@finkf

@kba
Member Author

kba commented Dec 6, 2018

Could this lead to problems

Absolutely, yes.

The alternative is to not change the ID at all and accept that it gets slightly long, e.g. OCR-D-OCR-TESS_OCR-D-IMG-BIN-TESS_1234.
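
To illustrate the trade-off, here is a hedged comparison of the two options. The input IDs PAGE_1_12 and PAGE_11_2 are made up for the example, and both helper functions are hypothetical. Keeping the full input ID yields longer but distinct output IDs, while a digits-only reduction can map different inputs to the same output ID.

```python
import re


def keep_full_id(output_file_grp: str, input_id: str) -> str:
    """Prefix the unchanged input ID with the output group (long but distinct)."""
    return "%s_%s" % (output_file_grp, input_id)


def reduce_to_digits(output_file_grp: str, input_id: str) -> str:
    """Prefix only the digits of the input ID (short, but may collide)."""
    return "%s_%s" % (output_file_grp, re.sub(r"[^0-9]", "", input_id))


# Made-up input IDs whose digits are identical once separators are dropped.
input_ids = ["PAGE_1_12", "PAGE_11_2"]

full_ids = [keep_full_id("OCR-D-OCR-TESS", i) for i in input_ids]
reduced_ids = [reduce_to_digits("OCR-D-OCR-TESS", i) for i in input_ids]

print(full_ids)     # two different IDs
print(reduced_ids)  # the same ID twice
```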

@finkf

finkf commented Dec 6, 2018

or maybe even better:

ID = concat_padded(self.output_file_grp, os.path.basename(input_file.url)[:-4])

Why generate IDs if output_file_grp + basename of the file without its extension is already unique?
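
A sketch of this suggestion, with two stated assumptions: a plain underscore join stands in for `concat_padded`, whose exact behaviour is not shown in this thread, and `os.path.splitext` replaces the `[:-4]` slice so that extensions of any length are dropped. The function name and the example paths are hypothetical.

```python
import os.path


def output_id_from_url(output_file_grp: str, url: str) -> str:
    """Combine the output group with the input file's basename, minus extension.

    Assumption: an underscore join approximates what concat_padded would do
    for a string argument.
    """
    # splitext removes whatever extension is present, not a fixed 4 characters
    stem = os.path.splitext(os.path.basename(url))[0]
    return "%s_%s" % (output_file_grp, stem)


print(output_id_from_url("OCR-D-OCR-TESS", "OCR-D-IMG-BIN/page_0001.tif"))
print(output_id_from_url("OCR-D-OCR-TESS", "OCR-D-IMG-BIN/page_0001.tiff"))
```

With the original `[:-4]` slice, a four-letter extension such as .tiff would leave a trailing dot in the ID, which is why the sketch uses `splitext` instead.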

@bertsky
Collaborator

bertsky commented Mar 4, 2019

I am very much in favour of the solution by @finkf, but I would also like to keep the .xml extension in the old version (because most PAGE viewers rely on it). The patch does not apply anymore, so should I make a new PR?

@bertsky
Collaborator

bertsky commented Jun 28, 2019

Closing as this is superseded (and hopefully resolved to satisfaction) by #48.

bertsky closed this Jun 28, 2019