Option to un-escape characters #180

Open
aubertc opened this issue Mar 14, 2022 · 4 comments

Comments

@aubertc

aubertc commented Mar 14, 2022

Right now, the following options are offered:

  --escape, --no-escape
      Escape special characters, such as umlaut. This ensures correct typesetting
      with latex. Enabled by default.

Would it be possible to add a third option, "--unescape", that replaces e.g. {\"{u}} with ü?

As far as I can tell, incompatibilities in character encoding are a matter of the past (ish), so going the other way around would be nice too.
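
To make the request concrete, here is a minimal sketch of what such an unescape step might do. It is not the tool's implementation: the `unescape` function, the `COMBINING` table, and the regular expression are all hypothetical names chosen for illustration. It assumes only the five accent commands listed and single ASCII letters, so forms such as `\'{\i}` or `\c{c}` are left untouched.

```python
import re
import unicodedata

# Hypothetical table: LaTeX accent command -> Unicode combining character.
# Only a handful of accents are covered; this is illustrative, not exhaustive.
COMBINING = {
    '"': "\u0308",  # diaeresis (umlaut)
    "'": "\u0301",  # acute
    "`": "\u0300",  # grave
    "^": "\u0302",  # circumflex
    "~": "\u0303",  # tilde
}

# An accent command followed by a letter, with or without braces: \"{u} or \"u
_INNER = r"""\\(["'`^~])(?:\{([A-Za-z])\}|([A-Za-z]))"""

# Either that form wrapped in one extra pair of braces, e.g. {\"{u}}, or bare.
_ACCENT = re.compile(r"\{" + _INNER + r"\}|" + _INNER)


def _replace(match):
    # Exactly one accent group and one letter group take part in any match.
    accent, letter = [g for g in match.groups() if g is not None]
    # Compose letter + combining mark into a single code point where one exists.
    return unicodedata.normalize("NFC", letter + COMBINING[accent])


def unescape(text):
    """Replace simple LaTeX accent sequences with Unicode characters."""
    return _ACCENT.sub(_replace, text)


if __name__ == "__main__":
    print(unescape(r'M{\"{u}}ller'))  # Müller
    print(unescape(r"Caf\'e"))        # Café
```

Composing through combining characters and NFC normalization keeps the table small, at the cost of relying on Unicode having a precomposed form for each letter and accent pair; a real implementation would more likely invert whatever table the existing `--escape` option already uses.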

@FlamingTempura
Owner

> As far as I can tell, incompatibilities in character encoding are a matter of the past (ish), so going the other way around would be nice too.

Do you know any more about this? This answer on Stack Exchange suggests BibTeX is not Unicode-aware, but the other answer suggests support can be added using \usepackage[utf8]{inputenc}.

@aubertc
Author

aubertc commented Mar 15, 2022

Thanks for your comment. The answer you are linking to dates from 2009. Most of the sources I can find date back to ~2010 and highlight issues, but since no recent post that I could find lists encoding as a problem, I suspect it "went away" thanks to better Unicode support.

Further evidence:

> biber handles us-ascii, 8-bit encodings such as Latin 1, and utf-8. It features true Unicode support and is capable of reencoding the bib data on the fly in a robust way.

Unfortunately, I could not find an authoritative answer…

@FlamingTempura
Owner

> Unfortunately, I could not find an authoritative answer…

As is the way with bibtex!

Thanks for checking. I can certainly see unescape as a useful option (particularly for biber/biblatex). I will try to find some time to work on it or will be happy to accept a PR.

@pedropaulofb

pedropaulofb commented May 17, 2022

I just came here to suggest the same feature proposed by @aubertc. I am using BibTeX files to generate CSV data for later publishing, so my goal is to make the resulting data easier to read and search, not to stay compliant with LaTeX. In this case an --unescape feature would be perfect for me!

Thank you @FlamingTempura for considering it.
