Skip to content

earthlingsoft/UnicodeChecker-UTF8Unshredder

Repository files navigation

UTF-8 Unshredder

UTF-8 Unshredder is a Utility for UnicodeChecker that attempts to fix the encoding problems of UTF-8 encoded text that has been read with the wrong encoding and saved as UTf-8 again.

You typically recognise such problems when seeing character combinations with letters like à where your non-ASCII roman characters should be.

Examples

  • Character: é
  • … encoded as UTF-8: C3 A9
  • … read as ISO Latin-1: é

If you end up with such garbled text, UTF-8 Unshredder will try to figure out which text encodings may have played a role in garbling the text and what the original string may have been. This way you can both reconstruct the original string and figure out how exactly it was broken.

Example with two iterations:

  • Garbled text: Grundgeb‚àö¬∫hr
  • … may have been accidentally processed as Mac OS Roman: Grundgeb√ºhr
  • … may have been accidentally processed as Mac OS Roman: Grundgebühr

Screenshots

UTF-8 Unshredder in UnicodeChecker’s Utility window:

![UTF-8 Unshredder in Utility window](Web/UTF-8 Unshredder Utility.png)

Encoding selection popup menu:

![Encoding popup menu](Web/UTF-8 Unshredder Encodings Menu.png)

Copyright & License

2008-2014 by Sven-S. Porst, earthlingsoft <[email protected]>

MIT Licensed

About

UnicodeChecker Utility to fix wrongly encoded UTF-8

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published