You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
_AZ="t%C3%AAte-%C3%A0-t%C3%AAte"
...
// __ $_AZ: |t%C3%AAte-%C3%A0-t%C3%AAte|
it should be: "tête-à-tête"
How do you make recode give you UTF-8 regardless of the input string (which encoding should be easy to figure out based on the patterns of the input string)?
The text was updated successfully, but these errors were encountered:
How do you make recode give you UTF-8 regardless of the input string (which encoding should be easy to figure out based on the patterns of the input string)?
I don't see any HTML character entities. Your examples look like URL escaping, not HTML character entities.
which encoding should be easy to figure out based on the patterns of the input string
Recode does not attempt to guess what encoding its input uses, it uses the encoding you tell it; you'd need another tool to guess encodings.
No, I don't think recode supports URL encoding. That would be a good thing to add.
rrthomas
changed the title
recode swallows fine unicode, but not plain HTML char entities for French characters?
Add URL encoding: https://en.wikipedia.org/wiki/URL_encoding
May 18, 2023
_AZ="激光, 這兩個字是甚麼意思"
_AZ=$(echo "${_AZ}" | recode html..utf-8)$_AZ: |$ {_AZ}|"
echo "// __
// __ $_AZ: |激光, 這兩個字是甚麼意思|
_AZ="t%C3%AAte-%C3%A0-t%C3%AAte"
...
// __ $_AZ: |t%C3%AAte-%C3%A0-t%C3%AAte|
it should be: "tête-à-tête"
How do you make recode give you UTF-8 regardless of the input string (which encoding should be easy to figure out based on the patterns of the input string)?
The text was updated successfully, but these errors were encountered: