-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
readall() silently produces invalid UTF8Strings. #5977
Comments
I don't see how |
Notice the I hadn't tried |
I still don't understand:
|
@pygy Do you have a test vector for this? I suspect that might help. |
Indeed, my bad. I hadn't tested I've edited the bug report. Back to the point, how about: abstract Bytestring
immutable (UTF8 | ASCII | Raw)String <: ByteString
data::Array{UInt8,1}
end It would allow the hypothetical |
Ok, that's a relief. I think I'd prefer the |
I edited my former post while you were posting, but I agree with you about Having
|
closing as dup of #1792. |
Probably related to #1792.
txt = readall(foo)
always produces UTF8Strings, with no regards for the incoming byte sequence. It can result in invalid strings (is_valid_utf8(txt)
fails).Having a
RawString
type, and a package that deals with non-(UTF-8/ASCII) encodings may help.The text was updated successfully, but these errors were encountered: