-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
better documentation for string reversal #23793
Conversation
"êxa" | ||
|
||
julia> join(reverse(collect(graphemes("ax̂e")))) # reverses graphemes | ||
"ex̂a" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
base/strings/types.jl
Outdated
regex (regular expression) searches. See also [`reverseind`](@ref) to convert indices | ||
in `s` to indices in `reverse(s)` and vice-versa, and [`graphemes`](@ref) | ||
to operate on user-visible "characters" (graphemes) rather than codepoints | ||
(which is more [visually pleasing but less useful](https://github.com/JuliaLang/julia/issues/6165) for reverse searching). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wouldn't refer to GitHub issues in the official docs. Anyway people are unlikely to use graphemes
unless they have a reason to, given that it's easier to iterate over/reverse codepoints.
base/strings/types.jl
Outdated
@@ -117,10 +117,23 @@ end | |||
|
|||
Reverses a string. | |||
|
|||
Technically, this function reverses the "codepoints" in a string, and its | |||
main utility is for reversed-order string processing, especially for reversed | |||
regex (regular expression) searches. See also [`reverseind`](@ref) to convert indices |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just "regular expression"? Also, which use quotes around "codepoints"?
base/strings/types.jl
Outdated
@@ -131,7 +144,7 @@ reverse(s::RevString) = s.string | |||
""" | |||
reverseind(v, i) | |||
|
|||
Given an index `i` in `reverse(v)`, return the corresponding index in `v` so that | |||
Given an index `i` in [`reverse(v)`](@ref), return the corresponding index in `v` so that | |||
`v[reverseind(v,i)] == reverse(v)[i]`. (This can be nontrivial in the case where `v` is a | |||
Unicode string.) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
BTW, "Unicode string" is a weird terminology since now all strings support Unicode. Maybe just remove that sentence? Or say that it's nontrivial because of non-ASCII characters.
In looking back at #6165 (due to #23612), I realized that we never clearly documented the behavior or rationale.