Regression in string readuntil benchmark #50615

gbaraldi · 2023-07-20T21:47:55Z

run(BaseBenchmarks.SUITE[["string", "readuntil", "target length 2"]]) regressed in #48273: it gained some allocations and became 2x slower. cc @stevengj


stevengj · 2023-11-28T16:52:33Z

This is apparently benchmarking:

buffer = IOBuffer("A" ^ 50000)

g = addgroup!(SUITE, "readuntil")
for len in (1, 2, 1000, 50000)
    g["target length $len"] = @benchmarkable readuntil(seekstart($buffer), $("A" ^ len))
end
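For anyone who wants to poke at this without the BaseBenchmarks harness, a rough stand-alone sketch using only Base might look like the following. The helper name `measure_readuntil` and the repetition count are made up for illustration, and timings from `@elapsed` are much noisier than what BenchmarkTools reports, so treat the numbers as indicative only.

```julia
# Rough reproduction of the "target length 2" case using only Base.
# `measure_readuntil` is a hypothetical helper, not part of BaseBenchmarks.
function measure_readuntil(buffer::IOBuffer, target::String; reps::Int = 10_000)
    readuntil(seekstart(buffer), target)   # warm up so compilation is not measured
    bytes = @allocated readuntil(seekstart(buffer), target)
    secs = @elapsed for _ in 1:reps
        readuntil(seekstart(buffer), target)
    end
    return (bytes_per_call = bytes, seconds_per_call = secs / reps)
end

buffer = IOBuffer("A" ^ 50000)

# The delimiter "AA" matches at the very start of the data, so the text
# returned before the delimiter is empty.
result = readuntil(seekstart(buffer), "A" ^ 2)

stats = measure_readuntil(buffer, "A" ^ 2)
println(stats)
```

Running this on the Julia versions before and after #48273 should show whether the extra allocations appear in `bytes_per_call`.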

Reading 2-byte strings from an IOBuffer probably got slower because of this diff, which traded worse performance when reading extremely short lines for better performance when reading lines of 20 characters or more. (See the comment "A single loop is 2x faster for nout=5.") Actually, that portion of the PR shouldn't be relevant here, because it applies only to delim::UInt8, whereas this benchmark uses a string delimiter, which goes through a completely different part of the codebase.

For string delimiters, one change in that PR is that it changed the initial allocated size to 70 bytes, to match the initial allocation in other parts of the code: https://github.com/JuliaLang/julia/pull/48273/files#diff-74f71402c994a78b21ded0ac040485d12bfc27e9ffaf538fa0f71f6d284e2991R1038 — maybe that is slowing us down for reading 2-byte strings? Someone could try changing that line back to StringVector(0) to see if it helps. (Of course, that might slow things down for reading longer strings.)

(In general, my feeling is that if you care about performance for reading lots of very short strings, you are better off using the new functionality with something like StringViews.jl to read the strings in-place into a re-used buffer.)
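A minimal sketch of that re-used-buffer pattern, assuming "the new functionality" refers to `copyuntil(out, stream, delim)` (available from Julia 1.11), could look like this. The function name `count_fields` and the sample input are hypothetical. The sketch only counts fields and bytes; it does not build a string per field.

```julia
# Sketch: read many short delimited fields into one re-used buffer.
# Assumes Julia >= 1.11, where `copyuntil(out, stream, delim)` is available.
# `count_fields` is a hypothetical helper written for this illustration.
function count_fields(io::IO, delim::AbstractString)
    out = IOBuffer()            # single output buffer, re-used for every field
    nfields = 0
    nbytes = 0
    while !eof(io)
        truncate(out, 0)        # discard the previous field, keep the storage
        copyuntil(out, io, delim)   # delimiter is dropped (keep = false)
        nbytes += position(out) # bytes written for this field
        nfields += 1
    end
    return nfields, nbytes
end

io = IOBuffer("ab,cd,ef,gh")
nfields, nbytes = count_fields(io, ",")
println((nfields, nbytes))
```

To inspect each field as a string without allocating, the bytes in `out` could be wrapped in a view type such as the one StringViews.jl provides. That step is left out here to keep the sketch dependency-free.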

gbaraldi added the regression Regression in behavior compared to a previous version label Jul 20, 2023

brenhinkeller added the performance Must go faster label Aug 6, 2023

KristofferC mentioned this issue Oct 28, 2024

Julia v1.11.1 regression: open(::Cmd) is 4X slower than v1.10.4 #56352

Closed
