Remove temporary ByteArrayHelpers and use Span.Equals #27654

ahsonkhan · 2018-03-02T15:14:51Z

The issue - https://github.com/dotnet/corefx/issues/21395 - has been resolved.

ahsonkhan · 2018-03-02T15:36:10Z

src/System.Net.Http/src/System/Net/Http/ByteArrayHelpers.cs

-    {
-        // TODO #21395:
-        // Replace with the MemoryExtensions implementation of Equals once it's available
-        internal static bool EqualsOrdinalAsciiIgnoreCase(string left, ReadOnlySpan<byte> right)


Any perf concerns with using the ReadOnlySpan<char> overload available now that will have the "IsAscii" check per iteration which is unnecessary in this case - https://github.com/dotnet/coreclr/blob/47c39edc2fbfbfbddca13ee1a47699ecaaaee204/src/mscorlib/shared/System/Globalization/CompareInfo.cs#L571?

I don't see how the changes are valid. Casting a span of ASCII bytes to a span of chars will misinterpret them. Does this pass our tests?

Yep, you are definitely right. Here, two bytes would be considered one character rather than individual characters. Tests don't pass. We have to pad the byte span to turn it into a char span.

You'd either need to Widen the bytes to short/char and compare
https://github.com/aspnet/KestrelHttpServer/blob/300453396a57cd14bcb297627b1507de83dc88ab/src/Kestrel.Core/Internal/Infrastructure/StringUtilities.cs#L12-L109
Or Narrow the string chars to bytes; which is probably dodgier (as would be discarding high byte)

Can vectorize the casing with something like (not a very good version)

public static void LowerCaseSIMD(byte[] data) { var A = new Vector<byte>(65); // A var Z = new Vector<byte>(90); // Z for (var o = 0; o < data.Length - Vector<byte>.Count; o += Vector<byte>.Count) { var v = new Vector<byte>(data, o); v = Vector.ConditionalSelect( Vector.BitwiseAnd( Vector.GreaterThanOrEqual(v, A), Vector.LessThanOrEqual(v, Z) ), Vector.BitwiseOr(new Vector<byte>(0x20), v), // 0010 0000 v ); // Now Vector.Widen

I was trying to see if there was some char to byte conversion happening up-stack, but the data is coming in as bytes... http://source.dot.net/#System.Net.Http/System/Net/Http/Managed/HttpConnection.cs,354

If we "Widen", won't we need to allocate a 2x temporary buffer?

If we "Widen", won't we need to allocate a 2x temporary buffer?

For the the vector part its two local Vector<short> variables; for shorter a long, int or char; no buffers needed

ahsonkhan · 2018-03-02T17:41:24Z

As an aside, @stephentoub, this leftover TODO can be cleaned up now, correct?
http://source.dot.net/#System.Net.Http/System/Net/Http/Managed/HttpConnection.cs,565

This issue - dotnet/roslyn#17287 - got resolved.

geoffkizer · 2018-03-02T17:56:44Z

The better approach here (at least for TryGetKnownHeader) is to precalculate the header names as Span<byte> instead of string, then do the compare -- so both operands are Span<byte>. We actually sorta have this precalculation today, we're just not using it. I think we filed an issue on this somewhere.

We still need to do a case-insensitive compare here, so we can't just use SequenceEqual. But perhaps we could do some clever vectorization as per @benaadams suggestion above. And we can guarantee the precalculated header name bytes are all lowercase already.

Or we could do something slightly different: Header names usually use consistent casing (i.e. Content-Length, not CONTENT-LENGTH) so we could do a SequenceEqual on the expected casing first, which will typically succeed, and fall back to a more expensive case-insensitive compare if that fails.

benaadams · 2018-03-02T18:01:09Z

The better approach here (at least for TryGetKnownHeader) is to precalculate the header names as Span instead of string, then do the compare

Store a byte[] version of the known headers and use that rather than the string for compare? Makes sense, half the data to compare also.

geoffkizer · 2018-03-02T18:04:18Z

Yeah, exactly. GitHub ate the <byte> part of Span<byte> in my original comment. Edited it for clarity.

stephentoub · 2018-03-02T18:04:33Z

Store a byte[] version of the known headers

We do; that's what @geoffkizer meant when he said "We actually sorta have this precalculation today, we're just not using it". If we have a case-insensitive comparison that works on bytes, then we can use that; otherwise let's just stick with what we have.

stephentoub · 2018-03-02T18:14:40Z

As an aside, @stephentoub , this leftover TODO can be cleaned up now, correct?
http://source.dot.net/#System.Net.Http/System/Net/Http/Managed/HttpConnection.cs,565 This issue - dotnet/roslyn#17287 - got resolved.

No. It just needs a different issue number now:
dotnet/csharplang#1331

stephentoub · 2018-03-02T18:16:25Z

No. It just needs a different issue number now:

Actually, it was already updated; the code you linked to is stale.

stephentoub · 2018-03-06T22:57:59Z

@ahsonkhan, at this point is there more to do here other than just delete the TODO comment?

ahsonkhan · 2018-03-07T00:01:56Z

at this point is there more to do here other than just delete the TODO comment?

Yes, if we decide to keep the internal ByteArrayHelpers method. In which case, I think we can close this PR.

stephentoub · 2018-03-07T02:37:56Z

Yes, if we decide to keep the internal ByteArrayHelpers method. In which case, I think we can close this PR.

If we find a good way to avoid it in the future, I'd be happy to see it go away. In the meantime, though, I don't think we have anything better / shared we can use instead.

stephentoub · 2018-03-07T02:38:09Z

Thanks, though!

Remove temporary ByteArrayHelpers and use Span.Equals

d934588

ahsonkhan added the area-System.Net.Http label Mar 2, 2018

ahsonkhan self-assigned this Mar 2, 2018

ahsonkhan commented Mar 2, 2018

View reviewed changes

stephentoub closed this Mar 7, 2018

karelz added this to the 2.1.0 milestone Mar 10, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove temporary ByteArrayHelpers and use Span.Equals #27654

Remove temporary ByteArrayHelpers and use Span.Equals #27654

ahsonkhan commented Mar 2, 2018 •

edited

Loading

ahsonkhan Mar 2, 2018 •

edited

Loading

stephentoub Mar 2, 2018 •

edited

Loading

ahsonkhan Mar 2, 2018 •

edited

Loading

benaadams Mar 2, 2018

benaadams Mar 2, 2018 •

edited

Loading

ahsonkhan Mar 2, 2018

benaadams Mar 2, 2018

ahsonkhan commented Mar 2, 2018

geoffkizer commented Mar 2, 2018 •

edited

Loading

benaadams commented Mar 2, 2018

geoffkizer commented Mar 2, 2018

stephentoub commented Mar 2, 2018

stephentoub commented Mar 2, 2018 •

edited

Loading

stephentoub commented Mar 2, 2018

stephentoub commented Mar 6, 2018

ahsonkhan commented Mar 7, 2018

stephentoub commented Mar 7, 2018

stephentoub commented Mar 7, 2018

Remove temporary ByteArrayHelpers and use Span.Equals #27654

Remove temporary ByteArrayHelpers and use Span.Equals #27654

Conversation

ahsonkhan commented Mar 2, 2018 • edited Loading

ahsonkhan Mar 2, 2018 • edited Loading

Choose a reason for hiding this comment

stephentoub Mar 2, 2018 • edited Loading

Choose a reason for hiding this comment

ahsonkhan Mar 2, 2018 • edited Loading

Choose a reason for hiding this comment

benaadams Mar 2, 2018

Choose a reason for hiding this comment

benaadams Mar 2, 2018 • edited Loading

Choose a reason for hiding this comment

ahsonkhan Mar 2, 2018

Choose a reason for hiding this comment

benaadams Mar 2, 2018

Choose a reason for hiding this comment

ahsonkhan commented Mar 2, 2018

geoffkizer commented Mar 2, 2018 • edited Loading

benaadams commented Mar 2, 2018

geoffkizer commented Mar 2, 2018

stephentoub commented Mar 2, 2018

stephentoub commented Mar 2, 2018 • edited Loading

stephentoub commented Mar 2, 2018

stephentoub commented Mar 6, 2018

ahsonkhan commented Mar 7, 2018

stephentoub commented Mar 7, 2018

stephentoub commented Mar 7, 2018

ahsonkhan commented Mar 2, 2018 •

edited

Loading

ahsonkhan Mar 2, 2018 •

edited

Loading

stephentoub Mar 2, 2018 •

edited

Loading

ahsonkhan Mar 2, 2018 •

edited

Loading

benaadams Mar 2, 2018 •

edited

Loading

geoffkizer commented Mar 2, 2018 •

edited

Loading

stephentoub commented Mar 2, 2018 •

edited

Loading