refactor: vortex-buffer #1742

gatesn · 2024-12-20T23:28:57Z

Just a small one for y'all.

vortex-buffer/src/lib.rs is a decent starting point for this change.

This PR introduces Buffer<T> and BufferMut<T> as (im)mutable, zero-copy, runtime-aligned buffers.
You can think of BufferMut as a Vec, but every time it re-allocates to resize, it ensures the new buffer remains aligned as requested.

This is neat. Unlike Vec<T>, it allows you to build buffers with much larger alignment, for example always 128 bytes for FastLanes, or 4096/8192 for page-aligned buffers.
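To make the "Vec that stays aligned when it re-allocates" idea concrete, here is a minimal std-only sketch. It is an illustration under stated assumptions, not the vortex-buffer implementation (which wraps Bytes and BytesMut): the type `AlignedBytesMut` and the helper `padding_for` are hypothetical names, and the approach shown is simple over-allocation plus padding.

```rust
/// Hypothetical illustration only; NOT the vortex-buffer implementation.
/// A growable byte buffer whose contents always start at an address that is
/// a multiple of `align`, even after it re-allocates.
pub struct AlignedBytesMut {
    storage: Vec<u8>, // over-allocated backing store, padding included
    offset: usize,    // number of padding bytes before the aligned start
    align: usize,     // requested alignment in bytes (power of two)
}

/// Number of padding bytes needed to round `addr` up to a multiple of `align`.
fn padding_for(addr: usize, align: usize) -> usize {
    (align - (addr % align)) % align
}

impl AlignedBytesMut {
    pub fn with_capacity(capacity: usize, align: usize) -> Self {
        assert!(align.is_power_of_two(), "alignment must be a power of two");
        let (storage, offset) = Self::allocate(capacity, align);
        Self { storage, offset, align }
    }

    /// Allocate room for `capacity` bytes plus worst-case padding, then fill
    /// the padding so the next byte written lands on an aligned address.
    fn allocate(capacity: usize, align: usize) -> (Vec<u8>, usize) {
        let mut storage: Vec<u8> = Vec::with_capacity(capacity + align);
        let offset = padding_for(storage.as_ptr() as usize, align);
        storage.resize(offset, 0); // stays within capacity, so the pointer is stable
        (storage, offset)
    }

    pub fn extend_from_slice(&mut self, data: &[u8]) {
        if self.storage.len() + data.len() > self.storage.capacity() {
            // Letting the Vec grow would move the allocation to an address with
            // no alignment guarantee, so copy into a fresh, re-aligned one.
            let new_capacity = (self.len() + data.len()).max(self.len() * 2);
            let (mut storage, offset) = Self::allocate(new_capacity, self.align);
            storage.extend_from_slice(self.as_slice());
            self.storage = storage;
            self.offset = offset;
        }
        self.storage.extend_from_slice(data);
    }

    /// Number of payload bytes (padding excluded).
    pub fn len(&self) -> usize {
        self.storage.len() - self.offset
    }

    /// The payload bytes, starting at the aligned address.
    pub fn as_slice(&self) -> &[u8] {
        &self.storage[self.offset..]
    }
}
```

With this sketch, `AlignedBytesMut::with_capacity(4, 128)` can be extended many times and `as_slice().as_ptr()` stays a multiple of 128 after every growth step.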

Limitation: no zero-copy from `Vec<T>`

The implementation essentially wraps Bytes and BytesMut to provide this functionality. As a result, there is one major, annoying limitation: we cannot go from Vec<T> to Buffer<T> and back to Vec<T> with zero-copy. Bytes::from_owner is able to take an externally allocated buffer (e.g. a Vec<T>), but it will not give it back to you.

I've decided to disallow going from Vec<T> to Buffer<T> entirely, to avoid this odd performance foot-gun. The downside is that I force a copy via Buffer::<T>::copy_from_vec, whereas if we did allow conversion from Vec<T>, we would only pay for the copy when we tried to turn it into a BufferMut<T>. So... not sure.

We can see an example of this when encoding ALP arrays. The result our library returns is a Vec<T>, so we are forced to copy. One way around this is to allow zero-copy into Buffer<T>; the other is to make the ALP library generic over V: Default + Extend<T>, so the library can initialize and append to a collection of elements in some arbitrary container type.
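A rough sketch of what the second option could look like. The function `encode_scaled` and its scaling logic are hypothetical stand-ins (this is not the ALP algorithm or the ALP library's API); the point is only the `V: Default + Extend<T>` bound, which lets the caller pick the output container.

```rust
/// Hypothetical encoder used only to illustrate the trait bound.
/// The scaling below is NOT the ALP algorithm.
pub fn encode_scaled<V>(values: &[f64], scale: f64) -> V
where
    V: Default + Extend<i64>,
{
    // The library creates the container itself...
    let mut out = V::default();
    // ...and appends encoded elements without knowing the concrete type.
    out.extend(values.iter().map(|v| (v * scale).round() as i64));
    out
}
```

A caller could request a `Vec<i64>` today, and an aligned buffer type could be plugged in later, provided it implements `Default` and `Extend<i64>`, with no intermediate Vec to copy out of.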

Perhaps an argument for allowing From<Vec<T>> is that we would rarely benefit from into_mut for arrays constructed by hand by users in-memory (in-memory arrays constructed by us should use Buffer<T>). And any arrays loaded from disk would be loaded into a Buffer<T>...

Wire Break

This PR adds an alignment: u16 field to every flatbuffer Array's buffers. This lets us know the desired alignment for a given buffer deep within the I/O stack, helping to avoid copies later on.
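As a sketch of how a reader might use such a field: once the desired alignment is known, it can check whether the bytes it already holds satisfy it and only copy when they do not. The helper `is_aligned` below is a hypothetical name, not part of the PR.

```rust
/// Hypothetical helper: returns true if `bytes` starts at an address that is a
/// multiple of `alignment` (a power of two, carried as a u16 like the new field).
pub fn is_aligned(bytes: &[u8], alignment: u16) -> bool {
    assert!(alignment.is_power_of_two(), "alignment must be a power of two");
    (bytes.as_ptr() as usize) % (alignment as usize) == 0
}
```

A buffer that passes the check can be used in place; one that fails would need to be copied into an allocation with the requested alignment.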

encodings/bytebool/src/array.rs

lwwmanning · 2024-12-26T13:49:52Z

encodings/bytebool/src/compute.rs


        let arr = match validity {
            Validity::AllValid | Validity::NonNullable => {
                let bools = match_each_integer_ptype!(indices.ptype(), |$I| {
-                    indices.maybe_null_slice::<$I>()
+                    indices.as_slice::<$I>()


IMO, the name maybe_null_slice was better. It seems too easy for a caller to mistakenly expect as_slice to return the semantic values rather than a slice over the underlying buffer that ignores validity.
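A small sketch of the distinction being drawn here, using a hypothetical `NullableInts` type (not a Vortex type): the raw slice exposes whatever sits in the buffer at null positions, while a validity-aware iterator yields the semantic values.

```rust
/// Hypothetical nullable array: raw values plus a per-element validity flag.
pub struct NullableInts {
    values: Vec<i32>,
    validity: Vec<bool>,
}

impl NullableInts {
    pub fn new(values: Vec<i32>, validity: Vec<bool>) -> Self {
        assert_eq!(values.len(), validity.len(), "one validity flag per value");
        Self { values, validity }
    }

    /// Raw view over the buffer; entries at null positions are arbitrary.
    pub fn maybe_null_slice(&self) -> &[i32] {
        &self.values
    }

    /// Semantic view: `None` wherever the validity flag is false.
    pub fn iter(&self) -> impl Iterator<Item = Option<i32>> + '_ {
        self.values
            .iter()
            .zip(&self.validity)
            .map(|(v, ok)| if *ok { Some(*v) } else { None })
    }
}
```

For values `[1, 0, 3]` with validity `[true, false, true]`, the raw slice still contains the `0`, whereas the iterator yields `Some(1), None, Some(3)`.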

encodings/fastlanes/src/bitpacking/compress.rs

encodings/fastlanes/src/bitpacking/mod.rs

lwwmanning · 2024-12-26T16:56:07Z

vortex-flatbuffers/src/lib.rs

-    /// Write the flatbuffer into a [`Buffer`].
-    fn write_flatbuffer_bytes(&self) -> Buffer;
+    /// Write the flatbuffer into a [`Bytes`].
+    fn write_flatbuffer_bytes(&self) -> Bytes;


should this also write into the FlatBuffer alias of ConstBuffer?

I don't know...

FlatBuffers don't have any prescribed alignment, so long as the individual values are sufficiently aligned, e.g. if you store an int64 it must be 8-byte aligned.

So the vec that comes out of FlatBufferBuilder might not be 8 byte aligned. We could force it to be (with a possible copy), but the flatbuffer isn't wrong or broken. So it seems an odd API to force the copy when it may not be necessary.

Does that make sense?

lwwmanning · 2024-12-26T16:57:49Z

vortex-io/src/object_store.rs

@@ -92,7 +92,7 @@ impl VortexReadAt for ObjectStoreReadAt {
            //  it's coming from a network stream. Internally they optimize the File implementation
            //  to only perform a single allocation when calling `.bytes().await`, which we
            //  replicate here by emitting the contents directly into our aligned buffer.
-            let mut buffer = Vec::with_capacity(len);
+            let mut buffer = BytesMut::with_capacity(len);


we should put this into an aligned thing, no?

See my comment about bytes::BufMut. Vortex IO can be built almost entirely around the bytes crate. vortex-buffer provides an implementation of BufMut trait that internally preserves alignment. So the caller can choose to pass in a BytesMut or a BufferMut and get the behavior they want.
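The shape of that design, sketched with std only. `ByteSink` is a hypothetical stand-in for the role bytes::BufMut plays (it is not the real trait), and `Tally` is a made-up second sink; the point is that the I/O routine is generic and the caller decides which buffer type receives the bytes.

```rust
/// Hypothetical stand-in for the role `bytes::BufMut` plays; not the real trait.
pub trait ByteSink {
    fn put_slice(&mut self, src: &[u8]);
}

/// A plain, unaligned sink.
impl ByteSink for Vec<u8> {
    fn put_slice(&mut self, src: &[u8]) {
        self.extend_from_slice(src);
    }
}

/// A made-up second sink that only counts bytes, to show that the caller
/// chooses the behavior.
#[derive(Default)]
pub struct Tally {
    pub bytes: usize,
}

impl ByteSink for Tally {
    fn put_slice(&mut self, src: &[u8]) {
        self.bytes += src.len();
    }
}

/// The I/O routine never names a concrete buffer type.
pub fn read_into<B: ByteSink>(chunks: &[&[u8]], sink: &mut B) {
    for chunk in chunks {
        sink.put_slice(chunk);
    }
}
```

An alignment-preserving buffer would be one more implementor of the trait, so callers that need alignment pass that type and callers that do not can pass a plain one.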

lwwmanning · 2024-12-26T16:59:22Z

vortex-ipc/src/iterator.rs

+    /// Collects the IPC bytes into a single `Bytes`.
+    pub fn collect_to_buffer(self) -> VortexResult<Bytes> {
+        let buffers: Vec<Bytes> = self.try_collect()?;
+        let mut buffer = BytesMut::with_capacity(buffers.iter().map(|b| b.len()).sum());


when we're allocating these big buffers, should use our aligned version, no? (to at least 8 byte align everything)

vortex-ipc/src/messages/decoder.rs

vortex-buffer/src/buffer_mut.rs

gatesn added 30 commits December 18, 2024 19:20

Store desired alignment in the array buffer

79e6ba2

Buffer alignment

33b911c

Buffer alignment

602f509

Buffer alignment

b54f546

Buffer alignment

7db32d1

Buffer alignment

7852ff3

Buffer alignment

1525de5

Merge branch 'develop' into ngates/buffers

83ad2ba

AlignedBufferMut

8209d76

AlignedBufferMut

897d063

AlignedBufferMut

1ef552e

AlignedBufferMut

b4cb50a

AlignedBufferMut

c5ab1ac

AlignedBufferMut

59bc34a

AlignedBufferMut

d14bc95

AlignedBufferMut

3baafa7

AlignedBufferMut

50438aa

AlignedBufferMut

7689fc8

AlignedBufferMut

6f67551

Fix transmute

3989cca

Fix transmute

1f5eca7

Fix transmute

f61bd02

Fix transmute

c75b39a

Fix transmute

f247f1a

Fix transmute

434b635

Fix transmute

cf6049f

Combine into single ScalarBuffer

a21f595

Benchmark from_iter

6b37397

Benchmark from_iter

3338746

Benchmark from_iter

20e4c7e

lwwmanning added 3 commits December 26, 2024 14:58

remove unnecessary allocations in bitpacking compress

ce9c8fd

nits

3c1735a

more nits

885eb4b

lwwmanning reviewed Dec 26, 2024

fixup locks

530c69c

gatesn commented Dec 29, 2024

gatesn added 4 commits December 29, 2024 07:22

Zero-copy IO

4b51cf4

Zero-copy IO

eba9a69

Zero-copy IO

768c692

Zero-copy IO

2f46737

gatesn mentioned this pull request Dec 29, 2024

Dk/clone into maybe null slice considered harmful #1750

Closed

gatesn added 3 commits December 29, 2024 07:43

Zero-copy IO

039ed78

Zero-copy IO

aec6081

Zero-copy IO

6dafbdc

gatesn mentioned this pull request Dec 29, 2024

Add an ArrayBuffer that declares alignment #1720

Closed

gatesn added 3 commits December 29, 2024 07:51

Zero-copy IO

120b1c7

Zero-copy IO

d31f847

Appease Miri

978a2fb

lwwmanning approved these changes Dec 30, 2024

Merge remote-tracking branch 'origin/develop' into ngates/buffers

cdf2bf9

lwwmanning disabled auto-merge December 30, 2024 16:23

lwwmanning enabled auto-merge (squash) December 30, 2024 16:23

lwwmanning changed the title ~~vortex-buffer~~ refactor: vortex-buffer Dec 30, 2024

lwwmanning disabled auto-merge December 30, 2024 16:27

lwwmanning enabled auto-merge (squash) December 30, 2024 16:27

lwwmanning merged commit 920b2d2 into develop Dec 30, 2024
20 checks passed

lwwmanning deleted the ngates/buffers branch December 30, 2024 16:34

gatesn restored the ngates/buffers branch December 30, 2024 16:43

gatesn mentioned this pull request Dec 30, 2024

Clean up alignment #115

Closed

lwwmanning mentioned this pull request Jan 1, 2025

Patching BoolArray always copies (never in-place) #1774

Open
