fix: copy arrays when placing them in the v2 writer's accumulation queue #2249
Conversation
Codecov Report

Attention: Patch coverage is …

Additional details and impacted files:

```
@@            Coverage Diff             @@
##              main    #2249      +/-  ##
==========================================
+ Coverage    81.03%   81.08%   +0.05%
==========================================
  Files          186      187       +1
  Lines        53753    53963     +210
  Branches     53753    53963     +210
==========================================
+ Hits         43558    43756     +198
- Misses        7715     7730      +15
+ Partials      2480     2477       -3
```
The branch was force-pushed from bb54a47 to 1e1bfca.
```rust
use arrow_buffer::Buffer;
use arrow_data::ArrayData;

/// Copies the buffer's bytes into a newly allocated, Rust-owned buffer.
pub fn deep_copy_buffer(buffer: &Buffer) -> Buffer {
    Buffer::from(Vec::from(buffer.as_slice()))
}
```
Do we need to copy data here? Can we just do buffer.data.clone()?
https://docs.rs/arrow-buffer/51.0.0/src/arrow_buffer/buffer/immutable.rs.html#34
We cannot. When the buffer is imported over FFI, its refcount is tied to the entire batch, so it can't be freed until every other column in the same batch is also ready to be freed. By copying the buffer into Rust, we gain control over the lifetime and can free it earlier.
As Weston said in the PR description, we could solve this by importing each column individually over FFI.
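To make the distinction concrete, here is a minimal sketch (not from the PR) contrasting a shallow clone with the deep copy, using `Buffer::clone()` as the public equivalent of cloning the inner `Arc`:

```rust
use arrow_buffer::Buffer;

// Shallow: `Buffer` is Arc-backed, so clone() only bumps a refcount and the
// bytes stay pinned to whatever allocation backs them (for an FFI import,
// that allocation belongs to the whole batch).
fn shallow_copy(buffer: &Buffer) -> Buffer {
    buffer.clone()
}

// Deep: materialize the bytes into a fresh Rust-owned Vec, so this buffer's
// lifetime is independent of the original allocation and it can be freed as
// soon as the flushed array is dropped.
fn deep_copy(buffer: &Buffer) -> Buffer {
    Buffer::from(Vec::from(buffer.as_slice()))
}
```

The shallow version is free but keeps the FFI allocation alive; the deep version pays a memcpy to make the lifetime independent.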
```rust
// Push into buffered_arrays without copy since we are about to flush anyways
self.buffered_arrays.push(array);
```
Nice 👍
When a record batch crosses the FFI layer we lose the ability to deallocate that batch one array at a time (every array in the batch keeps a reference counted pointer to the batch). This is a problem for the writer because we usually want to flush some arrays to disk and keep other arrays in memory for a while longer. However, we do want to release the arrays we flush to disk to avoid memory leaks.
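As a plain-Rust analogue of that retention behaviour (illustrative types only, not the arrow-rs FFI machinery): every imported column holds a reference to the same shared allocation, so dropping the columns you flush releases nothing while any sibling column survives.

```rust
use std::sync::Arc;

// Illustrative stand-ins, not arrow-rs types: after an FFI import, every
// column in the batch keeps the same shared allocation alive.
struct BatchAllocation {
    bytes: Vec<u8>, // imagine the multi-GB video payload living here
}

struct ImportedColumn {
    name: &'static str,
    backing: Arc<BatchAllocation>,
}

fn main() {
    let allocation = Arc::new(BatchAllocation { bytes: vec![0u8; 1 << 20] });
    let video = ImportedColumn { name: "video", backing: allocation.clone() };
    let frame_id = ImportedColumn { name: "frame_id", backing: allocation.clone() };
    drop(allocation);

    // "Flush" the big column: dropping it does not release the allocation,
    // because the tiny metadata column still holds a reference to it.
    println!("flushing {} to disk", video.name);
    drop(video);
    println!(
        "{} alone still pins {} bytes (refcount = {})",
        frame_id.name,
        frame_id.backing.bytes.len(),
        Arc::strong_count(&frame_id.backing)
    );
}
```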
This issue was highlighted in a test / demo that attempted to write video data to a lance file. In that situation we were writing the data one row at a time. There were only 30,000 rows in the source data, but they amounted to hundreds of GB. None of the metadata columns would ever flush to disk (e.g. a 4-byte int column at 30,000 rows is only 120KB). These metadata arrays were keeping the video data alive in memory and the writer was using far too much memory.
The solution in this PR is to perform a deep copy of any data we are planning on flushing to disk. This is unfortunate, and there is a configuration parameter to disable the copy (I have intentionally chosen not to expose it in the python API, since data from python always crosses an FFI boundary and is therefore susceptible to this problem). However, copying this data by default is a much safer course of action.
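For illustration, here is a hedged sketch of what such a deep copy looks like in arrow-rs terms; the function names are assumptions for this example rather than the writer's actual API, and offset handling is kept naive:

```rust
use arrow_buffer::{BooleanBuffer, Buffer, NullBuffer};
use arrow_data::ArrayData;

// Copy a buffer's bytes into freshly allocated, Rust-owned memory.
fn deep_copy_buffer(buffer: &Buffer) -> Buffer {
    Buffer::from(Vec::from(buffer.as_slice()))
}

// Copy the validity bitmap as well, preserving its bit offset and length.
fn deep_copy_nulls(nulls: &NullBuffer) -> NullBuffer {
    let bitmap = BooleanBuffer::new(
        deep_copy_buffer(nulls.buffer()),
        nulls.offset(),
        nulls.len(),
    );
    NullBuffer::new(bitmap)
}

// Recursively rebuild an ArrayData from copied buffers and children so its
// lifetime no longer depends on the FFI-imported batch it came from.
fn deep_copy_array_data(data: &ArrayData) -> ArrayData {
    ArrayData::builder(data.data_type().clone())
        .len(data.len())
        .offset(data.offset())
        .nulls(data.nulls().map(deep_copy_nulls))
        .buffers(data.buffers().iter().map(deep_copy_buffer).collect())
        .child_data(data.child_data().iter().map(deep_copy_array_data).collect())
        .build()
        .expect("copied buffers describe the same, already-valid layout")
}
```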
In the future we could investigate alternative ways of passing the data across the FFI boundary (e.g. instead of sending the entire record batch we could send individual arrays across and then reassemble them into a record batch on the other side). However, I don't think we need to worry too much about this until we see writer CPU performance become an issue. In most cases the batches will probably already be in the CPU cache, so the copy should be quick. Also, writers that are writing any significant amount of data will be I/O bound.
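If that route were explored, a per-column FFI round trip might look roughly like the following sketch (illustrative only, not part of this PR), built on arrow-rs's `to_ffi`/`from_ffi` helpers:

```rust
use std::sync::Arc;

use arrow::array::{make_array, Array, ArrayRef, Int32Array, StringArray};
use arrow::datatypes::{DataType, Field, Schema};
use arrow::error::Result;
use arrow::ffi::{from_ffi, to_ffi};
use arrow::record_batch::RecordBatch;

// Export each column through its own FFI_ArrowArray/FFI_ArrowSchema pair and
// reassemble the batch on the consumer side. Each imported column then owns
// its own release callback, so it can be freed independently of its siblings.
fn roundtrip_per_column(batch: &RecordBatch) -> Result<RecordBatch> {
    let mut columns: Vec<ArrayRef> = Vec::with_capacity(batch.num_columns());
    for column in batch.columns() {
        // "Producer" side: send this column across the boundary on its own.
        let (ffi_array, ffi_schema) = to_ffi(&column.to_data())?;
        // "Consumer" side: import it back as an independently owned array.
        let imported = from_ffi(ffi_array, &ffi_schema)?;
        columns.push(make_array(imported));
    }
    RecordBatch::try_new(batch.schema(), columns)
}

fn main() -> Result<()> {
    let schema = Arc::new(Schema::new(vec![
        Field::new("frame_id", DataType::Int32, false),
        Field::new("caption", DataType::Utf8, true),
    ]));
    let batch = RecordBatch::try_new(
        schema,
        vec![
            Arc::new(Int32Array::from(vec![1, 2, 3])) as ArrayRef,
            Arc::new(StringArray::from(vec![Some("a"), None, Some("c")])) as ArrayRef,
        ],
    )?;
    let reassembled = roundtrip_per_column(&batch)?;
    assert_eq!(batch, reassembled);
    Ok(())
}
```

With per-column import, flushing and dropping one column would no longer pin its siblings' memory, at the cost of a more involved FFI handshake.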