
Reduce the heap use of BKDReader instances #13464

Merged

Conversation

original-brownbear
Member

We consume a lot of memory for the `indexIn` slices. If `indexIn` is of type `MemorySegmentIndexInput`, the overhead of keeping loads of slices around just for cloning is far higher than the extra 12 bytes per reader this adds (the slice description alone often costs a lot). In a number of Elasticsearch example uses with high segment counts I investigated, this change would save up to O(GB) of heap.
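The trade described above swaps a retained slice object per reader for the parent input plus a stored offset and length, building the slice only when it is actually needed. A rough, hypothetical sketch of that idea (the class and method names here are illustrative, not taken from the Lucene patch):

```java
// Hypothetical sketch: rather than holding a pre-built slice object per
// reader (whose description string and bookkeeping dominate the footprint),
// keep only the shared parent input plus an offset and a length -- the
// "extra 12 bytes" (one long + one int) mentioned above -- and materialize
// the slice lazily, only when a clone is actually requested.
class LazySliceReader {
    private final byte[] parentInput; // stand-in for the shared indexIn
    private final long sliceOffset;   // 8 bytes of per-reader state
    private final int sliceLength;    // 4 bytes of per-reader state

    LazySliceReader(byte[] parentInput, long sliceOffset, int sliceLength) {
        this.parentInput = parentInput;
        this.sliceOffset = sliceOffset;
        this.sliceLength = sliceLength;
    }

    // Build the slice on demand instead of retaining one per reader.
    byte[] sliceForClone() {
        byte[] slice = new byte[sliceLength];
        System.arraycopy(parentInput, (int) sliceOffset, slice, 0, sliceLength);
        return slice;
    }
}
```

With many readers sharing one parent input, per-reader state shrinks to the offset/length pair, while clones pay the (cheap) cost of re-slicing.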
Contributor

@iverase iverase left a comment


makes sense to me

Contributor

@jpountz jpountz left a comment


Agreed that the intermediate slice is not helping here. The new code is also consistent with e.g. doc values, which only create slices when pulling doc-values instances.

@jpountz jpountz merged commit c7a7d48 into apache:main Jun 7, 2024
3 checks passed
@jpountz jpountz added this to the 9.12.0 milestone Jun 7, 2024
jpountz pushed a commit that referenced this pull request Jun 7, 2024
@original-brownbear original-brownbear deleted the reduce-heap-overhead-bkd-reader branch June 7, 2024 12:17
linfn pushed a commit to linfn/lucene that referenced this pull request Nov 5, 2024