perf: dynamic dispatch overhead #1922

PSeitz · 2023-03-02T09:24:11Z

This flamegraph is an term query aggregation on 10mio documents selected via a "*" query.
Only around 60% is spent in the aggregation, a large chunk seems to be calling overhead, e.g the dynamic dispatch on the callback in for_each_docset.

pub(crate) fn for_each_docset<T: DocSet + ?Sized>(docset: &mut T, callback: &mut dyn FnMut(DocId)) {
    let mut doc = docset.doc();
    while doc != TERMINATED {
        callback(doc);
        doc = docset.advance();
    }
}

The text was updated successfully, but these errors were encountered:

fulmicoton · 2023-03-02T09:46:11Z

considering you do buffering in the aggregation layer, should we add some buffering solution in at the DocSet level maybe? can you experiment?

PSeitz · 2023-03-02T10:56:10Z

Yes, I think we have two options

Buffering like in aggregations. It probably needs to be on the Collector, but could be combined with DocSet
Remove the callback dynamic dispatch (maybe not feasible)

Buffering may have some free positive side effects on the generated code, but the (sometimes unused) score may complicate things a bit

PSeitz · 2023-03-20T02:31:50Z

#1937

PSeitz closed this as completed Mar 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: dynamic dispatch overhead #1922

perf: dynamic dispatch overhead #1922

PSeitz commented Mar 2, 2023

fulmicoton commented Mar 2, 2023

PSeitz commented Mar 2, 2023

PSeitz commented Mar 20, 2023

perf: dynamic dispatch overhead #1922

perf: dynamic dispatch overhead #1922

Comments

PSeitz commented Mar 2, 2023

fulmicoton commented Mar 2, 2023

PSeitz commented Mar 2, 2023

PSeitz commented Mar 20, 2023