`DiscreteNonParametric` and `Categorical` Construction Issue #1832

btmit · 2024-01-30T18:44:32Z

Construction of a Categorical distribution seems to make a copy of the p vector. I see this through profiling, @btime and the fact that I can't see changes in the original vector after I create the Categorical. There are three issues I see:

Categorical docstring includes the following: "Note: The input vector p is directly used as a field of the constructed distribution, without being copied." which seems incorrect.
Performance issues in critical sections of code where this allocation can really add up
Bugs such as the following:

using Distributions
x = rand(3,5)
x = x ./ sum(x, dims=1)  # each column is a valid probability vector
c = Categorical.(eachcol(x))

julia> c = Categorical.(eachcol(x))
ERROR: MethodError: Cannot convert an object of type Vector{Float64} to an object of type SubArray{Float64, 1, Matrix{Float64}, Tuple{Base.Slice{Base.OneTo{Int64}}, Int64}, true}

I believe the underlying issue is that the DiscreteNonParametric inner constructor tries to sort and reorder everything, which creates a copy and then the constructor doesn't update the type.

The text was updated successfully, but these errors were encountered:

JockLawrie · 2025-01-08T23:29:01Z

Just an anecdote, I'm hitting the performance implication of this issue. I'm running discrete event simulations that construct Categorical distributions from several statistical models 100s of billions of times. The ability to reuse p would help greatly here. Alternatively, supplying a tuple instead of a vector would work too.

devmotion · 2025-01-08T23:43:54Z

Did you compare it with #1908?

JockLawrie · 2025-01-09T03:34:27Z

Yes PR 1908 looks like it fixes this issue, thanks. Is there anything blocking it being merged?

devmotion · 2025-01-09T07:51:00Z

It hasn't been approved yet.

devmotion linked a pull request Oct 2, 2024 that will close this issue

Fix the constructor of DiscreteNonParametric #1908

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`DiscreteNonParametric` and `Categorical` Construction Issue #1832

`DiscreteNonParametric` and `Categorical` Construction Issue #1832

btmit commented Jan 30, 2024 •

edited

Loading

JockLawrie commented Jan 8, 2025 •

edited

Loading

devmotion commented Jan 8, 2025

JockLawrie commented Jan 9, 2025

devmotion commented Jan 9, 2025

DiscreteNonParametric and Categorical Construction Issue #1832

DiscreteNonParametric and Categorical Construction Issue #1832

Comments

btmit commented Jan 30, 2024 • edited Loading

JockLawrie commented Jan 8, 2025 • edited Loading

devmotion commented Jan 8, 2025

JockLawrie commented Jan 9, 2025

devmotion commented Jan 9, 2025

`DiscreteNonParametric` and `Categorical` Construction Issue #1832

`DiscreteNonParametric` and `Categorical` Construction Issue #1832

btmit commented Jan 30, 2024 •

edited

Loading

JockLawrie commented Jan 8, 2025 •

edited

Loading