[BUG] Some metrics don't work with int64 data type arrays unlike their sklearn counterparts #4784
Labels
? - Needs Triage
Need team to review and classify
bug
Something isn't working
inactive-30d
inactive-90d
Describe the bug
Currently some metrics like
entropy
support onlyint32
type of arrays. I realised that even after allowingint64
dtype by modifying this line, the output was wrong due to the reason described below.Some metrics like
homogeneity
which use the underlyingentropy
metric return wrong output for arrays of type int64 (or if no data type is provided). The issue is because of a bug in thecub
module being used in theentropy
metric, specifically at this line as it doesn't return the correct count from the histogram.The root cause behind this seems to be this in the cub repo.
Steps/Code to reproduce bug
Expected behavior
The API should work with int64 arrays as well just like the sklearn API.
Environment details (please complete the following information):
branch-22.08
The text was updated successfully, but these errors were encountered: