[BUG] make_blobs
doesn't behave like cuML and scikit-learn counterparts, and shuffle
doesn't really shuffle
#1127
Labels
bug
Something isn't working
This is better understood by example.
cuML and scikit-learn behavior:
raft behavior:
The difference between the unshuffled versions is cosmetic (cuML and sci-kit learn have continuous labels, raft has the index modulo the number of labels). However, the "shuffled" version in raft is not properly shuffled, as the labels appear cyclically in the same order.
This is due to how raft attempts to shuffle them, using an affine transform. Modulo congruence is not sensitive to multiplication and addition. That is, if two points have the same label pre-transform, they have the same label post-transform.
We should use a different method to shuffle.
The text was updated successfully, but these errors were encountered: