ENH: support 'duplicated' functionality for ExtensionArrays #27264
Labels
duplicated
duplicated, drop_duplicates
Enhancement
ExtensionArray
Extending pandas with custom dtypes or arrays.
For the
factorize
,unique
,groupby
hashtable-based functionalities, we included a_values_for_factorize()
/factorize()
method on the ExtensionArray. So for those methods, it is working nicely. However, for some of the other hashtable-based methods such asduplicated()
ordrop_duplicates
, this machinery is not used and the EA is still coerced to a numpy array before passing to the algos code.Small illustration that this is the fact by patching the IntegerArray to print when being coerced to a numpy array:
The text was updated successfully, but these errors were encountered: