ENH/PERF: allow mask to be optional in our masked ExtensionArrays #30435
Labels
Enhancement
ExtensionArray
Extending pandas with custom dtypes or arrays.
Missing-data
np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate
NA - MaskedArrays
Related to pd.NA and nullable extension arrays
Performance
Memory or execution speed performance
Our nullable, mask-based ExtensionArrays (currently integer and boolean, inheriting from MaskedArray) have a
_data
and_mask
numpy arrays stored under the hood. SO we use a numpy boolean array as mask (8bit), also when there are no missing values.One, relatively easy, memory + performance improvement could be achieved by allowing the mask to be None when there are no missing data. Since the mask data is completely internal to the Array implementations, this should be possible to do.
(to be checked how involved the ops code would become to handle this as optional)
The text was updated successfully, but these errors were encountered: