You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When we are handling single tabular data, it is very common to see some constraints that require the values of some numeric columns to be Positive (>0) and the values of some numeric columns to be non-negative (>=0).
Expected behavior
I would expect some constraint class to be integrated in the library like this:
@npatki and I were discussing that we can have general design, and perhaps we expose multiple constraints with specific names like ispositive and is-nonnegative but the underlying call will use the general purpose greaterthan. It may give better usability, and interpretability. [minor item]
+1 We'll at least need to support greater_than, greater_than_or_equal_to, less_than, and less_than_or_equal_to as base cases to cover all possibilities. Doing this will allow us to layer additional functionality for ease-of-use. We can prioritize based on usage but I assume we'll at least want to have:
Hey, having above mentioned features would be very valuable. Very often datasets contain columns which from an engineering view would always have to be >0 or bigger than any chosen number.
Iam actually solving this at the moment, with just generating a lot of synth data and then filtering the data for rows with values only > 0.
I am not 100% sure if this approach will keep the overall quality of the synth data intact?
npatki
changed the title
Non-negative and Positive constraints across multiple columns
Add Non-negative and Positive constraints across multiple columns
Jul 12, 2021
Problem Description
When we are handling single tabular data, it is very common to see some constraints that require the values of some numeric columns to be Positive (>0) and the values of some numeric columns to be non-negative (>=0).
Expected behavior
I would expect some constraint class to be integrated in the library like this:
Or maybe have even more general design like this:
The text was updated successfully, but these errors were encountered: