You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Technically this works, but it is weird to make this a "numerical" sdtype because it's not actually a statistically valid numerical column. (taking an average, correlation, etc. of this column does not make sense).
Expected
Instead, I propose the upgrade_metadata script should instead make this a "text" sdtype with a Regex that generates indices. This follows our expectation: ID columns will have structured text.
When applied to an int column, the synthesizer will return integer values that increase with every row: 0, 1, 2, ... (up to a max of 10^30 rows, which is enough for most purposes).
The text was updated successfully, but these errors were encountered:
Environment Details
Background
I have the following metadata (written in the old format, pre-SDV 1.0):
This conveys that the user id is an index that increments
0, 1, 2, ...
Observed
When I call
updgrade_metadata
, I get the following:Technically this works, but it is weird to make this a
"numerical"
sdtype because it's not actually a statistically valid numerical column. (taking an average, correlation, etc. of this column does not make sense).Expected
Instead, I propose the
upgrade_metadata
script should instead make this a"text"
sdtype with a Regex that generates indices. This follows our expectation: ID columns will have structured text.When applied to an int column, the synthesizer will return integer values that increase with every row:
0, 1, 2, ...
(up to a max of 10^30 rows, which is enough for most purposes).The text was updated successfully, but these errors were encountered: