Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement dt.verify_integrity() function #7

Closed
st-pasha opened this issue May 10, 2017 · 0 comments
Closed

Implement dt.verify_integrity() function #7

st-pasha opened this issue May 10, 2017 · 0 comments
Assignees

Comments

@st-pasha
Copy link
Contributor

This function should check whether the data in the datatable is consistent, and correct it if possible. In particular, the following checks are needed:

  • all the checks already performed by the window() function
  • check that boolean columns contain only values 0, 1, and NA
  • check that string/categorical columns contain valid UTF-8 strings
  • check that string/categorical columns have offsets that are monotonically increasing (in abs value) and do not go out-of-range
  • check that string data column in a string section is padded with 0xFFs
  • check validity of all rollup values
  • (to be continued)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant