[BUG] tail method sometimes fail #2495
Comments
@jangorecki this should be fixed in the latest nightlies. This was due to nulls being improperly handled as Python
@kkraus14
@jangorecki Does it always fail, or does it depend on the data generated for 0.9?
@rgsl888prabhu It depends on the data. Among 4 different cases of the cardinality factor ("K"), the issue manifests in only one. You can generate the exact data that causes the problem by following the initial instructions.
@jangorecki I tried to reproduce using 0.9, but I wasn't able to do so. If you have that
@rgsl888prabhu I have the CSV, but it is 45 GB in size.
Do you remember the random seed that you had set? I don't see it in the script.
There is a random seed set in the script:
wget https://raw.githubusercontent.com/h2oai/db-benchmark/master/groupby-datagen.R
Rscript groupby-datagen.R 1e9 2 0 0
Thank you @jangorecki, I am able to reproduce the scenario.
Simplified code to reproduce
After running a query, I get an ans frame. The head method works fine on it, but the tail method fails. This happens rarely and strongly depends on the data. Using 0.8.0+0.g8fa7bd3.dirty.
I can provide a reproducible example, but it will not be minimal. The one provided in #2494 (comment) might work after changing K=2.