MovieLens Datasets #147
Dsantra92
commented
Jun 23, 2022
- MovieLens Struct
- Base.show
- Docs
- MovieLens 100K
- MovieLens 1M
- MovieLens 20M
- MovieLens 25M
- Tests
Codecov Report
@@            Coverage Diff             @@
##           master     #147      +/-   ##
==========================================
+ Coverage   38.68%   44.01%   +5.32%
==========================================
  Files          39       40       +1
  Lines        1755     2029     +274
==========================================
+ Hits          679      893     +214
- Misses       1076     1136      +60
src/datasets/graphs/movielens.jl
user_data["gender"] = user_df[!, 3] .== "M" # I hope I don't get cancelled for binarizing this field
user_data["occupation"] = user_df[!, 4]
user_data["zipcode"] = user_df[!, 5]
return user_data
The indentation is not uniform in this file. It should be 4 spaces everywhere.
Yeah, vim messed it up somewhere. I will fix these things in a cleanup commit later.
I forgot to include the indentation fix; it will be added in a later commit.
The data storage format is inconsistent across the three variations: 100k, 1m, and the current datasets (20m, 25m, etc.). I will address this once all of them have working APIs.
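For context, one visible difference between the variations is the field delimiter of the ratings files: tab-separated `u.data` in 100k, `::`-separated `ratings.dat` in 1m, and comma-separated `ratings.csv` (with a header row) in 20m/25m. The sketch below is only an illustration of how that difference could be hidden behind one parser. `RATING_DELIMS` and `parse_rating` are hypothetical names that are not part of this PR, and the inconsistencies mentioned above may well go beyond delimiters.

```julia
# Hypothetical helper, NOT the code in this PR: normalise one ratings line
# from any MovieLens variant into the same named tuple.
const RATING_DELIMS = Dict(
    "100k" => "\t",  # u.data: user, item, rating, timestamp (tab-separated)
    "1m"   => "::",  # ratings.dat: UserID::MovieID::Rating::Timestamp
    "20m"  => ",",   # ratings.csv: comma-separated; caller must skip the header row
    "25m"  => ",",
)

function parse_rating(line::AbstractString, variant::AbstractString)
    fields = split(line, RATING_DELIMS[variant])
    # Ratings are parsed as Float64 because the newer datasets use half-star steps.
    return (
        user = parse(Int, fields[1]),
        movie = parse(Int, fields[2]),
        rating = parse(Float64, fields[3]),
        timestamp = parse(Int, fields[4]),
    )
end
```

With a helper along these lines, `parse_rating("1::1193::5::978300760", "1m")` and `parse_rating("1,2,3.5,1112486027", "20m")` would return tuples with the same field names, which is one way the three loaders could share a common output format.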
Are all tests passing locally?
Yes