MovieLens Datasets #147

Dsantra92 · 2022-06-23T14:46:41Z

codecov-commenter · 2022-06-23T14:57:39Z

Codecov Report

Merging #147 (71ee554) into master (63b865f) will increase coverage by 5.32%.
The diff coverage is 76.27%.

@@            Coverage Diff             @@
##           master     #147      +/-   ##
==========================================
+ Coverage   38.68%   44.01%   +5.32%     
==========================================
  Files          39       40       +1     
  Lines        1755     2029     +274     
==========================================
+ Hits          679      893     +214     
- Misses       1076     1136      +60

Impacted Files	Coverage Δ
src/datasets/graphs/movielens.jl	`76.19% <76.19%> (ø)`
src/MLDatasets.jl	`100.00% <100.00%> (ø)`
src/utils.jl	`61.22% <0.00%> (+10.20%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 63b865f...71ee554. Read the comment docs.

CarloLucibello · 2022-06-24T21:57:46Z

src/datasets/graphs/movielens.jl

+  user_data["gender"] = user_df[!, 3] .== "M" # I hope I don't get cancelled for binarizing this field
+  user_data["occupation"] = user_df[!, 4]
+  user_data["zipcode"] = user_df[!, 5]
+  return user_data


The indentation is not uniform in this file. It should be 4 blanks everywhere

Yeah, vim messed it up somewhere, I will fix these things in a cleanup commit later.

Dsantra92 · 2022-06-24T22:48:13Z

Forgot to add the indentation fix change, will be added in later commit.

Dsantra92 · 2022-06-24T22:52:02Z

There are inconsistencies in data storing format across the 3 variations: 100k,1m and current datasets(20m, 25m etc.). Will address the issue when all of them have working APIs.

CarloLucibello · 2022-06-28T15:27:12Z

Are all tests passing locally?

Dsantra92 · 2022-06-28T15:41:02Z

Are all tests passing locally?

Yes

src/datasets/graphs/movielens.jl

MovieLens 100k

3e7e0a2

Dsantra92 changed the title ~~MovieLens 100k~~ MovieLens Datasets Jun 23, 2022

Dsantra92 linked an issue Jun 23, 2022 that may be closed by this pull request

Movielens datasets #104

Closed

MovieLens 100k modular with test

bec5d63

CarloLucibello reviewed Jun 24, 2022

View reviewed changes

MovieLens 1m + doc_error_fix + indentation fix

836c82c

Dsantra92 added 5 commits June 27, 2022 03:53

20m+25m - tests - consistency

169f413

tests and better metadata

47b5267

fix expected type

d02e3ba

ml-10m

40d7832

Docs

0912241

Dsantra92 marked this pull request as ready for review June 28, 2022 15:03

Dsantra92 requested a review from CarloLucibello June 28, 2022 15:03

Don't run ogbn-mag on Windows + remove comment

0f0d6a6

Fix windows issue

ebb7868

CarloLucibello reviewed Jun 30, 2022

View reviewed changes

CarloLucibello added 6 commits June 30, 2022 05:11

Update src/datasets/graphs/movielens.jl

1d47122

Update src/datasets/graphs/movielens.jl

04c4296

Update src/datasets/graphs/movielens.jl

e7cbded

Update src/datasets/graphs/movielens.jl

04c1da8

Update src/datasets/graphs/movielens.jl

36fde71

Update src/datasets/graphs/movielens.jl

71ee554

CarloLucibello merged commit 917665f into JuliaML:master Jun 30, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MovieLens Datasets #147

MovieLens Datasets #147

Dsantra92 commented Jun 23, 2022 •

edited

Loading

codecov-commenter commented Jun 23, 2022 •

edited

Loading

CarloLucibello Jun 24, 2022

Dsantra92 Jun 24, 2022

Dsantra92 commented Jun 24, 2022

Dsantra92 commented Jun 24, 2022

CarloLucibello commented Jun 28, 2022

Dsantra92 commented Jun 28, 2022

MovieLens Datasets #147

MovieLens Datasets #147

Conversation

Dsantra92 commented Jun 23, 2022 • edited Loading

codecov-commenter commented Jun 23, 2022 • edited Loading

Codecov Report

CarloLucibello Jun 24, 2022

Choose a reason for hiding this comment

Dsantra92 Jun 24, 2022

Choose a reason for hiding this comment

Dsantra92 commented Jun 24, 2022

Dsantra92 commented Jun 24, 2022

CarloLucibello commented Jun 28, 2022

Dsantra92 commented Jun 28, 2022

Dsantra92 commented Jun 23, 2022 •

edited

Loading

codecov-commenter commented Jun 23, 2022 •

edited

Loading