[💡SUG] Do you have training and evaluation speed reference benchmarks? #518
Comments
Thanks for this nice suggestion. We will discuss this point and update the response soon. -Wayne Xin Zhao
To add a bit: the intent is not to create a top-performing benchmark in speed or accuracy. Rather, it would be a rough guide for users, providing parameters that work on a common platform (e.g., Colab K80) and an example of what to expect for runtimes. Thank you for the consideration.
Our team just had a discussion on this issue. We will arrange the tests and give a rough time estimate for the implemented algorithms on selected datasets of varying sizes. Hopefully, we will post these efficiency results on the main page or elsewhere before next Wednesday, and we will also report back on this issue page.

BTW, the LightGCN issue you mentioned is also important; if such a speed board were available, that issue might become clear. Our team has also asked the implementer to locate the lines that are likely to raise the reported memory exception, and we will get back to you with the answer soon. A practical hint is that different algorithms scale to datasets of different sizes: graph-based algorithms are likely to take up more space than other kinds of algorithms, which can raise memory exceptions on large-scale datasets (e.g., the Gowalla dataset). That is why we provide a series of data preprocessing functions in the library, e.g., K-core filtering.

In the future, we will consider accelerating some competitive but slow algorithms (that would take some time, probably in 2021 =) ). Thanks again for your efforts with these suggestions!
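For reference, a minimal sketch of the K-core filtering mentioned above, applied before training a graph-based model. The config keys `min_user_inter_num` / `min_item_inter_num` and the dataset name are assumptions based on RecBole's filtering options of this era, not confirmed in this thread; check your version's documentation.

```python
# Sketch only: apply 10-core filtering before training a graph-based model,
# shrinking the interaction graph and its memory footprint.
from recbole.quick_start import run_recbole

config_dict = {
    "min_user_inter_num": 10,  # keep users with >= 10 interactions (assumed key)
    "min_item_inter_num": 10,  # keep items with >= 10 interactions (assumed key)
}

# "gowalla" is illustrative; it must match a dataset prepared in RecBole's format.
run_recbole(model="LightGCN", dataset="gowalla", config_dict=config_dict)
```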
@batmanfly (and @ShanleiMu) I saw this post today, which provides links to time and memory costs for general recommenders and sequential recommenders. Thank you. I had a few questions/requests for those lists and figured this is a good issue thread to post them in.
Thank you again!
@tszumowski Nice suggestions. We will add these details to clarify our results. For context-aware and knowledge-aware algorithms, the results are on the way =) We find that some context-aware algorithms run more slowly than general recommendation algorithms, so we haven't obtained their results yet. Based on the current intermediate results, we expect them to be ready this weekend.
@batmanfly great! You're all so fast and responsive!
@tszumowski We have added more details to clarify our results and updated the time and memory costs of context-aware recommenders and knowledge-based recommenders. |
@ShanleiMu this is great! Thank you. I'll close this issue given all the great docs! |
Is your feature request related to a problem? Please describe.
Will you be able to post how long it typically takes to train and evaluate one epoch for each model? Even for just one large dataset (e.g., MovieLens-1M), this would be helpful to the community.
I noticed #484 and #485; I understand from those that there are no plans to keep a scoreboard.
However, it's a bit difficult to determine whether or not an algorithm is worth benchmarking, because any given algorithm may take hours to run a single epoch on a GPU.
For example, on a private dataset comparable to MovieLens-10M, I'm seeing drastically different training times across the general recommenders on a P100 GPU, from a few seconds per epoch to several minutes per epoch.
Having preliminary train/evaluation times would help a user understand accuracy vs. speed tradeoffs. It would also help users and developers benchmark their speeds against other open-source implementations, as sketched below.
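A minimal, hypothetical sketch of how such per-epoch timings could be measured (the `trainer._train_epoch` entry point in the usage comment is an assumption, not a documented API):

```python
# Hypothetical timing helper: wrap any callable (e.g., one training epoch)
# and report wall-clock time, so speeds can be compared across
# implementations on the same hardware.
import time

def time_call(fn, *args, **kwargs):
    """Return (result, elapsed_seconds) for a single call to fn."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start

# Usage sketch (the entry point name is assumed; adapt it to your trainer):
# _, secs = time_call(trainer._train_epoch, train_data, epoch_idx=0)
# print(f"train epoch: {secs:.1f}s")
```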
Describe the solution you'd like
A preliminary list of training time per epoch and evaluation time per epoch for each recommender, using default configurations on the MovieLens-1M dataset.
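As a sketch, such numbers can be produced with RecBole's documented quick start, which trains and evaluates a model under its default configuration; per-epoch times appear in the training log:

```python
# Sketch: train and evaluate one recommender on MovieLens-1M with default
# settings; RecBole logs the time taken by each training/evaluation epoch.
from recbole.quick_start import run_recbole

run_recbole(model="BPR", dataset="ml-1m")
```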
Describe alternatives you've considered
N/A
Additional context
N/A