
feat: add performance investigation #12

Closed
wants to merge 9 commits into from

Conversation

ddanielsantos
Contributor

closes #8

details

Implemented a benchmark for the map_to_html function. To do it, I moved the content from src/util.rs to src/lib.rs (see library crate)

I also tried to add a benchmark for the download_blogs function, but the 100 iterations would take a lot of time:

Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 1603.1s, or reduce sample count to 10.
Benchmarking map to html: Collecting 100 samples in estimated 1603.1 s (100 iterations)

I can reduce the number of iterations or try to figure out another solution; which do you think fits better?

@AntoniosBarotsis
Owner

AntoniosBarotsis commented Sep 29, 2022

Made a small change to make adding more benchmarks easier.

The one you added works great and definitely shows that the performance bottleneck is in the HTTP calls.

I was thinking of adding a benchmark that just runs the code once for the downloads, that should be representative enough for me. (could you work on that?) I would be interested in getting a call heatmap for that.

I was looking into pprof-rs but I keep getting compilation errors whenever I try to add it, wondering if you could get it to work somehow.

Also let me know if you are participating in hacktober because in that case I'll merge this in a few days :)

@AntoniosBarotsis
Owner

Ok so after digging into it for a little bit, it seems that the library is currently not officially supported on Windows.

Found this issue and this MR but it's not fully implemented yet, unfortunately.
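For reference, the usual pprof-rs + Criterion wiring looks roughly like the fragment below (it needs the `criterion` and `flamegraph` features of pprof in Cargo.toml, and per the note above it only builds on Unix-like platforms, not Windows). bench_download_blogs is a hypothetical target name:

```rust
use criterion::{criterion_group, Criterion};
use pprof::criterion::{Output, PProfProfiler};

criterion_group! {
    name = benches;
    // Sample the benchmarked code at 100 Hz and emit a flamegraph,
    // which would give the call heatmap discussed above.
    config = Criterion::default()
        .with_profiler(PProfProfiler::new(100, Output::Flamegraph(None)));
    targets = bench_download_blogs
}
```

This is a configuration fragment only; the profiler output lands under target/criterion/.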

@ddanielsantos
Contributor Author

ddanielsantos commented Sep 29, 2022

I was thinking of adding a benchmark that just runs the code once for the downloads, that should be representative enough for me. (could you work on that?) I would be interested in getting a call heatmap for that.

Yeah, for sure, I'll dig more into the docs later and try to do this.

Also let me know if you are participating in hacktober because in that case I'll merge this in a few days :)

Yes, I am. Also, this project looks good btw, I'll try to contribute to it whenever possible.

@AntoniosBarotsis
Owner

Another thing that might be useful is to be able to view download times per link (in case one specific website is disproportionately slow for some reason, it would be nice to know). Not sure how this would work with a proper benchmark lib (we could also just manually time the downloads) but just throwing ideas out there 👍

I basically want to have a good enough idea of what hinders performance before I decide whether I want async or not.
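The "just manually time the downloads" idea from the comment above can be sketched with nothing but the standard library; download_blog and the URLs here are hypothetical stand-ins for the real per-feed download:

```rust
use std::time::Instant;

// Hypothetical stand-in for the real per-feed download; it just does
// some fake work so the example is self-contained and runnable.
fn download_blog(url: &str) -> usize {
    url.bytes().map(|b| b as usize).sum()
}

fn main() {
    let links = [
        "https://example.com/feed-a.xml",
        "https://example.com/feed-b.xml",
    ];
    for link in links {
        let start = Instant::now();
        let size = download_blog(link);
        // One line per link makes a disproportionately slow feed obvious.
        println!("{link}: {size} bytes in {:?}", start.elapsed());
    }
}
```

Unlike a Criterion bench, this runs each download once, so the numbers are noisy, but it is enough to spot one link that is far slower than the rest.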

@ddanielsantos
Contributor Author

I was looking into setting a lower number of samples for Criterion benchmarks (to bench the download_blogs function) and discovered that the number of samples can't be < 10.

here's the documentation about this

running the bench with 10 samples resulted in this:

Benchmarking download blogs/download blogs: Warming up for 3.0000 s
Warning: Unable to complete 10 samples in 5.0s. You may wish to increase target time to 204.2s.
download blogs/download blogs                                                                          
                        time:   [5.4904 s 7.1811 s 9.3531 s]
Found 1 outliers among 10 measurements (10.00%)
  1 (10.00%) high mild

what do you think?
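Concretely, the 10-sample run above corresponds to a Criterion configuration fragment like this (bench_download_blogs is a hypothetical target name, with the body elided):

```rust
use criterion::{criterion_group, criterion_main, Criterion};

fn bench_download_blogs(c: &mut Criterion) {
    c.bench_function("download blogs", |b| {
        b.iter(|| { /* call download_blogs() here */ })
    });
}

criterion_group! {
    name = benches;
    // 10 is Criterion's documented minimum; sample_size panics below that.
    config = Criterion::default().sample_size(10);
    targets = bench_download_blogs
}
criterion_main!(benches);
```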

@AntoniosBarotsis
Owner

So the < 10 limit makes sense for Criterion since it's supposed to be a "statistics-driven" benchmarking lib, but this works fine for me.

We could increase the measurement time with this function but I'm not sure if there's a reason to do that since that will always depend on a lot of external factors.

I think this is basically what I wanted! Could you push your work so I could take a look?
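The measurement-time knob mentioned above is a builder method on Criterion; a configuration fragment bumping it toward what the warning suggested might look like this (the 205 s figure is taken from the warning output, not a recommendation):

```rust
use std::time::Duration;
use criterion::Criterion;

// Hypothetical config helper: 10 samples, each given up to ~205 s of
// measurement time instead of Criterion's 5 s default. For network-bound
// code the extra time mostly measures external factors, as noted above.
fn slow_bench_config() -> Criterion {
    Criterion::default()
        .sample_size(10)
        .measurement_time(Duration::from_secs(205))
}
```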

@AntoniosBarotsis AntoniosBarotsis marked this pull request as draft September 30, 2022 15:55
@AntoniosBarotsis
Owner

Converting this to a draft so I don't click the big green "Merge" button accidentally

@AntoniosBarotsis
Owner

Ok so, I made a per-link benchmark and found out that one of the feeds I was using is disproportionately slow, which is interesting

[image: per-link benchmark timings]

But that's about it, everything else is great, and thanks a lot for contributing!! I'll merge in 1-2 days to make sure the MR counts.

@AntoniosBarotsis
Owner

I fixed some linting warnings I was getting after the introduction of lib.rs, thanks again!

@AntoniosBarotsis
Owner

AntoniosBarotsis commented Sep 30, 2022

From the Hacktoberfest website: "Your PR/MRs must be created between October 1 and October 31 (in any time zone, UTC-12 thru UTC+14)", so just merging it after the 1st doesn't count. You might want to update your local branch (pull in the last few changes I made) and create another MR.

@ddanielsantos
Contributor Author

Oh no 😅, ok I'll do it. Thanks again man, if you find any other issue someday, feel free to contact me

@ddanielsantos ddanielsantos marked this pull request as ready for review September 30, 2022 21:52
@AntoniosBarotsis
Owner

AntoniosBarotsis commented Oct 1, 2022

@ddanielsantos waiting for the MR again 👀

@ddanielsantos
Contributor Author

done 🫡

Successfully merging this pull request may close these issues.

Performance investigation [Rust]