-
Notifications
You must be signed in to change notification settings - Fork 487
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactored dataloader #1040
Refactored dataloader #1040
Conversation
Model Benchmark
|
Codecov Report
@@ Coverage Diff @@
## main #1040 +/- ##
=======================================
Coverage 90.26% 90.27%
=======================================
Files 21 21
Lines 4736 4740 +4
=======================================
+ Hits 4275 4279 +4
Misses 461 461
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great improvement to have it once and not with every iteration 👍
🔬 Background
__getitem__
function to retrieve a sample from the dataset. Since for a dataset with 10.000 samples that is trained for 100 epochs, this function is called 1 Mio. times, pre-computing the samples should bring a speed-up of the model training.🔮 Key changes
📋 Review Checklist
Please make sure to follow our best practices in the Contributing guidelines.