-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: speed up vak.csv.has_unlabeled
#243
Comments
vak.csv.has_unlabeled
Looking at this again.
I should test to see if this really improves anything but my guess is it will
|
It does look like we can cut the time about in half by simply using Timing is not affected by calling a "helper" function that runs logic on a single So:
|
Thinking about this more Edge cases:
|
Rewrote new version that we use duration and try to catch edge cases in comment above. Will stop obsessing and just make this minor change to close the issue |
Still obsessing. After thinking more:
|
Refactor "has unlabeled", fixes #243
vak.csv.has_unlabeled goes over all files and looks for 'unlabeled'
The function can run faster if it just returns
True
the first time it finds a file with unlabeledThe text was updated successfully, but these errors were encountered: