-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve performance of nvtext::edit_distance #13912
Improve performance of nvtext::edit_distance #13912
Conversation
Benchmark shows improvement of about 2x overall except for longer strings.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Missing docs, otherwise looks fine. I'll do a final review pass once docs are added.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍
/merge |
Fixes a bug in `nvtext::edit_distance_matrix` where the internal offsets vector is initialized to 0. This error was introduced in #13912 The bug was found while working on a different PR which re-ordered the nvtext gtests execution causing device memory to be reused from the rmm pool in a different way. Authors: - David Wendt (https://github.com/davidwendt) Approvers: - Bradley Dice (https://github.com/bdice) - Mark Harris (https://github.com/harrism) - Nghia Truong (https://github.com/ttnghia) URL: #14283
Description
Improves performance of
nvtext::edit_distance
by reworking the algorithm with shorter working buffer and simpler logic.Checklist