refactor: Simplify how list splits are tracked. #5920
Currently the code keeps track of the split startUids and updates them as the posting lists are split or empty parts are removed. Instead, update the list of startUids at the end of each step based on the contents of the map of startUids to posting lists. This removes the need to track the same piece of information in two different ways. Also, fix a bug where empty parts were removed from the list of splits but not from the map. This caused empty posting lists to be written to disk, although it didn't affect normal execution because those parts were not accessible from the main posting list.
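To illustrate the idea, here is a minimal sketch in Go. It is not the code from this PR: the `part` and `multiPartList` types and the `removeEmptyParts` and `refreshStartUids` helpers are hypothetical names invented for this example. The sketch assumes the map of startUids to parts is the single source of truth and that the sorted list of startUids is rebuilt from the map's keys at the end of each step.

```go
package main

import "sort"

// part is a hypothetical stand-in for one split of a posting list.
type part struct {
	uids []uint64
}

// multiPartList is a hypothetical, simplified multi-part list.
// The map is the source of truth; startUids is derived from it.
type multiPartList struct {
	parts     map[uint64]*part // startUid -> part
	startUids []uint64         // sorted keys of parts
}

// removeEmptyParts deletes empty parts from the map itself, so they
// cannot be written out later. Deleting during range is safe in Go.
func (l *multiPartList) removeEmptyParts() {
	for start, p := range l.parts {
		if len(p.uids) == 0 {
			delete(l.parts, start)
		}
	}
}

// refreshStartUids rebuilds the sorted list of startUids from the map
// keys, instead of patching the list whenever a part is split or removed.
func (l *multiPartList) refreshStartUids() {
	starts := make([]uint64, 0, len(l.parts))
	for start := range l.parts {
		starts = append(starts, start)
	}
	sort.Slice(starts, func(i, j int) bool { return starts[i] < starts[j] })
	l.startUids = starts
}
```

Calling `removeEmptyParts` and then `refreshStartUids` at the end of a step keeps the list and the map in agreement by construction, which is the property the bug described above violated.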
Reviewed 2 of 2 files at r1.
Reviewable status: complete! All files reviewed, all discussions resolved (waiting on @manishrjain)
Currently the code keeps track of the split startUids and updates them as the posting lists are split or empty parts are removed. Instead, update the list of startUids at the end of each step based on the contents of the map of startUids to posting lists. This removes the need to track the same piece of information in two different ways. Also, fix a bug where empty parts were removed from the list of splits but not from the map. This caused empty posting lists to be written to disk, although it didn't affect normal execution because those parts were not accessible from the main posting list. (cherry picked from commit c51d007)