Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avoid repeated term serialization #156

Closed
jacoscaz opened this issue Feb 1, 2023 · 2 comments
Closed

Avoid repeated term serialization #156

jacoscaz opened this issue Feb 1, 2023 · 2 comments

Comments

@jacoscaz
Copy link
Collaborator

jacoscaz commented Feb 1, 2023

We can improve write performance levels by avoiding duplicate and unnecessary term serializations. Terms can simply be serialized once and their serialized versions can then be concatenated around based on each index's permutation of terms. Whether the ROI of doing this is high enough to justify the effort is TBD.

@jacoscaz jacoscaz converted this from a draft issue Feb 1, 2023
@jacoscaz jacoscaz changed the title **Avoid repeated term serialization** Avoid repeated term serialization Feb 1, 2023
@jacoscaz
Copy link
Collaborator Author

jacoscaz commented Feb 1, 2023

In-depth profiling on a private instance working with roughly 250k quads indicates that this might potentially accelerate ingestion by 10 - 15%.

@jacoscaz
Copy link
Collaborator Author

closed via 5a39a65

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant