Pattern acceleration #85
…n, primary constructor for IndexStatementCondition explanatory comments for LatestCondition and EarliestCondition
…quals and notEquals tests
… class, refactor tests
…atementConditionTest
… testing for ConditionWalker bloom search terms
…t and apply spotless" This reverts commit 7dd5e33.
public BloomFilterTempTable(DSLContext ctx, Table<?> parentTable, long bloomTermId, Set<Token> tokenSet) {
    this.ctx = ctx;
    this.parentTable = parentTable;
It would be ideal if the primary constructor contained no code.
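One way to read this suggestion is sketched below: the primary constructor only assigns fields, and any argument conversion lives in a secondary constructor that delegates to it. The class `TermTable` and its fields are hypothetical stand-ins, not the actual `BloomFilterTempTable` from this PR.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for a class such as BloomFilterTempTable; all names are illustrative.
final class TermTable {
    private final String parentTable;
    private final long bloomTermId;
    private final Set<String> tokens;

    // Secondary constructor: converts its arguments, then delegates to the primary one.
    TermTable(String parentTable, long bloomTermId, String... tokens) {
        this(parentTable, bloomTermId, new HashSet<>(Arrays.asList(tokens)));
    }

    // Primary constructor: field assignments only, no other logic.
    TermTable(String parentTable, long bloomTermId, Set<String> tokens) {
        this.parentTable = parentTable;
        this.bloomTermId = bloomTermId;
        this.tokens = tokens;
    }

    Set<String> tokens() {
        return tokens;
    }
}
```

With this layout, `new TermTable("parent", 1L, "a", "b")` goes through the secondary constructor, while tests can hand a prepared `Set` straight to the primary one.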
    filterBAOS.close();
}
catch (IOException e) {
    throw new UncheckedIOException(e);
Perhaps it would be worth adding some extra context to this exception, in addition to the original exception, to help with debugging in the future.
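A minimal sketch of what such context could look like, using the two-argument `UncheckedIOException(String, IOException)` constructor from the JDK. The helper class `FilterErrors`, the method `withContext`, and the message wording are assumptions made for illustration; which details are worth including depends on what is in scope at the failing call.

```java
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical helper; the class name, method name and message format are illustrative only.
final class FilterErrors {

    // Wraps the original IOException and adds the bloom term id to the message,
    // so the failing filter can be identified from a log line or stack trace.
    static UncheckedIOException withContext(long bloomTermId, IOException cause) {
        return new UncheckedIOException(
                "Failed to write bloom filter bytes for bloomTermId <" + bloomTermId + ">",
                cause
        );
    }
}
```

In the catch block this would read `throw FilterErrors.withContext(bloomTermId, e);`, which keeps the original exception available through `getCause()`.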
Set<Token> tokenSet = new PatternMatch(ctx, "test").tokenSet();
BloomFilterTempTable tempTable = new BloomFilterTempTable(ctx, table, 0L, tokenSet);
Assertions.assertThrows(RuntimeException.class, tempTable::generateCondition);
Check that RuntimeException.getMessage() returns the expected value?
}

@Test
void singleTalbePatternMatchTest() {
Typo: "Talbe" in the test name should be "Table".
Please rebase this on the latest main branch; it has commits that do not belong here, e.g. license changes.
Moving to another branch
Pattern acceleration feature that activates bloom filtering for a set regex pattern. The goal is to limit bloom filtering to certain patterns, such as UUIDs.
Flow:
Notes: The tokenizer max token count is set to 0 so that only major tokens are produced, since that is what dpf_03 currently uses to tokenize the bloom filter tables.
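As a rough illustration of the gating idea in the description, the sketch below enables bloom filtering only when a search term contains something that matches a configured regex, here a UUID-shaped pattern. The class `PatternGate`, the method `isAccelerated`, and the regex itself are assumptions; the PR may store and apply its patterns differently.

```java
import java.util.regex.Pattern;

// Hypothetical gate; the class name, method name and the UUID regex are illustrative only.
final class PatternGate {

    // Assumed pattern: 8-4-4-4-12 hexadecimal groups, as in a textual UUID.
    private static final Pattern UUID_PATTERN = Pattern.compile(
            "[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{12}"
    );

    // Returns true when the search term contains at least one match for the pattern,
    // meaning the bloom filter would be consulted for this term.
    static boolean isAccelerated(String searchTerm) {
        return UUID_PATTERN.matcher(searchTerm).find();
    }
}
```

Under these assumptions a term such as `id=123e4567-e89b-12d3-a456-426614174000` would take the bloom filter path, while a plain word such as `test` would not.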