Extending join fuzz tests to support join filtering #10728
Thanks @edmondop, I'm planning to read through it soon.
Hi @edmondop, are you planning to proceed with this PR?
Yes, I was stuck because one test doesn't pass. I will need to investigate, unless you have some hints to share.
If you are referring to the failed CI test, it is
which should probably be straightforward to fix. Can we check row counts first?
Thanks for the tip. Checking the row counts shows that the filtered right join returns a different number of rows with SMJ than with HashJoin. What could be the root cause? I tried changing the column used for the filter by uncommenting the section commented out below, but that breaks all the other test cases too:

    fn less_than_10_join_filter(schema1: Arc<Schema>, _schema2: Arc<Schema>) -> JoinFilter {
        let less_than_100 = Arc::new(BinaryExpr::new(
            Arc::new(Column::new("a", 0)),
            Operator::Lt,
            Arc::new(Literal::new(ScalarValue::from(100))),
        )) as _;
        let column_indices = vec![
            ColumnIndex {
                index: 0,
                side: JoinSide::Left,
            },
            // ColumnIndex {
            //     index: 0,
            //     side: JoinSide::Right,
            // },
        ];
        let intermediate_schema =
            Schema::new(vec![schema1.field_with_name("a").unwrap().to_owned()]);
        JoinFilter::new(less_than_100, column_indices, intermediate_schema)
    }
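For readers following along, the "check row counts first" idea can be sketched without DataFusion at all. The block below is only an illustration under stated assumptions: the `Row` layout, the `a < 100` predicate on the left side, and the helper names `nested_loop_right_join`, `hash_right_join`, and `compare_results` are all hypothetical and are not part of the DataFusion API or of this PR. It runs the same filtered right join through two independent implementations, compares the row counts, and only then compares the sorted contents.

```rust
use std::collections::HashMap;

// Hypothetical row layout for this sketch: (join key, column `a`).
type Row = (i32, i32);
// A right join keeps every right row; the left side is None when nothing matched.
type Joined = (Option<Row>, Row);

// Assumed filter, mirroring `a < 100` evaluated on the left side only.
fn join_filter(left: &Row, _right: &Row) -> bool {
    left.1 < 100
}

// Reference implementation: compare every right row with every left row.
fn nested_loop_right_join(left: &[Row], right: &[Row]) -> Vec<Joined> {
    let mut out = Vec::new();
    for r in right {
        let mut matched = false;
        for l in left {
            if l.0 == r.0 && join_filter(l, r) {
                out.push((Some(*l), *r));
                matched = true;
            }
        }
        // A right row whose candidates all fail the filter is still emitted, null-padded.
        if !matched {
            out.push((None, *r));
        }
    }
    out
}

// Second implementation: build a hash table on the left side, probe with the right side.
fn hash_right_join(left: &[Row], right: &[Row]) -> Vec<Joined> {
    let mut table: HashMap<i32, Vec<Row>> = HashMap::new();
    for l in left {
        table.entry(l.0).or_default().push(*l);
    }
    let mut out = Vec::new();
    for r in right {
        let mut matched = false;
        if let Some(candidates) = table.get(&r.0) {
            for l in candidates {
                if join_filter(l, r) {
                    out.push((Some(*l), *r));
                    matched = true;
                }
            }
        }
        if !matched {
            out.push((None, *r));
        }
    }
    out
}

// Cheap check first (row counts), then an order-insensitive content check.
fn compare_results(a: &[Joined], b: &[Joined]) -> Result<(), String> {
    if a.len() != b.len() {
        return Err(format!("row count mismatch: {} vs {}", a.len(), b.len()));
    }
    let (mut a, mut b) = (a.to_vec(), b.to_vec());
    a.sort();
    b.sort();
    if a == b {
        Ok(())
    } else {
        Err("same row count, different contents".to_string())
    }
}
```

Calling `compare_results(&nested_loop_right_join(&l, &r), &hash_right_join(&l, &r))` on randomly generated inputs gives a small fuzz loop. Reporting the count mismatch before the content mismatch keeps the failure message short when two implementations disagree on how many null-padded rows to emit.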
Thanks @edmondop, would you be able to fetch the exact example where HJ and SMJ mismatch?
These are the two tests that fail
The right join passes on my machine, so I think I need to rebase, but it also seems easier to solve.
@edmondop I think it is a solid start already. Let's comment out these tests for now and address them separately.
I took the liberty of ignoring the tests.
The right filtered join fuzz test failures are probably related to #10882.
Fixed formatting. @comphead, all the checks are passing now.
LGTM, well done @edmondop.
We plan to address anti and right filtered joins separately and then uncomment the tests.
FYI, CI is failing on main after the merge.
* Extending join fuzz tests to support join filtering --------- Co-authored-by: Oleks V <[email protected]>
Thanks @ozankabak and @edmondop.
Which issue does this PR close?
Closes #10659