-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test Iceberg cost-based plans on TPC-H and TPC-DS #14489
Conversation
763aed1
to
6949624
Compare
6949624
to
5928b3f
Compare
rebased after #14497 merged |
Migrate convenience methods from HiveMinioDataLake to Minio itself, so that it's viable to use MinIO alone without Hadoop container.
cf24555
to
0d81a3c
Compare
0d81a3c
to
53bbeac
Compare
For completeness, I've pulled TPC-H dataset metadata as well. |
@@ -142,13 +151,20 @@ protected void generate() | |||
getQueryPlanResourcePath(queryResourcePath)); | |||
createParentDirs(queryPlanWritePath.toFile()); | |||
write(generateQueryPlan(readQuery(queryResourcePath)).getBytes(UTF_8), queryPlanWritePath.toFile()); | |||
System.out.println("Generated expected plan for query: " + queryResourcePath); | |||
log.info("Generated expected plan for query: %s", queryResourcePath); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: could be separate commit :P
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
@@ -16,8 +16,8 @@ | |||
import java.util.stream.Stream; | |||
|
|||
/** | |||
* This class tests cost-based optimization rules related to joins. It contains unmodified TPCH queries. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: comment improvements in existing code could be separate commit
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
System.out goes to logs anyway, just the context is stripped.
This uses TPC-H sf1000 and TPC-DS sf1000 Iceberg ORC data sets' metadata files generated on Starburst's benchmark infrastructure. The tables have no history and were created using single CTAS.
CI #14519 |
53bbeac
to
ede2e27
Compare
No description provided.