-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Replace usage of hadoop.fs.Path with String in Delta Lake #16256
Conversation
@ebyhr Could you run tests with secretes? |
/test-with-secrets sha=fe0df4c21f640800bb37143aba3cc6e24f43f686 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good overall, couple small things.
I'd also dd Delta
or Delta Lake
to the commit message somewhere.
@@ -724,13 +724,13 @@ public void createTable(ConnectorSession session, ConnectorTableMetadata tableMe | |||
} | |||
Path targetPath = new Path(location); | |||
ensurePathExists(session, targetPath); | |||
Path deltaLogDirectory = getTransactionLogDir(targetPath); | |||
String deltaLogDirectory = getTransactionLogDir(location); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we migrate targetPath
two lines up as well?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's still used in code which will be removed in #16200. So I would wait for that PR to land and remove it during rebase.
plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/StringPathUtil.java
Outdated
Show resolved
Hide resolved
@@ -107,7 +107,7 @@ public void flush() | |||
} | |||
|
|||
String clusterId = commitInfoEntry.get().getCommitInfo().getClusterId(); | |||
logSynchronizer.write(session, clusterId, logEntry, bos.toByteArray()); | |||
logSynchronizer.write(session, clusterId, new Path(logEntry), bos.toByteArray()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we migrate logSynchronizer as well? can be follow up, I know there are a lot of these
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was planning to do that in separate commit as some implementation of LogSynchronizer
are strongly coupled with org.apache.hadoop.fs.Path
rebased to resolve conflicts |
plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/StringPathUtil.java
Outdated
Show resolved
Hide resolved
...rc/test/java/io/trino/plugin/deltalake/transactionlog/checkpoint/TestTransactionLogTail.java
Outdated
Show resolved
Hide resolved
plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/StringPathUtil.java
Outdated
Show resolved
Hide resolved
plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/DeltaLakeWriter.java
Outdated
Show resolved
Hide resolved
plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/StringPathUtil.java
Outdated
Show resolved
Hide resolved
Rebased to use
|
@@ -825,7 +824,7 @@ public DeltaLakeOutputTableHandle beginCreateTable(ConnectorSession session, Con | |||
Path targetPath = new Path(location); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did you omit this occurence of org.apache.hadoop.fs.Path
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Those changes are in follow up PR: #16469
/test-with-secrets sha=1fb60c69ab92fdbc7487c74d8944b581bb12b9d4 |
...-delta-lake/src/main/java/io/trino/plugin/deltalake/transactionlog/TransactionLogAccess.java
Outdated
Show resolved
Hide resolved
...rc/test/java/io/trino/plugin/deltalake/transactionlog/checkpoint/TestTransactionLogTail.java
Outdated
Show resolved
Hide resolved
...rc/test/java/io/trino/plugin/deltalake/transactionlog/checkpoint/TestTransactionLogTail.java
Outdated
Show resolved
Hide resolved
The CI workflow run with tests that require additional secrets finished as failure: https://github.com/trinodb/trino/actions/runs/4425399001 |
/test-with-secrets sha=94a4e3e6bcb2eb8c5f031290930d0c75a66bcac1 |
The CI workflow run with tests that require additional secrets finished as failure: https://github.com/trinodb/trino/actions/runs/4435274449 |
...rc/test/java/io/trino/plugin/deltalake/transactionlog/checkpoint/TestTransactionLogTail.java
Outdated
Show resolved
Hide resolved
/test-with-secrets sha=94a4e3e6bcb2eb8c5f031290930d0c75a66bcac1 |
Rebased to include fix for skipped tests in CI |
Description
Origin of this change is to migrate to TrinoFileSystem
Relates to #16020
Some hadoop.fs.Path will be migrated in following PR as it needs some additional path resolution functions.
Release notes
(x) This is not user-visible or docs only and no release notes are required.