Fix table listing concurrency in FileHiveMetastore #20134

@findepi findepi commented Dec 15, 2023

Before this change, `FileHiveMetastore` relied on `fileSystem.listFiles` to find schemas, or tables within a schema. `listFiles` is recursive and, on a local file system, fails when files or directories are modified concurrently. This commit replaces the recursive `listFiles` listing with a non-recursive `listDirectories` call, relying on the fact that the listed entities (schemas and tables) are represented as directories.

Partially fixes #20130
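
To illustrate the difference between the two listing strategies described above, here is a minimal sketch. It does not use Trino's file system API; it assumes plain `java.nio.file` as a stand-in, and the class name, method names, and the `metadata.json` file name are hypothetical. The recursive variant has to visit every file below the schema directory, so a file deleted mid-walk can surface as an `UncheckedIOException`; the non-recursive variant reads only the immediate children.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class DirectoryListingSketch {
    // Recursive approach (stand-in for a recursive file listing): walk every
    // file under schemaDir and derive table names from the first path segment.
    // The walk touches nested entries, so concurrent modification of those
    // entries can make the stream fail part-way through.
    static List<String> tablesViaRecursiveWalk(Path schemaDir) throws IOException {
        try (Stream<Path> paths = Files.walk(schemaDir)) {
            return paths
                    .filter(path -> Files.isRegularFile(path))
                    .map(schemaDir::relativize)
                    .filter(relative -> relative.getNameCount() > 1)
                    .map(relative -> relative.getName(0).toString())
                    .distinct()
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    // Non-recursive approach (stand-in for a directory listing): look only at
    // the immediate children of schemaDir and keep those that are directories.
    static List<String> tablesViaDirectoryListing(Path schemaDir) throws IOException {
        List<String> names = new ArrayList<>();
        try (DirectoryStream<Path> children =
                Files.newDirectoryStream(schemaDir, path -> Files.isDirectory(path))) {
            for (Path child : children) {
                names.add(child.getFileName().toString());
            }
        }
        Collections.sort(names);
        return names;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical layout: one directory per table, each holding a metadata file.
        Path schemaDir = Files.createTempDirectory("schema");
        for (String table : List.of("orders", "customers")) {
            Path tableDir = Files.createDirectories(schemaDir.resolve(table));
            Files.writeString(tableDir.resolve("metadata.json"), "{}");
        }
        System.out.println(tablesViaRecursiveWalk(schemaDir));
        System.out.println(tablesViaDirectoryListing(schemaDir));
    }
}
```

Both methods return the same names for a quiescent directory tree. The difference lies in how much of the tree each one reads, and therefore how exposed each one is to concurrent writers.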

@cla-bot cla-bot bot added the cla-signed label Dec 15, 2023
@github-actions github-actions bot added tests:hive hive Hive connector labels Dec 15, 2023
@findepi findepi added the no-release-notes This pull request does not require release notes entry label Dec 15, 2023
@findepi findepi force-pushed the findepi/fix-table-listing-concurrency-in-filehivemetastore-c0459c branch 2 times, most recently from 4906d33 to 95487d8 Compare December 15, 2023 12:30
@findepi findepi force-pushed the findepi/fix-table-listing-concurrency-in-filehivemetastore-c0459c branch from 95487d8 to 5a10cbd Compare December 15, 2023 12:41
@findepi findepi merged commit b6eabd5 into trinodb:master Dec 15, 2023
@findepi findepi deleted the findepi/fix-table-listing-concurrency-in-filehivemetastore-c0459c branch December 15, 2023 15:49
@github-actions github-actions bot added this to the 436 milestone Dec 15, 2023
Labels
cla-signed hive Hive connector no-release-notes This pull request does not require release notes entry