You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fileset is a new concept brought in 0.5.0 to manage the non-tabular data, the current implementation uses HCFS to manage the physical data. With HCFS, the Hadoop catalog should support different underlying storage, but currently we only verified local file system and HDFS.
In this issue, we should also support S3, to make the fileset hadoop catalog work with S3 object store.
Motivation
The reason to support S3 is that it is vastly used on the public cloud, we should add this support anyway.
Describe the solution
No response
Additional context
No response
The text was updated successfully, but these errors were encountered:
… catalog (apache#4232)
### What changes were proposed in this pull request?
Add S3 support for Fileset Hadoop catalog. We only add hadoop-aws
dependency actually, most of the work is conducting tests.
### Why are the changes needed?
Fix: apache#3379
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
IT.
---------
Co-authored-by: zhanghan18 <[email protected]>
Co-authored-by: yuqi <[email protected]>
Describe the feature
Fileset is a new concept brought in 0.5.0 to manage the non-tabular data, the current implementation uses HCFS to manage the physical data. With HCFS, the Hadoop catalog should support different underlying storage, but currently we only verified local file system and HDFS.
In this issue, we should also support S3, to make the fileset hadoop catalog work with S3 object store.
Motivation
The reason to support S3 is that it is vastly used on the public cloud, we should add this support anyway.
Describe the solution
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: