Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add support for Fabric OneLake, it is already supported by Delta_rs #38726

Open
djouallah opened this issue Nov 15, 2023 · 1 comment
Open

add support for Fabric OneLake, it is already supported by Delta_rs #38726

djouallah opened this issue Nov 15, 2023 · 1 comment

Comments

@djouallah
Copy link

Describe the enhancement requested

Delta_rs added support for Fabric OneLake recently, it will be nice to add the support for pyarrow dataset to read parquet and csv etc from OneLake Fabric

delta-io/delta-rs#1642

currently I am using this code to read from dataset and save ad delta table but it works only with local path

import pyarrow.dataset as ds
from deltalake.writer import write_deltalake
aadToken = mssparkutils.credentials.getToken('storage')
storage_options={"bearer_token": aadToken, "use_fabric_endpoint": "true"}
sf=100
rowgroup           = 2000000
nbr_rowgroup_File  = 8 * rowgroup
for tbl in ['lineitem','nation','region','customer','supplier','orders','part','partsupp'] :
     print(tbl)
     dataset = ds.dataset(f'/lakehouse/default/Files/{sf}/{tbl}',format="parquet")
     write_deltalake(f"abfss://[email protected]/test.Lakehouse/Tables/{tbl}"\
     ,dataset\
     ,mode='overwrite',overwrite_schema=True,max_rows_per_file =nbr_rowgroup_File,min_rows_per_group=rowgroup,max_rows_per_group=rowgroup\
     ,storage_options=storage_options)

Component(s)

Format, Integration, Python

@AlenkaF
Copy link
Member

AlenkaF commented Dec 4, 2023

As the support for Azure Blob Storage is being in progress in Apache Arrow C++ (https://github.com/search?q=repo%3Aapache%2Farrow+%5BC%2B%2B%5D%5BFS%5D%5BAzure%5D&type=issues) and it will be available in Python as a follow-up also, would that be something we can use to read from OneLake Fabric?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants