Large file downloads hang indefinitely #28713
Comments
Thank you for the feedback @YuvalItzchakov. We will investigate and get back to you asap.
Hi @YuvalItzchakov Yuval, I believe you are running into a couple issues related to timeouts which are causing this behavior. Firstly, you are setting the

To configure client-side timeouts, see this section of our README. For reads, you would likely be interested in

The other issue here is that before version

NOTE: In
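For readers following along, here is a minimal sketch of what configuring client-side timeouts might look like. The inline code in the comment above was lost, so which settings it named is an assumption: `connection_timeout`, `read_timeout` and `max_chunk_get_size` are client keyword arguments described in the azure-storage-blob README, the numeric values are purely illustrative, and `build_timeout_kwargs` and `download_to_file` are hypothetical helper names.

```python
def build_timeout_kwargs(connection_timeout=20, read_timeout=60, chunk_mib=4):
    """Collect client keyword arguments (illustrative values, not recommendations)."""
    return {
        "connection_timeout": connection_timeout,        # seconds to establish a connection
        "read_timeout": read_timeout,                    # seconds to wait for data on a read
        "max_chunk_get_size": chunk_mib * 1024 * 1024,   # bytes requested per chunk
    }


def download_to_file(account_url, container, blob, credential, path):
    """Hypothetical helper: download one blob to a local file with the settings above."""
    # Imported lazily so build_timeout_kwargs can be used without the SDK installed.
    from azure.storage.blob import BlobClient

    client = BlobClient(
        account_url,
        container_name=container,
        blob_name=blob,
        credential=credential,
        **build_timeout_kwargs(),
    )
    with open(path, "wb") as fh:
        # max_concurrency controls how many chunks are fetched in parallel.
        client.download_blob(max_concurrency=4).readinto(fh)
```

Whether these values suit a given network is something to test; the point is only that the timeouts are set on the client rather than per call.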
Thank you for the elaborate explanation @jalauzon-msft. Will try it and report back.
Hi @jalauzon-msft. Updating the SDK version has indeed mitigated the issue using

A follow up question: Any reason why we're seeing these chunk downloads fail particularly on large files? Could there be something problematic at the blob storage API level?
Hi @YuvalItzchakov Yuval, because the adjustments to

If you have not changed the default settings for chunk sizes, the SDK will download the blob in 4 MiB chunks and the new default

Given that information and the error you linked in the original post

There are a number of things that could cause this, but something you could try is lowering your

If you still think the network on your end is okay, you can try opening a Support ticket to have the backend team take a look. They will have more resources to determine the cause but could come to the same conclusion I have.
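To put the 4 MiB chunk size in perspective, the arithmetic below is a rough sketch of how many chunk requests a blob of the reported size involves. It assumes "17GB" means 17 GiB and ignores any larger initial request the SDK may make; `chunk_count` is a hypothetical helper, not part of the SDK.

```python
def chunk_count(blob_size_bytes, chunk_size_bytes=4 * 1024 * 1024):
    """Number of chunk requests needed to cover the blob (integer ceiling division)."""
    return -(-blob_size_bytes // chunk_size_bytes)


blob_size = 17 * 1024**3          # assumed: 17 GiB
print(chunk_count(blob_size))     # 4352
```

With thousands of separate requests per download, even a small per-request chance of a stalled connection makes at least one stuck chunk likely, which is one plausible reason large files are affected more often.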
Hi @YuvalItzchakov. Thank you for opening this issue and giving us the opportunity to assist. We believe that this has been addressed. If you feel that further discussion is needed, please add a comment with the text “ |
Hi @YuvalItzchakov, since you haven’t asked that we “ |
Package Name:
azure-core 1.25.1
azure-storage-blob 12.8.1
azure-storage-common 2.1.0
azure-storage-file 2.1.0
Operating System
Python Version:
Python 3.7.13 (Server) / Python 3.8.13 (Local)
Describe the bug
While trying to download large files >= 17GB, both the Python SDK and the Azure CLI get stuck before finishing the download, usually around 99.(3/5)%. This is because some of the chunk downloads hang and never time out or retry.
While trying to download a file with the following parameters, using BlobClient.download_blob:
The chunks that are stuck yield the following stack trace (this is from the live stuck process):
Locally:
Remote (server):
And the following error message is printed out (this is from the Azure CLI):
Using the Azure CLI this reproduces every time when trying to fetch a large file. When using the Python SDK directly it will sometimes fail and sometimes succeed.
To Reproduce
Steps to reproduce the behavior (Azure CLI):
Run az storage blob download, passing the above parameters to the command.
Steps to reproduce the behavior (Python SDK directly):
Expected behavior
Either the chunk that is hanging should fail on timeout and retry, or if it exhausted all retries it should fail the download. Otherwise, the download should complete.
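The expected behavior described above can be sketched with the standard library alone. This is not how the SDK implements retries; it is a generic illustration, and `fetch_with_retry`, `fetch_chunk`, `timeout_s` and `max_retries` are hypothetical names. Each attempt gets a deadline, a timed-out or failed attempt is retried, and the download fails once retries are exhausted.

```python
import concurrent.futures as cf


def fetch_with_retry(fetch_chunk, index, timeout_s=5.0, max_retries=3):
    """Call fetch_chunk(index) with a per-attempt deadline, retrying on timeout or OSError."""
    last_error = None
    for _attempt in range(max_retries):
        pool = cf.ThreadPoolExecutor(max_workers=1)
        future = pool.submit(fetch_chunk, index)
        try:
            return future.result(timeout=timeout_s)
        except (cf.TimeoutError, OSError) as exc:
            # Note: the worker thread is abandoned, not killed. Real code should
            # also set socket-level timeouts so the blocked call eventually returns.
            last_error = exc
        finally:
            pool.shutdown(wait=False)
    raise RuntimeError(f"chunk {index} failed after {max_retries} attempts") from last_error
```

The key property is that no single chunk can block the whole download forever: it either succeeds, is retried, or surfaces an error to the caller.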
Additional Context
Related bugs: