You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think the intent is that it streams the output continuously to a file, but from what I can tell, this is actually not the case. I added a print statement into it to test how many times it loops and how many bytes readavailable(stream) returns, and in all cases it just returns the whole file in one go at the end of the download (even if the file is 1 GiB).
Unless the issue is on my end, this can be a problem when downloading large, multi-gigabyte files, as memory use unnecessarily blows up. Also, it actually looks like at some point there are multiple copies of the data in memory, but I haven't looked into it very closely (there is a memory spike where ~1GiB file needs several GiBs).
Looking at how
s3_get_file
is implementedAWSS3.jl/src/AWSS3.jl
Lines 176 to 191 in f3989bc
I think the intent is that it streams the output continuously to a file, but from what I can tell, this is actually not the case. I added a print statement into it to test how many times it loops and how many bytes
readavailable(stream)
returns, and in all cases it just returns the whole file in one go at the end of the download (even if the file is 1 GiB).Unless the issue is on my end, this can be a problem when downloading large, multi-gigabyte files, as memory use unnecessarily blows up. Also, it actually looks like at some point there are multiple copies of the data in memory, but I haven't looked into it very closely (there is a memory spike where ~1GiB file needs several GiBs).
Package versions
The text was updated successfully, but these errors were encountered: