-
Notifications
You must be signed in to change notification settings - Fork 370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What is the best way to write large DataFrames efficiently and with high performance in Julia while minimizing memory usage? #3406
Comments
Writing a data frame is not a part of DataFrames.jl functionality. I would assume that serializing it using the |
Which data format is the most efficient in Julia? The .arrow format consumes a lot of memory. For example, in R, the .fst format is considered the best for efficient memory usage and high performance. Similarly, in Julia, which format is optimal for writing DataFrames with high performance and memory efficiency? |
There is no single format that is best in all aspects, so "the best" depends on many factors.
(and even here you see that your question ends up a 5-criteria problem) Also these are reported when run on a laptop using 1 thread. Benchmarks might be different when wanting best performance on a mulit-core server scenario. Your question is essentially open ended and unrelated with DataFrames.jl (it is a general Julia question). Such questions are welcome, but it is best to discuss them in an open-ended forum, as you might get the best advice there (as me or other DataFrames.jl maintainers might not be aware of all the options). I recommend you to post it on https://discourse.julialang.org/. |
Thank you very much for this great information. |
No description provided.
The text was updated successfully, but these errors were encountered: