[WISH] Choosing columns in LOAD-CSV #167
Comments
I don't see how it would be faster. You would still have to load the whole row and then remove the unused columns from it. That is the same computational complexity as loading all of the rows and then removing unused columns from all of them, only with higher peak RAM usage. In fact such per-row filtering would even be slower if one uses Besides, I don't think we should bake into If this is such a common thing for you, why not simply wrap the
Ultimately we want the codecs to be incremental, so you would also be able to filter out data as it appears. That would also eliminate the issue of handling the multiple result formats that a decoder can produce or an encoder can accept. This also ties into the idea of having a table! datatype, where row/column operations would be a given.
You are right, somehow, I thought
I usually work with relatively big CSV files with lots of columns (over 1000) exported from other systems and then I process them with Red.
Even though it is not difficult to add an intermediate step to delete unwanted columns from a CSV file, it would be nice to have a refinement to choose which columns will be loaded. This way loading would also be faster for big files.
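For illustration, the wrapper approach suggested in the comments might look roughly like the sketch below. `select-columns` is a hypothetical helper, not part of Red, and the sketch assumes that `load-csv` in its default mode returns a block of rows, each row being a block of strings. It filters after loading, so it saves no parsing time. It only drops the unwanted columns from the result.

```red
Red []

; Hypothetical helper (not part of Red): keep only the given column positions.
select-columns: func [
    rows    [block!] "Block of rows, each row a block of values"
    indices [block!] "1-based column positions to keep"
][
    collect [
        foreach row rows [
            ; build a trimmed copy of the row and keep it as a single item
            keep/only collect [
                foreach i indices [keep pick row i]
            ]
        ]
    ]
]

; Assumption: LOAD-CSV's default mode yields a block of rows of strings.
data: load-csv "id,name,city^/1,Ann,Oslo^/2,Bob,Rome"
probe select-columns data [1 3]
```

With over 1000 columns, the index block would more likely be computed from the header row than written by hand.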