Allow specifying options in load/write functions #108
Comments
Relevant point. Defining the Map in the VTL script seems to be impossible. I imagine two ways to enable custom configuration:
I suggest we discuss this during our next call.
Hi @jornfranke, We discussed the possibilities with @hadrienk:
And instantiate with
At this point, parameters are positional only in VTL, which would lead to an ugly syntax if we wanted to expose many options as parameters (see this issue posted on the VTL TF repo).
What do you think?
Thanks for the feedback. We implemented a workaround internally and will look into the second option you propose. The first option would probably make sense once Trevas supports it.
Currently, one can load/write data only with specific default options without being able to customize them.
For instance, in loadCsv or writeCsv I cannot specify a separator other than ";". Many other options are relevant when reading or writing CSV files.
Likewise, loadParquet/writeParquet do not allow specifying options such as compression (cf. here).
I propose adding functions, e.g. loadCsvWithParameters, that take as input the path and a Map<String,String> that can carry any options. I am not exactly sure how one can pass a Map<String,String> initialized with data in VTL. Alternatively, one could simply provide a "config" dataset (Dataset.class) with two columns (key, value).
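To make the "config" dataset idea more concrete, here is a minimal sketch of how two-column (key, value) rows could be folded into a Map<String,String> of options on the Java side. The class name LoadOptions, the method fromRows, and the option keys ("sep", "header") are hypothetical and not part of Trevas. The only default taken from this issue is the ";" separator.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Trevas: turns a two-column (key, value)
// "config" dataset into a Map<String, String> of reader/writer options.
final class LoadOptions {

    // Assumed default; the issue only states that ";" is the current CSV separator.
    // The key name "sep" is an assumption made for this sketch.
    static final Map<String, String> CSV_DEFAULTS = Map.of("sep", ";");

    private LoadOptions() {}

    /**
     * Merges user-supplied (key, value) rows over the given defaults.
     * Each row must hold exactly two non-null strings: key, then value.
     */
    static Map<String, String> fromRows(Map<String, String> defaults,
                                        List<List<String>> rows) {
        Map<String, String> options = new LinkedHashMap<>(defaults);
        for (List<String> row : rows) {
            if (row.size() != 2 || row.get(0) == null || row.get(1) == null) {
                throw new IllegalArgumentException(
                        "Expected a (key, value) row, got: " + row);
            }
            // User-supplied rows override defaults; later rows override earlier ones.
            options.put(row.get(0), row.get(1));
        }
        return options;
    }

    public static void main(String[] args) {
        // Rows as they might come out of a two-column "config" dataset.
        List<List<String>> config = List.of(
                List.of("sep", ","),
                List.of("header", "true"));
        System.out.println(fromRows(CSV_DEFAULTS, config));
    }
}
```

The resulting map could then be handed to whatever reader or writer sits behind loadCsv/writeCsv. Keeping the defaults in one place means scripts that pass no config dataset behave as they do today.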