cobrix with RDD as input #576
Comments
I'm not sure; Cobrix just implements the Spark data source interfaces. You can try it and let me know. There is also a longer path: take a look at https://github.com/AbsaOSS/cobrix#reading-ascii-text-file
Loading of CSV files this way is a Spark feature specifically for CSV. But it makes perfect sense to have a similar functionality in Cobrix. It could look like
I'm thinking supporting This feature could be a nice alternative to custom record extractors. Does it make sense for your use case? |
Yes, this helps.
I was thinking... this feature is going to be available only from Scala (or in Python via the JVM gateway). Is it still useful?
We use only the Python API. If it is Scala-only, we won't be able to use it.
That's unfortunate. Still, this is a very nice feature to have.
Sure, and thank you.
spark.read.csv accepts either a file path or an RDD as input. Does Cobrix support the same?
We are getting some additional rows. Using an RDD, we are able to remove the extra rows before parsing. If RDDs could be used with Cobrix, the filtering and the parsing could happen in a single step, with no extra processing.
I would like to know whether Cobrix supports using RDDs as input.
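For reference, the CSV behaviour described above can be sketched in PySpark as follows. This is only an illustration of the Spark CSV reader, not of Cobrix: the filtering rule (`is_data_row`), the helper names, and the input path are hypothetical, and a real job would use whatever rule identifies the unwanted rows.

```python
from typing import Iterable, List


def is_data_row(line: str) -> bool:
    """Hypothetical rule: drop blank lines and lines that start with '#'."""
    stripped = line.strip()
    return bool(stripped) and not stripped.startswith("#")


def keep_data_rows(lines: Iterable[str]) -> List[str]:
    """Apply the same rule to a plain Python iterable (handy for local checks)."""
    return [line for line in lines if is_data_row(line)]


def read_filtered_csv(spark, path: str):
    """Filter raw text lines as an RDD, then parse the surviving lines as CSV.

    `spark` is an existing SparkSession; `path` is a hypothetical input location.
    """
    raw = spark.sparkContext.textFile(path)   # RDD of strings, one per line
    cleaned = raw.filter(is_data_row)         # remove the extra rows
    # DataFrameReader.csv accepts an RDD of strings as well as a path.
    return spark.read.csv(cleaned, header=True)
```

With a running session, `read_filtered_csv(spark, "input.csv")` would return a DataFrame built only from the rows that pass the filter. The question in this issue is whether Cobrix could accept an RDD in the same way.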