PySpark write to OpenSearch #153
Comments
What is needed for that? Is it possible that the current implementation already works by using this jar in this fashion: https://www.bmc.com/blogs/write-apache-spark-elasticsearch-python/
I think it's the same as #76
Thanks @junhl. I closed the other one. @wbeckler that doesn't work. In the case of Elasticsearch, you use the jar to write a PySpark DataFrame like this: (dataFrame.write
This functionality is not related just to PySpark; it also applies to Scala Spark apps that use Spark Structured Streaming, df.write.format...
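For readers following along, here is a minimal sketch of the kind of `df.write.format(...)` chain being discussed. It is not taken from this thread. The format name `org.opensearch.spark.sql` comes from the class named in the exception reported in this issue. The option keys `opensearch.nodes` and `opensearch.port` are assumptions that mirror the `es.*` keys of elasticsearch-hadoop, and the helper names `opensearch_options` and `write_to_opensearch` are hypothetical. Check the option keys against the connector version you actually load.

```python
from typing import Any, Dict


def opensearch_options(host: str = "localhost", port: int = 9200) -> Dict[str, str]:
    """Build connector options for a single OpenSearch node.

    The "opensearch.*" keys are an assumption modeled on the "es.*" keys
    used by elasticsearch-hadoop; verify them for your connector version.
    """
    return {
        "opensearch.nodes": host,
        "opensearch.port": str(port),
    }


def write_to_opensearch(
    df: Any, index: str, options: Dict[str, str], mode: str = "append"
) -> None:
    """Write a Spark DataFrame to an index through the DataFrameWriter API.

    `df` is expected to be a pyspark.sql.DataFrame; it is typed as Any so
    this sketch can be imported without PySpark installed.
    """
    (
        df.write
        .format("org.opensearch.spark.sql")  # data source class package from the connector jar
        .options(**options)
        .mode(mode)
        .save(index)  # the "path" argument is used as the index name
    )
```

With a real session this would be called as `write_to_opensearch(df, "people", opensearch_options())`, where `people` is a placeholder index name. It only works if the opensearch-hadoop jar is on the Spark classpath.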
Can you test it with the current client on the main branch?
Hi, can PySpark even be used to read/write from an OpenSearch index?
Yes. For example w/ OpenSearch 2.7 Docker command:
Build the snapshot (until it's formally released). And load the
Test it out w/
OpenSearch log output:
Closing this issue as complete.
Has the 3.0.0 version been released yet? I still see 1.0.1 from May of last year.
What's your motivation for opensearch-hadoop v3? Is it for compatibility with OpenSearch v3? The major versions should be cross compatible, so latest opensearch-hadoop should work with latest 2.x and 3.x of OpenSearch.
This did not work, @nknize. I am receiving the following exceptions: `Traceback (most recent call last):
Caused by: java.lang.ClassNotFoundException: org.opensearch.spark.sql.DefaultSource`
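A `ClassNotFoundException` for a data source class often means the connector jar was not on the Spark classpath when the job started, though the thread does not confirm that this is the cause here. As a sketch only, the helper below assembles a `spark-submit` invocation that passes a jar through the standard `--jars` flag. The function name and both paths are hypothetical placeholders, not values from this issue.

```python
from typing import List


def spark_submit_command(script: str, connector_jar: str) -> List[str]:
    """Return an argv list that runs `script` with `connector_jar` on the classpath.

    `--jars` is the spark-submit flag for shipping extra jars to the
    driver and executors; the paths passed in are up to the caller.
    """
    return ["spark-submit", "--jars", connector_jar, script]


# Hypothetical paths, for illustration only.
command = spark_submit_command(
    "write_job.py", "/path/to/opensearch-hadoop-connector.jar"
)
print(" ".join(command))
```

The resulting list can be handed to `subprocess.run(command, check=True)` on a machine where `spark-submit` is installed.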
With the jar we can write a Spark Scala DataFrame to OpenSearch. Can we add the necessary functionality to support PySpark writes to OpenSearch?
@nknize can you please help?