Skip to content

montara-io/lucia-spark-listener

 
 

Repository files navigation

Lucia Spark Listenr

The New & Improved Spark UI and Spark History Server by Montara

Lucia spark listener is collecting metrics from Spark Jobs and send them to Lucia Backend Infrastructure

Usage

  1. Before running your Spark job with Lucia listener, please setup Lucia on your environment (Local/Cloud) for more information

  2. After your Lucia environment is runinng, you can start use Lucia listener

Fields:

  1. endpoint

    • Cloud - tbd

    • Localenv - When you are running Lucia on your local env and running your are runing Spark job from cloud, you should use Ngrok:

      ngrok http 8181
      
      

      and set the endpoint Ngrok forwarding

    • Localenv - When you are running Lucia on your local env and running your Spark job from local, you should set the enpoint to http://localhost:8181

  2. pipeline_id - tbd

  3. pipeline_run_id - tbd

Example:


spark-submit \
  --packages io.github.montara-io:sparklistener_2.12:1.0.3 \
  --repositories https://s01.oss.sonatype.org/content/repositories/snapshots \
  --conf spark.lucia.sparklistener.url=<endpoint>/events \
  --conf spark.lucia.sparklistener.pipelineId=<pipeline_id> \
  --conf spark.lucia.sparklistener.pipelineRunId=<pipeline_run_id> \
  --conf spark.extraListeners=io.montara.lucia.sparklistener.LuciaSparkListener \
  <spark_job>

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages

  • Scala 100.0%