Skip to content

Commit

Permalink
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Update build hibench readme
Browse files Browse the repository at this point in the history
* Update 2.4 version to specify Spark Version.
* Add Specify Hadoop version documentation.
* Add Build using JDK 11 documentation.

Signed-off-by: Luis Ponce <[email protected]>
luisfponce committed Jun 24, 2019
1 parent c30ecfd commit 6455c21
Showing 1 changed file with 13 additions and 1 deletion.
14 changes: 13 additions & 1 deletion docs/build-hibench.md
Original file line number Diff line number Diff line change
@@ -28,7 +28,7 @@ Because some Maven plugins cannot support Scala version perfectly, there are som


### Specify Spark Version ###
To specify the spark version, use -Dspark=xxx(1.6, 2.0, 2.1 or 2.2). By default, it builds for spark 2.0
To specify the spark version, use -Dspark=xxx(1.6, 2.0, 2.1, 2.2 or 2.4). By default, it builds for spark 2.0

mvn -Psparkbench -Dspark=1.6 -Dscala=2.11 clean package
tips:
@@ -37,6 +37,11 @@ default . For example , if we want use spark2.0 and scala2.11 to build hibench.
package` , but for spark2.0 and scala2.10 , we need use the command `mvn -Dspark=2.0 -Dscala=2.10 clean package` .
Similarly , the spark1.6 is associated with the scala2.10 by default.

### Specify Hadoop Version ###
To specify the spark version, use -Dhadoop=xxx(3.2). By default, it builds for hadoop 2.4

mvn -Psparkbench -Dhadoop=3.2 -Dspark=2.4 -Dscala=2.12 clean package

### Build a single module ###
If you are only interested in a single workload in HiBench. You can build a single module. For example, the below command only builds the SQL workloads for Spark.

@@ -48,3 +53,10 @@ Supported modules includes: micro, ml(machine learning), sql, websearch, graph,
For Spark 2.0 and Spark 2.1, we add the benchmark support for Structured Streaming. This is a new module which cannot be compiled in Spark 1.6. And it won't get compiled by default even if you specify the spark version as 2.0 or 2.1. You must explicitly specify it like this:

mvn -Psparkbench -Dmodules -PstructuredStreaming clean package

### Build using JDK 1.11
If you are interested in building using Java 11 specify scala, spark and hadoop version as below

mvn -Psparkbench -Pflinkbench -Phadoopbench -Pstormbench -Dhadoop=3.2 -Dspark=2.4 -Dscala=2.12 clean package

Supported frameworks only: hadoopbench, sparkbench, flinkbench, stormbench

0 comments on commit 6455c21

Please sign in to comment.