Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP][test-hadoop3.2] Test Hadoop 3.2 on jenkins #24044

Closed
wants to merge 56 commits into from
Closed
Show file tree
Hide file tree
Changes from 13 commits
Commits
Show all changes
56 commits
Select commit Hold shift + click to select a range
0644e94
Only hadoop-3.1 upgrades built-in Hive to 2.3.4
wangyum Feb 14, 2019
3c0c72e
Fix test failed:
wangyum Feb 14, 2019
fc10762
Fix checkstyle error:
wangyum Feb 15, 2019
bb84f79
Upgrade hadoop to 3.2.0 fix test error
wangyum Feb 15, 2019
32ebcad
Fix Hadoop-2.7.x test error
wangyum Feb 16, 2019
a9f4344
Add new hive-thriftserver to support Hadoop-3.1
wangyum Feb 17, 2019
feb1ee7
Fix CliSuite test error
wangyum Feb 18, 2019
e734373
Clean some dependency
wangyum Feb 19, 2019
e39cc2d
Fix mvn install error
wangyum Feb 25, 2019
5fbfdaa
Release Kryo
wangyum Feb 28, 2019
bf47619
Merge remote-tracking branch 'upstream/master' into hadoop-3.2
wangyum Mar 10, 2019
71421d6
Merge master branch and try to test Hadoop-3.1 on jenkins
wangyum Mar 10, 2019
19d52e7
Use hadoop-3.1 checkstyle
wangyum Mar 11, 2019
3b2e454
Error testing can be passed on my machine.
wangyum Mar 12, 2019
4c1ec25
Workaround HADOOP-16180
wangyum Mar 12, 2019
d38c542
Rename hive-thriftserver2 to hive-thriftserverV2
wangyum Mar 13, 2019
d20a215
Merge remote-tracking branch 'upstream/master' into hadoop-3.1-on-jen…
wangyum Mar 19, 2019
3628057
Fix error
wangyum Mar 20, 2019
b7b5853
Merge remote-tracking branch 'upstream/master' into hadoop-3.1-on-jen…
wangyum Mar 22, 2019
94fd638
Verify SPARK-27177
wangyum Mar 22, 2019
3f11dbe
Make change smaller
wangyum Mar 23, 2019
0352585
Merge branch 'master' into hadoop-3.1-on-jenkins
wangyum Mar 29, 2019
d41b2cc
Make hive-llap-client scope to test.
wangyum Apr 5, 2019
de55cfd
update depts
wangyum Apr 5, 2019
b5813ec
This commit to test do we really need org.apache.logging.log4j:log4j-…
wangyum Apr 7, 2019
2898cb9
Merge remote-tracking branch 'upstream/master' into hadoop-3.1-on-jen…
wangyum Apr 7, 2019
1762f5c
improve test
wangyum Apr 7, 2019
7206a0b
Update pom.xml based on review
wangyum Apr 7, 2019
f4c4d48
Update depts
wangyum Apr 7, 2019
fea81ac
hive.parquet.version -> 1.8.1
wangyum Apr 8, 2019
fc7888d
isHive2 -> isHive23
wangyum Apr 8, 2019
5e5ffe6
Merge remote-tracking branch 'upstream/master' into hadoop-3.1-on-jen…
wangyum Apr 8, 2019
3357d42
Workaround `[NOT FOUND ] commons-httpclient#commons-httpclient;3.0.1…
wangyum Apr 8, 2019
f15f6d3
Remove hive-thriftserverV2
wangyum Apr 9, 2019
0c3f962
Fix hive-thriftserver with Hive 2.3.4 test issue.
wangyum Apr 10, 2019
9905ee2
Upgrade Hive Parquet to 1.10.1:
wangyum Apr 10, 2019
fecda5f
Remove duplicate code
wangyum Apr 12, 2019
5f2faf6
Add duplicate code
wangyum Apr 13, 2019
fdaf4bc
shade parquet
wangyum Apr 13, 2019
7d7848d
Merge remote-tracking branch 'upstream/master' into hadoop-3.1-on-jen…
wangyum Apr 22, 2019
2a06356
Revert shade parquet
wangyum Apr 22, 2019
8303bba
Fix ERROR XSLAN:
wangyum Apr 22, 2019
b1e025f
This commit to test revert the exclusion of commons-httpclient
wangyum Apr 22, 2019
aa09632
This commit to test revert the exclusion of commons-httpclient:
wangyum Apr 22, 2019
fffd2b7
This commit to test add commons-httpclient:commons-httpclient:3.1 to …
wangyum Apr 22, 2019
5b7b704
This commit to test commons-httpclient with hadoop-2.7
wangyum Apr 23, 2019
12e06d7
Revert httpclient to 3.0.1 to verify download it on our jenkins
wangyum Apr 23, 2019
54502d9
Revert httpclient to 3.0.1 to verify download it on our jenkins
wangyum Apr 24, 2019
c95cfbe
Merge remote-tracking branch 'upstream/master' into hadoop-3.1-on-jen…
wangyum Apr 24, 2019
09d749d
Revert httpclient related changes
wangyum Apr 24, 2019
0e148c5
Update deps
wangyum Apr 24, 2019
bdc1d2c
Add derby to SharedClass
wangyum Apr 26, 2019
26f7e89
Test Hive 2.3.5-SNAPSHOT
wangyum Apr 27, 2019
1158349
revert hive to 2.3.4
wangyum Apr 30, 2019
f1e550c
Verify Hive 2.3.5-SNAPSHOT (HIVE-21680 Backport HIVE-17644 to branch-…
wangyum May 7, 2019
c28fc2d
Upgrade Hive to 2.3.5
wangyum May 17, 2019
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -83,6 +83,7 @@ TempStatsStore/
metastore/
metastore_db/
sql/hive-thriftserver/test_warehouses
sql/hive-thriftserver2/test_warehouses
warehouse/
spark-warehouse/

Expand Down
10 changes: 10 additions & 0 deletions assembly/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -178,6 +178,16 @@
</dependency>
</dependencies>
</profile>
<profile>
<id>hive-thriftserver2</id>
<dependencies>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-hive-thriftserver2_${scala.binary.version}</artifactId>
<version>${project.version}</version>
</dependency>
</dependencies>
</profile>
<profile>
<id>spark-ganglia-lgpl</id>
<dependencies>
Expand Down
62 changes: 38 additions & 24 deletions dev/deps/spark-deps-hadoop-3.1
Original file line number Diff line number Diff line change
Expand Up @@ -15,6 +15,7 @@ arpack_combined_all-0.1.jar
arrow-format-0.12.0.jar
arrow-memory-0.12.0.jar
arrow-vector-0.12.0.jar
audience-annotations-0.5.0.jar
automaton-1.11-8.jar
avro-1.8.2.jar
avro-ipc-1.8.2.jar
Expand All @@ -29,7 +30,7 @@ commons-cli-1.2.jar
commons-codec-1.10.jar
commons-collections-3.2.2.jar
commons-compiler-3.0.11.jar
commons-compress-1.8.1.jar
commons-compress-1.9.jar
commons-configuration2-2.1.1.jar
commons-crypto-1.0.0.jar
commons-daemon-1.0.13.jar
Expand All @@ -42,13 +43,14 @@ commons-logging-1.1.3.jar
commons-math3-3.4.1.jar
commons-net-3.1.jar
commons-pool-1.5.4.jar
commons-text-1.6.jar
compress-lzf-1.0.3.jar
core-1.1.2.jar
curator-client-2.12.0.jar
curator-framework-2.12.0.jar
curator-recipes-2.12.0.jar
curator-client-2.13.0.jar
curator-framework-2.13.0.jar
curator-recipes-2.13.0.jar
datanucleus-api-jdo-3.2.6.jar
datanucleus-core-3.2.10.jar
datanucleus-core-4.1.17.jar
datanucleus-rdbms-3.2.9.jar
derby-10.12.1.1.jar
dnsjava-2.1.7.jar
Expand All @@ -60,20 +62,30 @@ gson-2.2.4.jar
guava-14.0.1.jar
guice-4.0.jar
guice-servlet-4.0.jar
hadoop-annotations-3.1.0.jar
hadoop-auth-3.1.0.jar
hadoop-client-3.1.0.jar
hadoop-common-3.1.0.jar
hadoop-hdfs-client-3.1.0.jar
hadoop-mapreduce-client-common-3.1.0.jar
hadoop-mapreduce-client-core-3.1.0.jar
hadoop-mapreduce-client-jobclient-3.1.0.jar
hadoop-yarn-api-3.1.0.jar
hadoop-yarn-client-3.1.0.jar
hadoop-yarn-common-3.1.0.jar
hadoop-yarn-registry-3.1.0.jar
hadoop-yarn-server-common-3.1.0.jar
hadoop-yarn-server-web-proxy-3.1.0.jar
hadoop-annotations-3.2.0.jar
hadoop-auth-3.2.0.jar
hadoop-client-3.2.0.jar
hadoop-common-3.2.0.jar
hadoop-hdfs-client-3.2.0.jar
hadoop-mapreduce-client-common-3.2.0.jar
hadoop-mapreduce-client-core-3.2.0.jar
hadoop-mapreduce-client-jobclient-3.2.0.jar
hadoop-yarn-api-3.2.0.jar
hadoop-yarn-client-3.2.0.jar
hadoop-yarn-common-3.2.0.jar
hadoop-yarn-registry-3.2.0.jar
hadoop-yarn-server-common-3.2.0.jar
hadoop-yarn-server-web-proxy-3.2.0.jar
hive-common-2.3.4.jar
hive-llap-client-2.3.4.jar
hive-llap-common-2.3.4.jar
hive-serde-2.3.4.jar
hive-service-rpc-2.3.4.jar
hive-shims-0.23-2.3.4.jar
hive-shims-2.3.4.jar
hive-shims-common-2.3.4.jar
hive-shims-scheduler-2.3.4.jar
hive-storage-api-2.6.0.jar
hk2-api-2.4.0-b34.jar
hk2-locator-2.4.0-b34.jar
hk2-utils-2.4.0-b34.jar
Expand All @@ -88,8 +100,8 @@ jackson-core-2.9.8.jar
jackson-core-asl-1.9.13.jar
jackson-databind-2.9.8.jar
jackson-dataformat-yaml-2.9.8.jar
jackson-jaxrs-base-2.7.8.jar
jackson-jaxrs-json-provider-2.7.8.jar
jackson-jaxrs-base-2.9.5.jar
jackson-jaxrs-json-provider-2.9.5.jar
jackson-mapper-asl-1.9.13.jar
jackson-module-jaxb-annotations-2.9.8.jar
jackson-module-paranamer-2.9.8.jar
Expand All @@ -116,12 +128,13 @@ jersey-container-servlet-core-2.22.2.jar
jersey-guava-2.22.2.jar
jersey-media-jaxb-2.22.2.jar
jersey-server-2.22.2.jar
jetty-webapp-9.4.12.v20180830.jar
jetty-xml-9.4.12.v20180830.jar
jetty-webapp-9.3.24.v20180605.jar
jetty-xml-9.3.24.v20180605.jar
jline-2.14.6.jar
joda-time-2.9.3.jar
jodd-core-3.5.2.jar
jpam-1.1.jar
json-1.8.jar
json-smart-2.3.jar
json4s-ast_2.12-3.5.3.jar
json4s-core_2.12-3.5.3.jar
Expand Down Expand Up @@ -173,6 +186,7 @@ okhttp-3.8.1.jar
okio-1.13.0.jar
opencsv-2.3.jar
orc-core-1.5.4-nohive.jar
orc-core-1.5.4.jar
orc-mapreduce-1.5.4-nohive.jar
orc-shims-1.5.4.jar
oro-2.0.8.jar
Expand Down Expand Up @@ -214,5 +228,5 @@ woodstox-core-5.0.3.jar
xbean-asm7-shaded-4.12.jar
xz-1.5.jar
zjsonpatch-0.3.0.jar
zookeeper-3.4.9.jar
zookeeper-3.4.13.jar
zstd-jni-1.3.2-2.jar
2 changes: 1 addition & 1 deletion dev/lint-java
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@
SCRIPT_DIR="$( cd "$( dirname "$0" )" && pwd )"
SPARK_ROOT_DIR="$(dirname $SCRIPT_DIR)"

ERRORS=$($SCRIPT_DIR/../build/mvn -Pkinesis-asl -Pmesos -Pkubernetes -Pyarn -Phive -Phive-thriftserver checkstyle:check | grep ERROR)
ERRORS=$($SCRIPT_DIR/../build/mvn -Pkinesis-asl -Pmesos -Pkubernetes -Pyarn -Phive -Phive-thriftserver2 checkstyle:check | grep ERROR)

if test ! -z "$ERRORS"; then
echo -e "Checkstyle checks failed at following occurrences:\n$ERRORS"
Expand Down
2 changes: 1 addition & 1 deletion dev/mima
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ set -e
FWDIR="$(cd "`dirname "$0"`"/..; pwd)"
cd "$FWDIR"

SPARK_PROFILES="-Pmesos -Pkubernetes -Pyarn -Pspark-ganglia-lgpl -Pkinesis-asl -Phive-thriftserver -Phive"
SPARK_PROFILES="-Pmesos -Pkubernetes -Pyarn -Pspark-ganglia-lgpl -Pkinesis-asl -Phive-thriftserver2 -Phive"
TOOLS_CLASSPATH="$(build/sbt -DcopyDependencies=false "export tools/fullClasspath" | tail -n1)"
OLD_DEPS_CLASSPATH="$(build/sbt -DcopyDependencies=false $SPARK_PROFILES "export oldDeps/fullClasspath" | tail -n1)"

Expand Down
2 changes: 2 additions & 0 deletions dev/run-tests-jenkins.py
Original file line number Diff line number Diff line change
Expand Up @@ -173,6 +173,8 @@ def main():
os.environ["AMPLAB_JENKINS_BUILD_PROFILE"] = "hadoop2.6"
if "test-hadoop2.7" in ghprb_pull_title:
os.environ["AMPLAB_JENKINS_BUILD_PROFILE"] = "hadoop2.7"
if "test-hadoop3.1" in ghprb_pull_title:
os.environ["AMPLAB_JENKINS_BUILD_PROFILE"] = "hadoop3.1"

build_display_name = os.environ["BUILD_DISPLAY_NAME"]
build_url = os.environ["BUILD_URL"]
Expand Down
3 changes: 2 additions & 1 deletion dev/run-tests.py
Original file line number Diff line number Diff line change
Expand Up @@ -112,7 +112,7 @@ def determine_modules_to_test(changed_modules):
['graphx', 'examples']
>>> x = [x.name for x in determine_modules_to_test([modules.sql])]
>>> x # doctest: +NORMALIZE_WHITESPACE
['sql', 'avro', 'hive', 'mllib', 'sql-kafka-0-10', 'examples', 'hive-thriftserver',
['sql', 'avro', 'hive', 'mllib', 'sql-kafka-0-10', 'examples', 'hive-thriftserver2',
'pyspark-sql', 'repl', 'sparkr', 'pyspark-mllib', 'pyspark-ml']
"""
modules_to_test = set()
Expand Down Expand Up @@ -274,6 +274,7 @@ def get_hadoop_profiles(hadoop_version):

sbt_maven_hadoop_profiles = {
"hadoop2.7": ["-Phadoop-2.7"],
"hadoop3.1": ["-Phadoop-3.1"],
}

if hadoop_version in sbt_maven_hadoop_profiles:
Expand Down
3 changes: 2 additions & 1 deletion dev/sbt-checkstyle
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,8 @@ ERRORS=$(echo -e "q\n" \
-Pkubernetes \
-Pyarn \
-Phive \
-Phive-thriftserver \
-Phadoop-3.1 \
wangyum marked this conversation as resolved.
Show resolved Hide resolved
-Phive-thriftserver2 \
checkstyle test:checkstyle \
| awk '{if($1~/error/)print}' \
)
Expand Down
2 changes: 1 addition & 1 deletion dev/scalastyle
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ ERRORS=$(echo -e "q\n" \
-Pkubernetes \
-Pyarn \
-Phive \
-Phive-thriftserver \
-Phive-thriftserver2 \
-Pspark-ganglia-lgpl \
-Pdocker-integration-tests \
-Pkubernetes-integration-tests \
Expand Down
11 changes: 5 additions & 6 deletions dev/sparktestsupport/modules.py
Original file line number Diff line number Diff line change
Expand Up @@ -154,19 +154,18 @@ def __hash__(self):
],
)


hive_thriftserver = Module(
name="hive-thriftserver",
hive_thriftserver2 = Module(
name="hive-thriftserver2",
dependencies=[hive],
source_file_regexes=[
"sql/hive-thriftserver",
"sql/hive-thriftserver2",
"sbin/start-thriftserver.sh",
],
build_profile_flags=[
"-Phive-thriftserver",
"-Phive-thriftserver2",
],
sbt_test_goals=[
"hive-thriftserver/test",
"hive-thriftserver2/test",
]
)

Expand Down
15 changes: 11 additions & 4 deletions dev/test-dependencies.sh
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ export LC_ALL=C
# TODO: This would be much nicer to do in SBT, once SBT supports Maven-style resolution.

# NOTE: These should match those in the release publishing script
HADOOP2_MODULE_PROFILES="-Phive-thriftserver -Pmesos -Pkubernetes -Pyarn -Phive"
HADOOP2_MODULE_PROFILES="-Pmesos -Pkubernetes -Pyarn -Phive"
MVN="build/mvn"
HADOOP_PROFILES=(
hadoop-2.7
Expand Down Expand Up @@ -67,15 +67,22 @@ $MVN -q versions:set -DnewVersion=$TEMP_VERSION -DgenerateBackupPoms=false > /de

# Generate manifests for each Hadoop profile:
for HADOOP_PROFILE in "${HADOOP_PROFILES[@]}"; do
HIVE_THRIFTSERVER_MODULE=none
if [ "${HADOOP_PROFILE}" = 'hadoop-2.7' ]; then
HIVE_THRIFTSERVER_MODULE="-Phive-thriftserver"
else
HIVE_THRIFTSERVER_MODULE="-Phive-thriftserver2"
fi

echo "Performing Maven install for $HADOOP_PROFILE"
$MVN $HADOOP2_MODULE_PROFILES -P$HADOOP_PROFILE jar:jar jar:test-jar install:install clean -q
$MVN $HADOOP2_MODULE_PROFILES $HIVE_THRIFTSERVER_MODULE -P$HADOOP_PROFILE jar:jar jar:test-jar install:install clean -q

echo "Performing Maven validate for $HADOOP_PROFILE"
$MVN $HADOOP2_MODULE_PROFILES -P$HADOOP_PROFILE validate -q
$MVN $HADOOP2_MODULE_PROFILES $HIVE_THRIFTSERVER_MODULE -P$HADOOP_PROFILE validate -q

echo "Generating dependency manifest for $HADOOP_PROFILE"
mkdir -p dev/pr-deps
$MVN $HADOOP2_MODULE_PROFILES -P$HADOOP_PROFILE dependency:build-classpath -pl assembly \
$MVN $HADOOP2_MODULE_PROFILES $HIVE_THRIFTSERVER_MODULE -P$HADOOP_PROFILE dependency:build-classpath -pl assembly \
| grep "Dependencies classpath:" -A 1 \
| tail -n 1 | tr ":" "\n" | rev | cut -d "/" -f 1 | rev | sort \
| grep -v spark > dev/pr-deps/spark-deps-$HADOOP_PROFILE
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -167,6 +167,7 @@ List<String> buildClassPath(String appClassPath) throws IOException {
"sql/core",
"sql/hive",
"sql/hive-thriftserver",
"sql/hive-thriftserver2",
"streaming"
);
if (prependClasses) {
Expand Down
Loading