MAPREDUCE-6096. SummarizedJob Class Improvement #1
base: trunk
Conversation
New file of AltFileInputStream.java to replace FileInputStream.java in apache/hadoop/HDFS
move tensorflow on yarn to the proper module
merge TensorFlow-YARN from zhankun's branch
New, improved Python script
This adds a new type of namenode: the observer. An observer is like a standby NN (in fact they share most of the code), except that it participates in neither NN failover (i.e., it is not part of the HA pair) nor checkpointing.

An observer is specified through configuration. First, it needs to be added to the config dfs.ha.namenodes, just like a normal namenode, together with other configs such as dfs.namenode.rpc-address, dfs.namenode.http-address, etc. Second, it needs to be listed in a new config, dfs.ha.observer.namenodes, which differentiates it from the ordinary active/standby namenodes.

An observer can be used to serve read-only requests from HDFS clients when the following two conditions are satisfied (see the configuration sketch below):
1. the config dfs.client.failover.proxy.provider.<nameservice> is set to org.apache.hadoop.hdfs.server.namenode.ha.StaleReadProxyProvider, and
2. the config dfs.client.enable.stale-read is set to true.

This also changes the way edit logs are loaded on the standby/observer NNs. Instead of loading them all at once, the new implementation loads them one batch at a time (default batch size is 10K edits) over multiple iterations, waiting a short amount of time between iterations (default waiting time is 100ms). This ensures the global lock is not held too long while loading edits; otherwise, RPC processing time would suffer.

This patch does not include a mechanism for clients to bound staleness using the journal transaction ID: excluding this allows us to deploy the observer more easily. More specifically, the deployment involves:
1. restarting all datanodes with the updated configs; no binary change on datanodes is required.
2. bootstrapping and starting the observer namenode with the updated configs; existing namenodes do not need to change.

Future tasks:
1. Allow clients to set a bound on observer staleness in terms of time (e.g., 2 min). If for some reason the lag in edit tailing exceeds the bound, the client-side proxy provider will fail all RPCs over to the active namenode.
2. Use the journal transaction ID to enforce a bound on staleness. This can be embedded in the RPC header.
3. Allow a new standby/observer to be deployed without a datanode restart.
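A minimal sketch of these settings using Hadoop's Configuration API; the nameservice name ("mycluster"), the namenode IDs ("nn1", "nn2", "obs1"), and the host names are hypothetical placeholders, while the config keys and the proxy provider class are the ones named above:

```java
import org.apache.hadoop.conf.Configuration;

public class ObserverConfigSketch {
  public static Configuration observerConf() {
    Configuration conf = new Configuration();
    // The observer is listed alongside the normal namenodes...
    conf.set("dfs.ha.namenodes.mycluster", "nn1,nn2,obs1");
    conf.set("dfs.namenode.rpc-address.mycluster.obs1", "observer-host:8020");
    conf.set("dfs.namenode.http-address.mycluster.obs1", "observer-host:9870");
    // ...and additionally marked as an observer by the new config key.
    conf.set("dfs.ha.observer.namenodes.mycluster", "obs1");
    // Client-side settings that enable stale (read-only) reads from the observer.
    conf.set("dfs.client.failover.proxy.provider.mycluster",
        "org.apache.hadoop.hdfs.server.namenode.ha.StaleReadProxyProvider");
    conf.setBoolean("dfs.client.enable.stale-read", true);
    return conf;
  }
}
```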
Update image names/tags in scripts
💔 -1 overall
This message was automatically generated.
adjust bpid format for HW Pacific storage
NULL tuples cause an NPE when writing
…ng/binary column trunk are null

In case of all nulls in a binary column, the statistics object read from the file metadata is empty, and it should return true for the all-nulls check on the column. Even if a column has no values, it can be ignored. The other way is to fix this behaviour in the writer, but is that what we want?

Author: Yash Datta <[email protected]>
Author: Alex Levenson <[email protected]>
Author: Yash Datta <[email protected]>

Closes apache#99 from saucam/npe and squashes the following commits:

5138e44 [Yash Datta] PARQUET-136: Remove unreachable block
b17cd38 [Yash Datta] Revert "PARQUET-161: Trigger tests"
82209e6 [Yash Datta] PARQUET-161: Trigger tests
aab2f81 [Yash Datta] PARQUET-161: Review comments for the test case
2217ee2 [Yash Datta] PARQUET-161: Add a test case for checking the correct statistics info is recorded in case of all nulls in a column
c2f8d6f [Yash Datta] PARQUET-161: Fix the write path to write statistics object in case of only nulls in the column
97bb517 [Yash Datta] Revert "revert TestStatisticsFilter.java"
a06f0d0 [Yash Datta] Merge pull request apache#1 from isnotinvain/alexlevenson/PARQUET-161-136
b1001eb [Alex Levenson] Fix statistics isEmpty, handle more edge cases in statistics filter
0c88be0 [Alex Levenson] revert TestStatisticsFilter.java
1ac9192 [Yash Datta] PARQUET-136: It's better to not filter chunks for which an empty statistics object is returned. Empty statistics can be read in case of 1. pre-statistics files, 2. files written from a current writer that has a bug, as it does not write the statistics if the column has all nulls
e5e924e [Yash Datta] Revert "PARQUET-136: In case of all nulls in a binary column, statistics object read from file metadata is empty, and should return true for all nulls check for the column"
8cc5106 [Yash Datta] Revert "PARQUET-136: fix hasNulls to cater to the case where all values are nulls"
c7c126f [Yash Datta] PARQUET-136: fix hasNulls to cater to the case where all values are nulls
974a22b [Yash Datta] PARQUET-136: In case of all nulls in a binary column, statistics object read from file metadata is empty, and should return true for all nulls check for the column
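As a rough illustration of the reader-side behavior these commits converge on (a hypothetical helper, not the actual parquet-mr StatisticsFilter code; `Statistics.isEmpty()` and `getNumNulls()` are real parquet-mr accessors, here under the current package name):

```java
import org.apache.parquet.column.statistics.Statistics;

public class StatsFilterSketch {
  /**
   * Decide whether a row-group chunk can safely be dropped for an
   * "is not null" predicate. Returns false ("might match") whenever the
   * statistics are empty, e.g. for pre-statistics files or files written
   * by a writer that skipped statistics for all-null binary columns.
   */
  public static boolean canDropForIsNotNull(Statistics<?> stats, long valueCount) {
    if (stats.isEmpty()) {
      // No reliable information: never filter the chunk.
      return false;
    }
    // All values are null: nothing in this chunk can match "is not null".
    return stats.getNumNulls() == valueCount;
  }
}
```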
…thod

Author: Alex Levenson <[email protected]>
Author: Konstantin Shaposhnikov <[email protected]>
Author: kostya-sh <[email protected]>

Closes apache#171 from kostya-sh/PARQUET-246 and squashes the following commits:

75950c5 [kostya-sh] Merge pull request apache#1 from isnotinvain/PR-171
a718309 [Konstantin Shaposhnikov] Merge remote-tracking branch 'refs/remotes/origin/master' into PARQUET-246
0367588 [Alex Levenson] Add regression test for PR-171
94e8fda [Alex Levenson] Merge branch 'master' into PR-171
0a9ac9f [Konstantin Shaposhnikov] [PARQUET-246] bugfix: reset all DeltaByteArrayWriter state in reset() method
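The essence of the fix, sketched with a hypothetical simplified writer rather than the real DeltaByteArrayWriter: reset() must clear every piece of state, including the previously seen value used to compute prefix lengths, not just the collected output:

```java
// Hypothetical, simplified delta-byte-array writer illustrating the
// PARQUET-246 fix: if reset() forgets to clear the remembered previous
// value, the next page's prefix lengths are computed against stale data.
public class SimpleDeltaByteArrayWriter {
  private final java.util.List<Integer> prefixLengths = new java.util.ArrayList<>();
  private final java.util.List<byte[]> suffixes = new java.util.ArrayList<>();
  private byte[] previous = new byte[0];

  public void write(byte[] value) {
    int prefix = commonPrefixLength(previous, value);
    prefixLengths.add(prefix);
    suffixes.add(java.util.Arrays.copyOfRange(value, prefix, value.length));
    previous = value;
  }

  public void reset() {
    prefixLengths.clear();
    suffixes.clear();
    previous = new byte[0]; // the crucial line: forget the last value too
  }

  private static int commonPrefixLength(byte[] a, byte[] b) {
    int n = Math.min(a.length, b.length);
    int i = 0;
    while (i < n && a[i] == b[i]) i++;
    return i;
  }
}
```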
In response to PARQUET-251, this creates an integration test that generates random values and compares the statistics against the values read back from a Parquet file. There are two utility classes, `DataGenerationContext` and `RandomValueGenerators`, which are located in the same package as the unit test. I'm sure there is a better place to put these, but I leave that to your discretion. Thanks, Reuben

Author: Reuben Kuhnert <[email protected]>
Author: Ryan Blue <[email protected]>

Closes apache#255 from sircodesalotOfTheRound/stats-validation and squashes the following commits:

680e96a [Reuben Kuhnert] Merge pull request apache#1 from rdblue/PARQUET-355-stats-validation-tests
9f0033f [Ryan Blue] PARQUET-355: Use ColumnReaderImpl.
7d0b4fe [Reuben Kuhnert] PARQUET-355: Add Statistics Validation Test
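A hypothetical outline of the round-trip idea (the real test writes an actual Parquet file via the helpers above and reads the statistics back from the footer; every name here is a stand-in):

```java
// Sketch: generate random values, track the expected min/max/null-count by
// hand, and assert that a statistics accumulator (standing in for the
// statistics parquet-mr records in the file footer) agrees.
import java.util.Random;

public class StatsValidationSketch {
  static final class LongStats { // stand-in for parquet's statistics object
    long min = Long.MAX_VALUE, max = Long.MIN_VALUE, numNulls = 0;
    void update(Long v) {
      if (v == null) { numNulls++; return; }
      min = Math.min(min, v);
      max = Math.max(max, v);
    }
  }

  public static void main(String[] args) {
    Random rnd = new Random(42);
    LongStats recorded = new LongStats(); // what the "writer" tracks
    long expMin = Long.MAX_VALUE, expMax = Long.MIN_VALUE, expNulls = 0;
    for (int i = 0; i < 10_000; i++) {
      Long v = rnd.nextInt(10) == 0 ? null : (long) rnd.nextInt(1_000_000);
      recorded.update(v);
      if (v == null) expNulls++;
      else { expMin = Math.min(expMin, v); expMax = Math.max(expMax, v); }
    }
    if (recorded.min != expMin || recorded.max != expMax || recorded.numNulls != expNulls) {
      throw new AssertionError("statistics do not match generated data");
    }
    System.out.println("statistics match generated data");
  }
}
```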
https://issues.apache.org/jira/browse/MAPREDUCE-6096
The SummarizedJob class should be improved.
When I parse a job history file (e.g. job_1408862281971_489761-1410883171851_XXX.jhist), I use the org.apache.hadoop.mapreduce.jobhistory.JobHistoryParser class from Hadoop's map-reduce-client-core module together with HistoryViewer$SummarizedJob,
and it throws an exception like:

```
Exception in thread "pool-1-thread-1" java.lang.NullPointerException
    at org.apache.hadoop.mapreduce.jobhistory.HistoryViewer$SummarizedJob.<init>(HistoryViewer.java:626)
    at com.jd.hadoop.log.parse.ParseLogService.getJobDetail(ParseLogService.java:70)
```

After looking at the SummarizedJob class, I found that attempt.getTaskStatus() can be NULL,
so I changed the order of attempt.getTaskStatus().equals(TaskStatus.State.FAILED.toString()) to
TaskStatus.State.FAILED.toString().equals(attempt.getTaskStatus()),
and it works well.
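The fix in miniature: invoking equals() on the constant side is null-safe. A minimal sketch; the class and method names are illustrative, with `taskStatus` standing in for attempt.getTaskStatus():

```java
import org.apache.hadoop.mapred.TaskStatus;

public class NullSafeStatusCheck {
  static boolean isFailed(String taskStatus) {
    // Before the fix: taskStatus.equals(TaskStatus.State.FAILED.toString())
    // throws a NullPointerException when taskStatus is null.
    // After the fix: calling equals() on the non-null constant simply
    // returns false for a null status.
    return TaskStatus.State.FAILED.toString().equals(taskStatus);
  }
}
```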