This code implements the research paper:
Fault-Tolerant Computation Meets Network Coding: Optimal Scheduling in Parallel Computing
The prime number settings are in the CRT.java file.
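As a rough sketch of what those prime settings control, the following minimal Java example reconstructs an integer from its residues via the Chinese Remainder Theorem. The primes, class name, and method names below are illustrative assumptions, not the actual values and code in CRT.java:

import java.math.BigInteger;

// Minimal CRT reconstruction sketch. The primes are illustrative
// placeholders; the project's actual moduli are set in CRT.java.
public class CrtSketch {
    // Pairwise-coprime moduli (hypothetical values).
    static final long[] PRIMES = {1000000007L, 998244353L, 754974721L};

    // Reconstruct x (mod p1*p2*...*pk) from its residues x mod p_i.
    static BigInteger reconstruct(long[] residues) {
        BigInteger m = BigInteger.ONE;
        for (long p : PRIMES) m = m.multiply(BigInteger.valueOf(p));

        BigInteger x = BigInteger.ZERO;
        for (int i = 0; i < PRIMES.length; i++) {
            BigInteger p = BigInteger.valueOf(PRIMES[i]);
            BigInteger mi = m.divide(p);          // product of the other moduli
            BigInteger inv = mi.modInverse(p);    // (m/p_i)^{-1} mod p_i
            x = x.add(BigInteger.valueOf(residues[i]).multiply(mi).multiply(inv)).mod(m);
        }
        return x;
    }

    public static void main(String[] args) {
        // Example: encode 123456789 into residues, then reconstruct it.
        long value = 123456789L;
        long[] residues = new long[PRIMES.length];
        for (int i = 0; i < PRIMES.length; i++) residues[i] = value % PRIMES[i];
        System.out.println(reconstruct(residues)); // prints 123456789
    }
}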
The code model and the number of failed mappers are set in config.properties; for specific details, refer to run.bash, a script for testing the code.
The CRT code is based on RESIDUES.C [3].
Install Hadoop 2.9.1 and make sure it works correctly.
Create a file named config.properties with the following settings:
OPTION=false
FAILED_NUMBER=0
Settings can be changed later.
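For reference, here is one way those two properties could be loaded on the Java side with java.util.Properties. The class name is hypothetical and the project's actual loading code may differ:

import java.io.FileInputStream;
import java.io.IOException;
import java.util.Properties;

// Sketch of loading config.properties; the class name is hypothetical.
public class ConfigSketch {
    public static void main(String[] args) throws IOException {
        Properties props = new Properties();
        try (FileInputStream in = new FileInputStream("config.properties")) {
            props.load(in);
        }
        // OPTION toggles the code model; FAILED_NUMBER is the failed-mapper count.
        boolean option = Boolean.parseBoolean(props.getProperty("OPTION", "false"));
        int failedNumber = Integer.parseInt(props.getProperty("FAILED_NUMBER", "0"));
        System.out.println("OPTION=" + option + ", FAILED_NUMBER=" + failedNumber);
    }
}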
Compile and run the entire program:
hadoop com.sun.tools.javac.Main CRTMatrixD.java CRTMapper.java CRTReducer.java CRT.java
jar cf CRT.jar CRT*.class
hadoop jar CRT.jar CRT /user/[username]/crt/input /user/[username]/crt/output
The output file can be viewed with:
hadoop fs -cat /user/[username]/crt/output/part-r-00000
Other information can be found in the Hadoop logs.
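For orientation, here is a bare-bones sketch of the Mapper/Reducer pair that the compile command above expects. The key/value types and the logic are assumptions; see CRTMapper.java and CRTReducer.java for the real implementations:

import java.io.IOException;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

// Skeleton only: key/value types are assumptions, not the project's actual ones.
class CRTMapperSketch extends Mapper<LongWritable, Text, Text, Text> {
    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        // A real mapper would compute a residue of its input block here.
        context.write(new Text("residue"), value);
    }
}

class CRTReducerSketch extends Reducer<Text, Text, Text, Text> {
    @Override
    protected void reduce(Text key, Iterable<Text> values, Context context)
            throws IOException, InterruptedException {
        // A real reducer would combine residues via CRT reconstruction here.
        for (Text v : values) {
            context.write(key, v);
        }
    }
}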
Our project can easily be deployed on AWS or other cloud services. There are two ways to create a MapReduce project on AWS. We use EC2 to create every node, which is flexible but a little complex; the number of launched instances should equal the total number of NameNodes and DataNodes. A detailed introduction can be found at [2].
Alternatively, you can use Amazon EMR [1] to create the project directly.
The average time per mapper and the total running time of the simulations are compared below for different numbers of failed mappers.
[1] https://aws.amazon.com/emr/
[2] https://medium.com/@jeevananandanne/setup-4-node-hadoop-cluster-on-aws-ec2-instances-1c1eeb4453bd
[3] http://www.discontinuity.net/projects/residue_int/residues.txt