forked from mesos/spark

Scala framework for iterative and interactive cluster computing.

MfgLabs/spark

A modified version of the original Spark that handles S3 files.

To authenticate with AWS, set these environment variables:

export AWS_ACCESS_KEY_ID=
export AWS_SECRET_ACCESS_KEY=
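Before launching, it can help to confirm that both variables are actually set in the current shell. The following is a minimal sketch, not part of this repository; the function name `check_aws_env` is hypothetical.

```shell
# Hypothetical helper: verify that both AWS credential variables are set.
# Prints the name of each missing variable to stderr and returns non-zero
# if at least one is missing or empty.
check_aws_env() {
  check_aws_env_missing=0
  if [ -z "${AWS_ACCESS_KEY_ID:-}" ]; then
    echo "AWS_ACCESS_KEY_ID is not set" >&2
    check_aws_env_missing=1
  fi
  if [ -z "${AWS_SECRET_ACCESS_KEY:-}" ]; then
    echo "AWS_SECRET_ACCESS_KEY is not set" >&2
    check_aws_env_missing=1
  fi
  return "$check_aws_env_missing"
}
```

For example, `check_aws_env && echo "credentials present"` prints the message only when both variables are non-empty.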

Launch cluster

Launch a cluster with spark-ec2:

./spark-ec2 --key-pair= --identity-file= --environment /home/mfg/enviroment.sh --slaves=2 --instance-type=m1.medium --region=eu-west-1 --user=ubuntu --ami=ami-9ec1ccea --spot-price=0.033 --cluster-type=standalone launch

The environment file is a bash script containing the commands that set the environment variables above.
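As an illustration, such an environment file might look like the sketch below. The values are placeholders chosen for this example, not real credentials, and the exact contents of the author's file are not shown in the source.

```shell
#!/usr/bin/env bash
# Sketch of an environment file to pass via --environment.
# Placeholder values (assumptions): replace them with your own AWS credentials.
export AWS_ACCESS_KEY_ID="your-access-key-id"
export AWS_SECRET_ACCESS_KEY="your-secret-access-key"
```

Since the file holds secrets, it is sensible to keep it out of version control and restrict its permissions, for instance with `chmod 600`.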

If the launch fails partway through, resume it by re-running the command with the --resume parameter added.

Run job

To run a job, first log in to the cluster:

./spark-ec2 --key-pair= --identity-file= login

End job

./spark-ec2 destroy
