-
Notifications
You must be signed in to change notification settings - Fork 23
/
Copy pathassignment2-431.html
142 lines (119 loc) · 5.5 KB
/
assignment2-431.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above 3 meta tags *must* come first in the head; any other head content must come *after* these tags -->
<meta name="description" content="Course homepage for CS 431/631 451/651 Data-Intensive Distributed Computing (Winter 2019) at the University of Waterloo">
<meta name="author" content="Adam Roegiest">
<title>Data-Intensive Distributed Computing</title>
<!-- Bootstrap core CSS -->
<link href="css/bootstrap.min.css" rel="stylesheet">
<!-- IE10 viewport hack for Surface/desktop Windows 8 bug -->
<link href="css/ie10-viewport-bug-workaround.css" rel="stylesheet">
<!-- Just for debugging purposes. Don't actually copy these 2 lines! -->
<!--[if lt IE 9]><script src="../../assets/js/ie8-responsive-file-warning.js"></script><![endif]-->
<script src="js/ie-emulation-modes-warning.js"></script>
<!-- HTML5 shim and Respond.js for IE8 support of HTML5 elements and media queries -->
<!--[if lt IE 9]>
<script src="https://oss.maxcdn.com/html5shiv/3.7.3/html5shiv.min.js"></script>
<script src="https://oss.maxcdn.com/respond/1.4.2/respond.min.js"></script>
<![endif]-->
<style>
body {
padding-top: 60px; /* 60px to make the container go all the way to the bottom of the topbar */
}
</style>
</head>
<body>
<nav class="navbar navbar-inverse navbar-fixed-top">
<div class="container">
<div class="navbar-header">
<button type="button" class="navbar-toggle collapsed" data-toggle="collapse" data-target="#navbar" aria-expanded="false" aria-controls="navbar">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
</div>
<div id="navbar" class="collapse navbar-collapse">
<ul class="nav navbar-nav">
<li><a href="index.html">Overview</a></li>
<li><a href="organization.html">Organization</a></li>
<li><a href="syllabus.html">Syllabus</a></li>
<li class="active"><a href="assignments.html">Assignments</a></li>
<li><a href="software.html">Software</a></li>
</ul>
</div><!--/.nav-collapse -->
</div>
</nav>
<div class="container">
<div class="page-header">
<div style="float: right"><img width="250" src="images/waterloo_logo.png" alt="University of Waterloo logo"/></div>
<h1>Assignments <br/><small>Data-Intensive Distributed Computing (Winter 2019)</small></h1>
</div>
<p>Note that there are separate sets of assignments for CS 451/651 and CS
431/631. Make sure you work on the correct asssignments!</p>
<p><a href="assignments-431.html" class="btn btn-info btn-large">CS 431/631 Assignments</a></p>
<div class="subnav">
<ul class="nav nav-pills">
<li><a href="assignment0-431.html">0</a></li>
<li><a href="assignment1-431.html">1</a></li>
<li><a href="assignment2-431.html">2</a></li>
<li><a href="assignment3-431.html">3</a></li>
<li><a href="assignment4-431.html">4</a></li>
<li><a href="assignment5-431.html">5</a></li>
<li><a href="project-431.html">Final Project</a></li>
</ul>
</div>
<h3>Assignment 2 <small>due 2:30pm February 13</small></h3>
<p>
This assignment requires you to compute statistics over tokens
in the text of Shakespeare's plays, as you did for
<a href="assignment1-431.html">Assignment 1</a>. However, instead
of performing the analysis purely in Python, you will use Spark
(with a Python driver program).
</p>
<p>
For this assignment, you should continue to use the
<a href="https://jupyter.student.cs.uwaterloo.ca:8000"
target="_blank">Waterloo CS Jupyter hub</a>. Use
your password for the <emph>CS student computing
environment</emph>.
</p>
<p>
You will need to upload the following files
to your working folder on the hub:
<ul>
<li><a href="content/cs431/Shakespeare.txt">Shakespeare.txt</a>: the
complete text of Shakespeare's plays</li>
<li><a href="content/cs431/simple_tokenize.py">simple_tokenize.py</a>:
a simple Python module for tokenizing text</li>
<li><a href="content/cs431/A2.ipynb">A2.ipynb</a>: this is the starter
notebook for A2.</li>
</ul> As usual, open the starter notebook on the hub to get started working
on the assignment.
<a name="submitting"></a>
<h4 style="padding-top: 10px">Submitting Assignment 2</h4>
<p>
When you have finished, submit A2 using the same procedure you
used for A1:
<ol>
<li>Download your <samp>A2.ipynb</samp> file from the Jupyter hub to your
computer.</li>
<li>Submit your <samp>A2.ipynb</samp> file to your GitHub repository using the web interface like you did in assignment 1.</li>
</ol>
</p>
<p>That's it! Note that your A2.ipynb should be
committed and pushed to origin before the deadline.</p>
<p style="padding-top: 20px"><a href="#">Back to top</a></p>
<div style="padding-bottom: 100px"></div>
</div><!-- /.container -->
<!-- Placed at the end of the document so the pages load faster -->
<script src="https://ajax.googleapis.com/ajax/libs/jquery/1.12.4/jquery.min.js"></script>
<script src="js/bootstrap.min.js"></script>
<!-- IE10 viewport hack for Surface/desktop Windows 8 bug -->
<script src="js/ie10-viewport-bug-workaround.js"></script>
</body>
</html>