Skip to content

Commit

Permalink
fixes
Browse files Browse the repository at this point in the history
Signed-off-by: Anatoly Myachev <[email protected]>
  • Loading branch information
anmyachev committed Oct 24, 2023
1 parent d7434bf commit e3bf55d
Showing 1 changed file with 7 additions and 8 deletions.
15 changes: 7 additions & 8 deletions docs/getting_started/using_modin/using_modin_cluster.rst
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,7 @@ Using Modin in a Cluster

.. note::
| *Estimated Reading Time: 15 minutes*
| You can follow along in a Jupyter notebook in this two-part tutorial: `Part 1`_,
| `Part 2`_.
| You can follow along in a Jupyter notebook in this two-part tutorial: `Part 1`_, `Part 2`_.
Often in practice we have a need to exceed the capabilities of a single machine. Modin
works and performs well in both local mode and in a cluster environment. The key
Expand All @@ -17,8 +16,8 @@ transparently.
Starting up a Ray Cluster
-------------------------
Modin is able to utilize Ray's built-in autoscaled cluster. To launch a Ray cluster
using Amazon Web Service (AWS), you can use `modin-cluster.yaml`_ as the config file
(`available autoscale options`_).
using Amazon Web Service (AWS), you can use `Modin's cluster setup config`_
(`Ray's autoscaler options`_).

.. code-block:: bash
Expand Down Expand Up @@ -64,7 +63,8 @@ Using Modin on a Ray Cluster
Now that we have a Ray cluster up and running, we can use Modin to perform pandas
operation as if we were working with pandas on a single machine. We test Modin's
performance on the 200MB `NYC Taxi dataset`_ that was provided as part of our
`cluster setup script`_. We can time the following operation in a Jupyter notebook:
`Modin's cluster setup config`_. We can time the following operation in a Jupyter
notebook:

.. code-block:: python
Expand Down Expand Up @@ -113,8 +113,7 @@ you can customize your Ray environment for use in Modin!
.. _`Ray's API docs`: https://ray.readthedocs.io/en/latest/api.html
.. _`Part 1`: https://github.com/modin-project/modin/tree/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.ipynb
.. _`Part 2`: https://github.com/modin-project/modin/tree/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_6.ipynb
.. _`modin-cluster.yaml`: https://github.com/modin-project/modin/blob/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/modin-cluster.yaml
.. _`available autoscale options`: https://docs.ray.io/en/latest/cluster/vms/references/ray-cluster-configuration.html#cluster-config
.. _`Ray's autoscaler options`: https://docs.ray.io/en/latest/cluster/vms/references/ray-cluster-configuration.html#cluster-config
.. _`Ray's cluster docs`: https://docs.ray.io/en/latest/cluster/getting-started.html
.. _`NYC Taxi dataset`: https://modin-datasets.s3.amazonaws.com/testing/yellow_tripdata_2015-01.csv
.. _`cluster setup script`: https://github.com/modin-project/modin/blob/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/modin-cluster.yaml
.. _`cluster setup config`: https://github.com/modin-project/modin/blob/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/modin-cluster.yaml

0 comments on commit e3bf55d

Please sign in to comment.