Skip to content

Commit

Permalink
remove mention of random initialisation in the method and clarify wha…
Browse files Browse the repository at this point in the history
…t convergence looks like, tasks 8 and 9

Picking initial points randomly here may be misleading for someone just looking up the method from this section. Have simply omitted and said that this is discussed below. Also, have removed the word convergence in favour of a description of what convergence looks like
  • Loading branch information
mallewellyn authored Mar 6, 2024
1 parent c2f8653 commit 0506bf1
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions _episodes_rmd/06-k-means.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -75,9 +75,9 @@ The distance between data points within a cluster is
used as a measure of within-cluster variation.

To carry out K-means clustering, we first pick $k$ initial points as centres or
"centroids" of our clusters. There are a few ways to choose these initial "centroids",
but for simplicity let's imagine we just pick three random co-ordinates.
We then follow these two steps until convergence:
"centroids" of our clusters. There are a few ways to choose these initial "centroids"
and this is discussed below. Once we have picked intitial points, we then follow
these two steps until appropriate clusters have been formed:

1. Assign each data point to the cluster with the closest centroid
2. Update centroid positions as the average of the points in that cluster
Expand Down

0 comments on commit 0506bf1

Please sign in to comment.