Commit

Using DBSCAN for the trip clustering as well
We are already using DBSCAN for the start and end location clustering, so we
can pretty easily find trips by matching on the start and end locations. Here,
we explore two alternatives for the trip matching: add the distance matrices
and recluster, or group trips by their (start, end) cluster label pairs.
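
A minimal sketch of the second alternative, assuming the start and end
location labels are already available as two parallel lists (for example from
the `labels_` of two fitted DBSCAN models). The function name, the sample
labels, and the choice to mark a trip as noise when either endpoint is noise
are illustrative assumptions, not taken from the committed code.

```python
NOISE = -1  # label DBSCAN assigns to points that belong to no cluster


def trip_labels_from_pairs(start_labels, end_labels):
    """Give each trip a cluster label based on its (start, end) label pair.

    Hypothetical helper: trips with the same start cluster AND the same end
    cluster share a trip label. Trips with a noisy start or end are treated
    as noise (an assumption made for this sketch).
    """
    pair_to_trip_label = {}
    trip_labels = []
    for s, e in zip(start_labels, end_labels):
        if s == NOISE or e == NOISE:
            trip_labels.append(NOISE)
            continue
        # Number distinct pairs 0, 1, 2, ... in order of first appearance.
        label = pair_to_trip_label.setdefault((s, e), len(pair_to_trip_label))
        trip_labels.append(label)
    return trip_labels


# Made-up location labels for six trips.
start_labels = [0, 0, 1, 0, -1, 1]
end_labels = [1, 1, 0, 2, 1, 0]
trip_labels = trip_labels_from_pairs(start_labels, end_labels)
print(trip_labels)  # [0, 0, 1, 2, -1, 1]
```

Note that the pair (0, 1) and its reverse (1, 0) get different trip labels, so
outbound and return trips between the same two places stay separate.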

We find that the second method is correct. We also spot check both methods and
find that the location clustering itself can have some minor issues, but 2 of
the 3 spot checks worked well.
shankari committed Jul 23, 2021
1 parent 89e241f commit c7e8205
Showing 1 changed file with 748 additions and 0 deletions.