-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Iterate Cluster Tree #102
Comments
I think that information is contained in the |
Thank you for your response. It does help. However, I am not completely following. Please consider the following example. I amended the original post to print out some of the cluster tree data to aid in the discussion. I ran the code w/ these settings ...
... which produced this output (noticed I switched to cobble stone - actual tree scheme is irrelevant for this discussion) ...
My understanding is this cluster tree has 15 hierarchical clusters based on the "nodes" and the "tree print" output. The 15 hierarchical cluster tree is shown below with the number of points of each cluster.
When you say "first" cluster I am not sure how the algorithm orders the cardinality of the clusters. I would call the "first" cluster the intitial cluster which holds all 20 points. Then the "next" clusters would be two 10 point clusters. Then the "next" cluster would be the four 5 point clusters. Then final level = 4 clusters are four leaf pairs containing 2 and 3 points each. Is your description only going to show point indices in the leaf clusters? If so, I need to know the point indices in all the clusters - not just the leaf clusters. My question is then what are the points (indices) for each of the 15 hiearachical clusters in the cluster tree? I think the demonstration above reflects the number of points of each cluster but I need the actual point indices for all clusters in the cluster tree. Maybe you can tell me how to get them all for this simple example. |
Oh I think I see one aspect of the issue. Once we know the points in the leaf clusters we can then accumulate backwards to get the points in parent clusters as needed. But I still don't follow getting the leaf point indices (assuming this was your initial intent). In this example you say the "first" cluster corresponds to ...
... which is ...
... or ...
... which is 5 indices. But all the leaf sizes are either 2 or 3 according to demonstration above. What am I missing? It would also be helpful (in general) if the structure of the permutation vector was documented. Is it? |
I believe the points are assigned like this:
note that the perm values are 1 based. (then permutations can be applied with LAPACK routines like ?lapmt/?lapmr) |
Ah very good. I will examine this further in my actual context usage. I consider the issue closed for now and thanks again for your help. |
Hello,
I created a KMeans (k = 2) recursive ClusterTree with the code snippet below. I would like to verify the 3D points each cluster contains. Is this possible?
The text was updated successfully, but these errors were encountered: