Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Heatmap to UMAP scatterplot #126

Closed
mccalluc opened this issue Jan 27, 2020 · 5 comments
Closed

Add Heatmap to UMAP scatterplot #126

mccalluc opened this issue Jan 27, 2020 · 5 comments
Labels
enhancement New feature or request

Comments

@mccalluc
Copy link
Contributor

The data for the UMAP scatterplot should also support a heatmap. Make the necessary tweaks in https://github.com/hubmapconsortium/portal-containers/ and then update the view config here. (Will be blocked by the lack of a File API.)

@mccalluc mccalluc added enhancement New feature or request UI labels Jan 27, 2020
@mccalluc mccalluc removed the NEXT label Feb 27, 2020
@mccalluc
Copy link
Contributor Author

@mruffalo : Can you comment on this? Behind the UMAP, there is presumably higher dimensional data... would it be useful to have a heatmap for that? Or is there a better person to ask?

@mccalluc
Copy link
Contributor Author

@ilan-gold : Do you know if heatmaps would be interesting or useful with this data, or is this just something different?

@ilan-gold
Copy link
Member

It's on the radar here: hubmapconsortium/portal-containers#30. The main potential issue is the size - the cell x gene space is much larger than anything we have done previously, on an order of 1000s x 1000s from what I understand. Also, I'm not sure the data comes through right now - we may have to ask for it, but I could be wrong.

@mruffalo
Copy link
Contributor

The expression data (normalized, filtered, etc.) is present in the h5ad files you're reading for conversion to Arrow format -- right now you're only pulling out the UMAP coordinates and cluster assignments.

This is stored in the X (data matrix) attribute of the AnnData object:

In [1]: import anndata                                                                          

In [2]: d = anndata.read_h5ad('cluster_marker_genes.h5ad')                                      

In [3]: d.X                                                                                     
Out[3]: 
array([[ 0.05571311, -0.07759782, -0.0441038 , ..., -0.04950992,
        -0.05211497,  1.8924463 ],
       [-0.4334068 , -0.07759782, -0.0441038 , ..., -0.04950992,
        -0.05211497, -0.31383678],
       [ 0.52023876, -0.07759782, -0.0441038 , ..., -0.04950992,
        -0.05211497,  0.45089713],
       ...,
       [-0.4334068 , -0.07759782, -0.0441038 , ..., -0.04950992,
        -0.05211497, -0.31383678],
       [-0.4334068 , -0.07759782, -0.0441038 , ..., -0.04950992,
        -0.05211497, -0.31383678],
       [-0.4334068 , -0.07759782, -0.0441038 , ..., -0.04950992,
        -0.05211497, -0.31383678]], dtype=float32)

@keller-mark
Copy link
Member

Closing this, duplicate of #696

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants