Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normal samples required? #10

Open
MjelleLab opened this issue Dec 12, 2023 · 4 comments
Open

Normal samples required? #10

MjelleLab opened this issue Dec 12, 2023 · 4 comments

Comments

@MjelleLab
Copy link

When running PAMES::get_purity(beta) it seems to require normal samples beta values and the AUC-file. Is it possible to run it using only beta-values from the tumor sample?

Best,

@romagnolid
Copy link
Collaborator

Hi, you can run get_purity using a pre-generated set of informative CpG sites.
Here https://github.com/cgplab/PAMESdata you can find different sets for 14 tumor types

Otherwise, normal samples are required to generate the AUC file and use find_informative_sites.

@MjelleLab
Copy link
Author

MjelleLab commented Dec 14, 2023

Thanks @romagnolid
Unfortunately I need brain (LGG) form TCGA. I have access to TCGA myself and found the *level3betas.txt files. How would you go about creating a normal profile based on these files? Are you using blood-normal or tissue-normal?

@romagnolid
Copy link
Collaborator

I checked and LGG have very few control samples available on TCGA but you can retreive some data from GEO or from EWAS datahub.

Next it's just three steps

library(PAMES)

N=20
auc_vector <- get_AUC(tumor_data, control_data, cores=N)
info_sites <- find_informative_sites(tumor_data,
                                     control_data,
                                     auc_vector,
                                     illumina450k_hg38, cores=N)
purity_data <- get_purity(tumor_data, info_sites)

@romagnolid
Copy link
Collaborator

romagnolid commented Dec 18, 2023

Tissue-normal is required as control. You can either download all the samples and create a matrix of beta-values (bind each level3betas.txt column-wise) or use a package such as TCGABiolinks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants