Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems with copynumber analysis #28

Closed
jshdydh123 opened this issue Apr 28, 2024 · 4 comments
Closed

Problems with copynumber analysis #28

jshdydh123 opened this issue Apr 28, 2024 · 4 comments

Comments

@jshdydh123
Copy link

Hi,I wanted to express my appreciation for your incredible tool.

While utilizing the T1K-copynumber function, I encountered an issue. Specifically, when running the command python3 t1k-copynumber.py -g XXX_genotype.tsv --nomissing KIR3DL3,KIR2DL4,KIR3DP1,KIR3DL2, I observed that in the output file generated by copynumber, several framework genes, such as KIR2DL4, have a log-ratio of 0.00, resulting in a copynumber of 0.

However, upon examining the genotype.tsv file for these genes, I noticed that the abundance of KIR2DL4 is relatively high and the quality score is 60. I am curious about how to interpret this situation and whether there is a solution to address it.
The sample genotype.tsv file and copynumber output file are shown in the figure.

I appreciate your time and assistance in resolving this matter.

huang
kir_copynumber
kir_gentype tsv

@mourisl
Copy link
Owner

mourisl commented Apr 28, 2024

The script finds there are two copies for 2DL4 "2DL4*005 2 0.00", the number after the allele (abundance in the original genotype.tsv file), is the number of copies. Though based on your screenshot of the abundance, I think the log-likelihood difference with the next likely copy number should be larger, at least shouldn't be 0.00. Could you please paste the genotype.tsv as a text and I will look into this issue. Thank you.

@jshdydh123
Copy link
Author

Thank you for your reply.
This is the genotype.tsv file mentioned above.
t1k_kir_genotype_tsv.txt

@mourisl
Copy link
Owner

mourisl commented Apr 28, 2024

Thank you for sharing the file. The "quality_score" column in the copy number output is the difference of the likelihood, and in this case both likelihood are very small, so the difference becomes less than 0.01. I have changed the value to the difference of log-likelihood (which I think I used this value at some point), so the difference is more obvious now.

@jshdydh123
Copy link
Author

Thank you very much for your assistance. With the adjustments made to the copynumber function, I have successfully resolved the issue mentioned earlier.

@mourisl mourisl closed this as completed Apr 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants