-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Concordance of resistance predictions with phenotype #76
Comments
First draft of the figure for this analysis Figure 2Number of resistant (left) and susceptible (right) phenotypes correctly identified by mykrobe from Illumina (blue) and Nanopore (purple) data from the same samples. The red bars indicate missed (FN) or incorrect (FP) predictions. The x-axis shows the drugs with available phenotype data that mykrobe also makes predictions for. E - ethambutol; H - isoniazid; Z - pyrazinamide; R - rifampicin; S - streptomycin; Km - kanamycin; Am - amikacin; Ofx - ofloxacin; Cm - capreomycin; Mfx - moxifloxacin. |
We could conceivable also use a table (or replace the figure with a table if more informative)
|
Cool! So essentially identical results except slightly better VME for amikacin, isoniazid and streptomycin, and slightly worse ME for isoniazid . I call that a win |
Wait a minute, how come pyrazinamide is in the table, I thought we didn't have any phenotypes for that? Did I forget/get that wrong??? |
We have 1 sample with PZA DST haha. Maybe I just leave it out then? |
Here is another plot that is very insightful Effect of Nanopore read depth on mykrobe phenotype prediction. Each point indicates the proportion (y-axis) of classifications of that type at the read depth (x-axis). Read depth is "binned". That is, read depth 40 is all samples with a read depth greater than 40 and less than or equal to 50. FP - false positive; TN - true negative; etc. |
For this table #76 (comment) |
I don't really understand where the confidence intervals come from? The values aren't the result of any kind of aggregation/averaging... |
The confidence intervals inform you how much you can trust the rate (FPR, VME, whatever) based on the number of samples. |
I see. Ok, I've added that in using the Wilson score interval - which is the same as was used in the recent mykrobe paper. However, I notice the Nature Comms paper used Clopper–Pearson confidence interval. Although I don't think the two are that different. |
Channeling my inner Michael patrolling appropriate slack channels... Is about concordance with illumina, which is a different issue. |
Ah, right you are. Thank you! |
I am going to start writing the results section with the following final (pending major issues) plot and table
|
We do not have phenotype data for every sample/drug, so just for those we do have.
Similar to #75, we will have a table showing Very Major Error (missed resistance), Major Error (missed susceptible), PPV (what % of R calls are R) and NPV (what % of S calls are S) for each tool.
The text was updated successfully, but these errors were encountered: