how to handle missing values #16

cbaakman · 2021-11-29T11:42:21Z

Currently reported types of missing values:

atom types that are not in the force field, so no charge values or van der Waals parameters are available for them
conservation scores that are NaN, because the protein wasn't aligned at that particular position

Possible ways to handle this:
solution 1: skip PDB entries that have missing values
solution 2: remove the residues/atoms with missing values from the structure, after the PDB file is loaded.
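
A minimal sketch of the two options, not the project's actual code. It assumes each residue is a plain dict with a `features` mapping; that layout and all function names here are hypothetical, chosen only for illustration.

```python
import math


def is_missing(value):
    """Return True for None or a float NaN."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def has_missing(residue):
    """Return True if any feature of this (hypothetical) residue dict is missing."""
    return any(is_missing(v) for v in residue["features"].values())


def skip_entry_if_incomplete(residues):
    """Solution 1: reject the whole entry (return None) if anything is missing."""
    if any(has_missing(r) for r in residues):
        return None
    return residues


def drop_incomplete_residues(residues):
    """Solution 2: keep the entry, but remove residues with missing values."""
    return [r for r in residues if not has_missing(r)]


# Toy data: the second residue has no charge and a NaN conservation score.
residues = [
    {"name": "ALA", "features": {"charge": -0.1, "conservation": 0.8}},
    {"name": "UNK", "features": {"charge": None, "conservation": float("nan")}},
]

skipped = skip_entry_if_incomplete(residues)
kept = drop_incomplete_residues(residues)
```

Solution 1 is simpler but can discard an entire entry for a single bad atom; solution 2 keeps more data but changes the structure that later steps see.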

cbaakman · 2021-12-13T09:56:17Z

If a variant is mapped to multiple PDB entries, we can choose one that has no NaNs. However, we don't know which entries have NaNs until the script starts preprocessing them. So we need a more flexible script that can cancel the preprocessing of an entry during the run.
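
One way this could look, as a rough sketch under assumptions: preprocessing an entry raises an exception as soon as it hits a missing value, and the caller moves on to the next candidate. `MissingValueError`, `preprocess`, `preprocess_first_complete`, the `load_features` callback, and the PDB IDs below are all hypothetical placeholders, not existing code.

```python
import math


class MissingValueError(Exception):
    """Raised to cancel the preprocessing of a single PDB entry."""


def preprocess(pdb_id, load_features):
    """Load features for one entry; abort as soon as a missing value is seen."""
    features = load_features(pdb_id)
    for name, value in features.items():
        if value is None or (isinstance(value, float) and math.isnan(value)):
            raise MissingValueError(f"{pdb_id}: missing value for {name}")
    return features


def preprocess_first_complete(pdb_ids, load_features):
    """Try candidates in order; return (pdb_id, features) for the first complete one."""
    for pdb_id in pdb_ids:
        try:
            return pdb_id, preprocess(pdb_id, load_features)
        except MissingValueError:
            continue  # cancel this entry and try the next candidate
    return None  # every candidate had missing values


# Toy data with made-up IDs: the first candidate has a NaN conservation score.
fake_data = {
    "1ABC": {"charge": 0.2, "conservation": float("nan")},
    "2XYZ": {"charge": 0.2, "conservation": 0.7},
}

result = preprocess_first_complete(["1ABC", "2XYZ"], fake_data.__getitem__)
```

When no candidate is complete the function returns `None`, at which point the caller could fall back to solution 1 or 2 above.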

cbaakman added the enhancement label Nov 29, 2021

cbaakman self-assigned this Nov 29, 2021

rgayatri closed this as completed Apr 15, 2022

rgayatri reopened this Apr 15, 2022
