Smlewis/update cyrus master oct20 #9

smlewis · 2020-10-20T18:42:36Z

updates cyrus/master with a few years of commons stuff. Note in particular the changes to make_fragments.pl, which I think @danpf made. This one needs consideration before merging since we use this branch.

These tools are for generating the SQL databases required to run MHCEpitopePredictorExternal predictors with the mhc_energy scoreterm.

Added the capability to look directly for the Rosetta database, assuming that this is being run from Rosetta/tools/mhc_energy_tools and there also exists Rosetta/main/database. Also changed os.environ to os.getenv, which returns None instead of crashing if $ROSETTA hasn't been set, and tweaked the error handling. The logic of how to find the file could maybe be cleaner, but this works for now.

…sted from being used as a template in antibody.cc. we found that this template (nano body) was problematic because and causes antibody.cc run to fail because it was trying to graft a H2 loop from this template that had a cysteine that is part of a disulfide with other parts of the non-grafted segment of the nanobody framework. we find that in previous versions of the the PDB (before 2012), this template (4w6w) would not be selected and so antibody.cc would run successfully.

…ying CDRs for antibody homology modeling.

… script.

…ain function.

The script would search for all alleles, but only the alleles that met the threshold were stored in details. This also disrupted the summary table. Now, all peptide/allele combos have their details stored in details, but we count if the score meets the threshold using meet-thresh.

…black listed from being used as a template in antibody.cc. we found that this template (nano body) was problematic because and causes antibody.cc run to fail because it was trying to graft a H2 loop from this template that had a cysteine that is part of a disulfide with other parts of the non-grafted segment of the nanobody framework. we find that in previous versions of the the PDB (before 2012), this template (4w6w) would not be selected and so antibody.cc would run successfully." This reverts commit 668467a.

…lack_list_4w6w Add the nanobody 4w6w to the outlier_list

…hare_anarci2rosetta Davela/share anarci2rosetta

…. Added rudimentary plotting capabilities.

Commits 6e5909c and bd45bb4)

I moved matplotlib so that it is only imported if plotting is used, to remove this dependency unless needed. Also fixed the help text in score.py.

Peptides in reports were being output in an arbitrary order with incorrect position numbers. This has now been fixed, and should work with repetitive sequences as well.

A bit more stringent error checking when we keep track of the positions in this way. Also, changed the absolute scoring to the log-transformed format 1-log50k(aff) instead of the affinity. Also added a .gitignore that will ignore output files when running the demo script.

db.py now supports two formats of PSSMs: the one from command line psiblast, and the one from NCBI's PSSM viewer output. If the PSSM is not in that format, it will try to process it, outputting a warning along the way.

…tope breaks; forced 'X' to be epitope break in NetMHC

extern "C" {...} is treated like a namespace extern int foo() { ... } is treated like a function

The extra debugging output that I had commented back in was leading the serialization test pipeline to conclude that all of the files were failing.

This PR massively reshapes the python_cc_reader module as this module is converted to python3. The directory structure is as follows: ``` tools/ python_cc_reader/ python_cc_reader/ beauty/ code_improvement/ cpp_parser/ external/ inclusion_removal/ library_splitting/ tests/ utility/ ``` The rationale for this dirname-within-dirname structure was given on this page: https://docs.python-guide.org/writing/structure/ At the top level `python_cc_reader`directory live the user-level scripts such as `library_levels.py` and `beautify_changed_files_in_branch.py.` Within the lower level `python_cc_reader/python_cc_reader` directories live the modules that actually do all the heavy lifting. These scripts are imported by a number of other scripts in the `tools` repository, and I have updated all of these scripts. My intention is to create this PR as a permanent record of the merge to master which I am going to make immediately after opening this PR. I will merge this to master in the `tools` repository, and then I will be merging a PR in the `main` repository (PR 4590) that updates the `tools` submodule immediately afterwards.

The URL for the antibody numbering converter changed slightly. Update accordingly.

…/fix_antibody_renumber Fix the convert_pdb_to_antibody_numbering_scheme.py script The URL for the antibody numbering converter changed slightly. Update accordingly.

…submodule2

…repo. This may or may not work for the general public, as I think (but am not sure) I was able to yank out the Meiler-lab specific things.

…submodule2

…mmons_master_oct20

everyday847 and others added 30 commits July 31, 2018 15:19

Avoid using rna_denovo_setup.py (deprecated) in helix_preassemble_setup

6ca3e0d

Added mhc_energy_tools, for generating mhc databases

fe5698a

These tools are for generating the SQL databases required to run MHCEpitopePredictorExternal predictors with the mhc_energy scoreterm.

Add anarci2rosetta.py script to the rosetta tools for use for identif…

5af1871

…ying CDRs for antibody homology modeling.

udpated outlier_list

71c8087

Merge branch 'davela/black_list_4w6w'

b025560

removed sym link to anarc2rosetta.py. sorry! repalced it the original…

056c08c

… script.

Merge branch 'davela/share_anarci2rosetta'

ea56b4c

Added argument handling, documentation, and wrapped everything into m…

868d148

…ain function.

Merge branch 'davela/share_anarci2rosetta'

7c9bdc8

made epi_thresh actually be used

8859de6

Resolved conflict with outlier_list.

964004c

fixed typo

7134e37

Merge branch 'davela/share_anarci2rosetta'

df94782

Merge pull request RosettaCommons#73 from CyrusBiotechnology/davela/b…

eea69a0

…lack_list_4w6w Add the nanobody 4w6w to the outlier_list

Merge pull request RosettaCommons#74 from CyrusBiotechnology/davela/s…

39c444c

…hare_anarci2rosetta Davela/share anarci2rosetta

flag to restrict mutable positions

ced6c26

Added exec permissions to mhc database python scripts

c5ae1ca

Added demo files and example invocations; fixed up scripts to support…

6e5909c

…. Added rudimentary plotting capabilities.

demo files

bd45bb4

Merging Chris' demo files

178a664

Commits 6e5909c and bd45bb4)

Made matplotlib optional, and fixed help of score.py

e4f67bd

I moved matplotlib so that it is only imported if plotting is used, to remove this dependency unless needed. Also fixed the help text in score.py.

Fixed a bug in db.py help text

a2d1efa

Fixed netmhcii.py to output peptides in order

c516b01

Peptides in reports were being output in an arbitrary order with incorrect position numbers. This has now been fixed, and should work with repetitive sequences as well.

Make PSSM parsing more robust in db.py

05066bf

db.py now supports two formats of PSSMs: the one from command line psiblast, and the one from NCBI's PSSM viewer output. If the PSSM is not in that format, it will try to process it, outputting a warning along the way.

multi-chain pdb files in db.py; chain breaks in pdb files causing epi…

8ad0ace

…tope breaks; forced 'X' to be epitope break in NetMHC

aleaverfay and others added 30 commits March 26, 2020 12:25

Temporarily print the PYTHONPATH from beautification script

4a2e303

Improve handling of extern in beautifier

968c8bf

extern "C" {...} is treated like a namespace extern int foo() { ... } is treated like a function

ok, let's print out the syspath on the testing server

708ae1b

Adding dummy fork_manager to test an import idea

fd33e72

trying to print a little more information

cdf3c8b

print path to python_cc_reader module

5a12033

Adding __init__.py files to python_cc_reader modules

c5f9a43

what version of python is running on the testing server?

0a7484d

Remove debugging code

b972fc5

Potentially fix Rocco's problem with [[ attribute ]] beautification

8ebe4a1

Fix python3 popen output processing in beautifier

f65e50d

Update import statements to python_cc_reader module

c9bccc6

Convert clang ast tools to python3

db996bc

Update serialization validator Popen call for python3

e599eca

Push serialization debugging modifications to the testing server

3d76cf0

Remove debugging output from serialization validator

e06bcf4

The extra debugging output that I had commented back in was leading the serialization test pipeline to conclude that all of the files were failing.

Update the in-code paths for includes

3d5564e

Fix missing addition operator.

ccac4f1

Updated reccea scripts for python3

162b3f2

Fix the convert_pdb_to_antibody_numbering_scheme.py script

70dc5c7

The URL for the antibody numbering converter changed slightly. Update accordingly.

Merge pull request RosettaCommons#92 from RosettaCommons/roccomoretti…

053a198

…/fix_antibody_renumber Fix the convert_pdb_to_antibody_numbering_scheme.py script The URL for the antibody numbering converter changed slightly. Update accordingly.

Merge remote-tracking branch 'origin/master' into roccomoretti/rdkit_…

e158a52

…submodule2

Add the score_to_b_factor.py application from the Meiler Lab scripts …

7a66d53

…repo. This may or may not work for the general public, as I think (but am not sure) I was able to yank out the Meiler-lab specific things.

Merge remote-tracking branch 'origin/master' into roccomoretti/rdkit_…

c74efab

…submodule2

Fix spacing issue in header compile

13c836f

Fix typo on external library path.

ed38bc1

Fix commenting mistake.

a52143c

Merge remote-tracking branch 'upstream/master' into smlewis/update_co…

6dd1050

…mmons_master_oct20

Merge branch 'smlewis/update_commons_master_oct20' into cyrus/master

46ff911

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Smlewis/update cyrus master oct20 #9

Smlewis/update cyrus master oct20 #9

smlewis commented Oct 20, 2020

Smlewis/update cyrus master oct20 #9

Are you sure you want to change the base?

Smlewis/update cyrus master oct20 #9

Conversation

smlewis commented Oct 20, 2020