Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Smlewis/update cyrus master oct20 #9

Open
wants to merge 306 commits into
base: cyrus/master
Choose a base branch
from

Conversation

smlewis
Copy link
Collaborator

@smlewis smlewis commented Oct 20, 2020

updates cyrus/master with a few years of commons stuff. Note in particular the changes to make_fragments.pl, which I think @danpf made. This one needs consideration before merging since we use this branch.

everyday847 and others added 30 commits July 31, 2018 15:19
These tools are for generating the SQL databases required to run
MHCEpitopePredictorExternal predictors with the mhc_energy scoreterm.
Added the capability to look directly for the Rosetta database, assuming
that this is being run from Rosetta/tools/mhc_energy_tools and there
also exists Rosetta/main/database.

Also changed os.environ to os.getenv, which returns None instead of
crashing if $ROSETTA hasn't been set, and tweaked the error handling.

The logic of how to find the file could maybe be cleaner, but this works
for now.
…sted from being used as a template in antibody.cc. we found that this template (nano body) was problematic because and causes antibody.cc run to fail because it was trying to graft a H2 loop from this template that had a cysteine that is part of a disulfide with other parts of the non-grafted segment of the nanobody framework. we find that in previous versions of the the PDB (before 2012), this template (4w6w) would not be selected and so antibody.cc would run successfully.
The script would search for all alleles, but only the alleles that met
the threshold were stored in details.  This also disrupted the summary
table.  Now, all peptide/allele combos have their details stored in
details, but we count if the score meets the threshold using
meet-thresh.
…black listed from being used as a template in antibody.cc. we found that this template (nano body) was problematic because and causes antibody.cc run to fail because it was trying to graft a H2 loop from this template that had a cysteine that is part of a disulfide with other parts of the non-grafted segment of the nanobody framework. we find that in previous versions of the the PDB (before 2012), this template (4w6w) would not be selected and so antibody.cc would run successfully."

This reverts commit 668467a.
…lack_list_4w6w

Add the nanobody 4w6w to the outlier_list
…hare_anarci2rosetta

Davela/share anarci2rosetta
I moved matplotlib so that it is only imported if plotting is used, to
remove this dependency unless needed.  Also fixed the help text in
score.py.
Peptides in reports were being output in an arbitrary order with
incorrect position numbers.  This has now been fixed, and should work
with repetitive sequences as well.
A bit more stringent error checking when we keep track of the positions
in this way.

Also, changed the absolute scoring to the log-transformed format
1-log50k(aff) instead of the affinity.

Also added a .gitignore that will ignore output files when running the
demo script.
db.py now supports two formats of PSSMs: the one from command line
psiblast, and the one from NCBI's PSSM viewer output.  If the PSSM is
not in that format, it will try to process it, outputting a warning
along the way.
…tope breaks; forced 'X' to be epitope break in NetMHC
aleaverfay and others added 30 commits March 26, 2020 12:25
extern "C" {...} is treated like a namespace
extern int foo() { ... } is treated like a function
The extra debugging output that I had commented back in was leading the
serialization test pipeline to conclude that all of the files were
failing.
This PR massively reshapes the python_cc_reader module as this module is converted to python3.

The directory structure is as follows:
```
tools/
    python_cc_reader/
        python_cc_reader/
            beauty/
            code_improvement/
            cpp_parser/
            external/
            inclusion_removal/
            library_splitting/
            tests/
            utility/
```

The rationale for this dirname-within-dirname structure was given on this page:

https://docs.python-guide.org/writing/structure/

At the top level `python_cc_reader`directory live the user-level scripts such as `library_levels.py` and `beautify_changed_files_in_branch.py.` Within the lower level `python_cc_reader/python_cc_reader` directories live the modules that actually do all the heavy lifting.

These scripts are imported by a number of other scripts in the `tools` repository, and I have updated all of these scripts.

My intention is to create this PR as a permanent record of the merge to master which I am going to make immediately after opening this PR. I will merge this to master in the `tools` repository, and then I will be merging a PR in the `main` repository (PR 4590) that updates the `tools` submodule immediately afterwards.
The URL for the antibody numbering converter changed slightly. Update accordingly.
…/fix_antibody_renumber

Fix the convert_pdb_to_antibody_numbering_scheme.py script

The URL for the antibody numbering converter changed slightly. Update accordingly.
…repo.

This may or may not work for the general public, as I think (but am not sure) I was able to yank out the Meiler-lab specific things.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.