Implement AmberTools WBO #508

Merged
merged 17 commits into master from ambertools-wbo on Feb 13, 2020
Conversation

j-wags
Member

@j-wags j-wags commented Feb 7, 2020

  • Closes Implement and test compute_wiberg_bond_orders in AmberToolsToolkitWrapper  #507 by implementing AmberToolsToolkitWrapper.assign_fractional_bond_orders
  • Change Molecule.compute_wiberg_bond_orders to Molecule.assign_fractional_bond_orders (This goes along with another planned change, from Molecule.compute_partial_charge to Molecule.assign_partial_charges)
  • Change ToolkitWrapper.compute_wiberg_bond_orders to ToolkitWrapper.assign_fractional_bond_orders
  • Add use_conformers kwarg to assign_fractional_bond_orders, which defaults to None. If None, the function generates its own conformer using an available ToolkitWrapper. Otherwise, if the user provides an iterable of conformers, the FBO calculation will use those.
  • Change keyword name for assign_fractional_bond_orders from charge_model to bond_order_model. The previous use of charge_model as a keyword name is a copy-paste mistake, since we copied so much of that function from Molecule.compute_partial_charges.
  • Change accepted keyword values for assign_fractional_bond_orders from am1 and pm3 to am1-wiberg and pm3-wiberg. This is because we may introduce bond order population analyses other than Wiberg in the future.
  • Change the expected Wiberg bond orders for the existing test molecule. Due to small differences between the values returned by OpenEye and AmberTools, I've relaxed the allowed aromatic bond order range.
  • Add tests
  • Update docstrings/documentation, if applicable
  • Update changelog

@codecov-io

codecov-io commented Feb 7, 2020

Codecov Report

Merging #508 into master will increase coverage by 0.66%.
The diff coverage is n/a.

@j-wags
Member Author

j-wags commented Feb 9, 2020

Ok, so for our "detailed" test molecule (protonated and deprotonated versions of [1]), AmberTools narrowly failed some tests that we had calibrated against OETK. Basically, we tested to ensure that every aromatic bond fell in the range of 1.15 to 1.60 in both the neutral molecule and anion. AmberTools returns values in the range 1.06 to 1.61, which caused these tests to fail. Here's a full summary:

| Charge state | AmberTools WBO range | OE WBO range |
| ------------ | -------------------- | ------------ |
| neutral      | 1.34 - 1.44          | 1.35 - 1.44  |
| anion        | 1.06 - 1.61          | 1.12 - 1.55  |

So basically AmberTools returns slightly more extreme values, but everything is around the same magnitude. Because these differences are small, I'm going to relax the aromatic tolerances for these tests to accept between 1.05 and 1.65.
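The relaxed check can be sketched as a simple range assertion. This is a minimal illustration, not the toolkit's actual test code (the helper name and data layout are invented here); the values are the AmberTools anion WBOs printed below.

```python
# Hypothetical sketch of the relaxed aromatic-WBO range check.
# Values are the AmberTools anion bond orders from the printout below.

def check_aromatic_wbo_range(orders, low=1.05, high=1.65):
    """Return True if every aromatic bond order falls within [low, high]."""
    return all(low <= o <= high for o in orders)

ambertools_anion = [
    1.42697576, 1.40317476, 1.40317794, 1.42697246,
    1.31119999, 1.31119746, 1.23414108, 1.23414195,
    1.61292386, 1.06733526, 1.06733486, 1.61292474,
]

# The old 1.15-1.60 window rejects these values; the relaxed one accepts them.
assert not check_aromatic_wbo_range(ambertools_anion, 1.15, 1.60)
assert check_aromatic_wbo_range(ambertools_anion)
```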

Huge thanks to @sukanyasasmal and @ChayaSt for making this thoughtful test. It was very quick to plug in AmberTools and get results!

[1]
(screenshot of the test molecule, 2020-02-08)

[2] Printout of aromatic bond orders in these tests (molecule is loaded from SDF so indexing is the same):

OpenEye neutral aromatic order: 1.4249782910821187
OpenEye neutral aromatic order: 1.4247535696661562
OpenEye neutral aromatic order: 1.3993880829721475
OpenEye neutral aromatic order: 1.386557474305936
OpenEye neutral aromatic order: 1.3864636016050376
OpenEye neutral aromatic order: 1.3721272356508605
OpenEye neutral aromatic order: 1.3798238226278605
OpenEye neutral aromatic order: 1.4308855238777132
OpenEye neutral aromatic order: 1.3485824537022644
OpenEye neutral aromatic order: 1.3484389250490467
OpenEye neutral aromatic order: 1.4361705759121075
OpenEye anion aromatic order: 1.4214715268113562
OpenEye anion aromatic order: 1.4082261746981803
OpenEye anion aromatic order: 1.4083384738085785
OpenEye anion aromatic order: 1.421413561882213
OpenEye anion aromatic order: 1.3418572306137813
OpenEye anion aromatic order: 1.3418629033811214
OpenEye anion aromatic order: 1.282103697809885
OpenEye anion aromatic order: 1.2819662464523884
OpenEye anion aromatic order: 1.5524099209448596
OpenEye anion aromatic order: 1.1228566259220674
OpenEye anion aromatic order: 1.1228857329592095
OpenEye anion aromatic order: 1.5523016417471136
AmberTools neutral aromatic order: 1.42847694
AmberTools neutral aromatic order: 1.42838216
AmberTools neutral aromatic order: 1.38847364
AmberTools neutral aromatic order: 1.38967052
AmberTools neutral aromatic order: 1.38955476
AmberTools neutral aromatic order: 1.36395535
AmberTools neutral aromatic order: 1.38584981
AmberTools neutral aromatic order: 1.42623811
AmberTools neutral aromatic order: 1.3548221
AmberTools neutral aromatic order: 1.3413636
AmberTools neutral aromatic order: 1.44752967
AmberTools anion aromatic order: 1.42697576
AmberTools anion aromatic order: 1.40317476
AmberTools anion aromatic order: 1.40317794
AmberTools anion aromatic order: 1.42697246
AmberTools anion aromatic order: 1.31119999
AmberTools anion aromatic order: 1.31119746
AmberTools anion aromatic order: 1.23414108
AmberTools anion aromatic order: 1.23414195
AmberTools anion aromatic order: 1.61292386
AmberTools anion aromatic order: 1.06733526
AmberTools anion aromatic order: 1.06733486
AmberTools anion aromatic order: 1.61292474

@j-wags changed the title from "[WIP] Implement AmberTools WBO" to "Implement AmberTools WBO" on Feb 9, 2020
@jchodera
Member

jchodera commented Feb 9, 2020

It's likely the case that OpenEye quacpac uses some "quick and dirty" tolerances and convergence criteria to give a semiquantitative result faster, so this is a perfectly reasonable approach!

@openff-dangerbot

openff-dangerbot commented Feb 11, 2020

Reviewer Roulette

A review has been requested!

To spread load more evenly across eligible reviewers, Danger has randomly picked a candidate for this review and assigned them to this PR.

If you need to run the roulette again (for example, if the assigned reviewer is unavailable), simply un-request a review from me, then request again. After about 45 seconds, I will update the message with a new random reviewer.

Reviewer
@jthorton

Review guidelines

Some notes from @j-wags that we can refine as we do this more

Timeline and responsibilities

The PR reviewer should perform a review within 48 hours of being assigned.

The review may take up to three hours.

If few or insignificant changes are needed, the reviewer should accept the PR.

If substantial fixes should be made (see categories below), the reviewer should request changes, indicating which comments are high-priority (blocking), as opposed to questions or comments.

The PR author will then correct any blocking issues and re-request review. The re-review should focus just on the changes that were requested and any new code that was added in response.

The PR author is the only person who should modify the code in the branch, and it is customary to let them press the "merge" button once the PR is approved. Either person can "resolve" non-blocking comment threads, but only the reviewer should "resolve" comment threads that prompt a re-review.

The PR author

The person requesting the review should ensure that the purpose of the review is clearly explained, by linking relevant GitHub Issues in the PR body text (ex "Closes #12"), using clear variable names, commenting non-obvious code, and identifying areas of the diff that are unusual (ex. "the molecule_to_string function was cut and pasted to a different file and didn't change, so don't review it"). If the PR diff is larger than 300 lines, they should identify the area to prioritize for the review, to let the reviewer add as much value as possible if they are time-constrained.

The PR assignee

The newly-assigned reviewer should acknowledge that they received the request, and confirm that they can perform the review within 48 hours. Generally, a good review strategy is to:

  • Ensure that you understand the overall purpose of the codebase (consider looking at the codebase's tests or examples to understand how the functions are used)
  • Read through the body text of the PR (click through to any relevant tagged issues to understand the discussion around the changes)
  • Ensure that you understand the purpose of the changes in the PR
  • Read over the entire PR diff (minus data files) without commenting
  • Then, begin adding comments or thoughts, in the order of
    • Examples
    • Tests
    • Core code

Types of comments

I've found that my comments fall into a few rough categories. This is a list of them in descending order of value:

[If any of these are present, you should request changes]

  • Conceptual -- "I don't understand what these changes aim to do"
  • Scientific -- "This won't work because nitrogen can have four bonds"
  • Algorithmic -- "This may crash because the input list is empty"
  • Testing -- "This functionality was added, but never tested". If the codebase is very young, this may be waived, but after a few months, test coverage should strictly increase with each PR.

[These may be blocking at the reviewer's discretion]

  • API -- "The name of this function is confusing, and I'd expect it to do something else"
  • Documentation -- "Add docstrings to these functions/Improve the ones that are there"

[Simple fixes that generally aren't blocking]

  • Grammar -- "It's its, not its'"
  • Readability -- "This triple list comprehension is unnecessary and would be really difficult to debug"

A few other tips for reviewers:

  • In larger-scale software development, the code review begins almost before any code is written. These conversations cover architecture, API design and specifications, and provide a blueprint for the work to be done. It's hard to receive code reviews after you've written an entire new module saying "actually, these two classes should be three classes", because it's requesting a large-scale change that would add days if not weeks to the development time. We are here to experiment with cool science, so be sure to give the code author an "out" if you're asking for large, not-completely-essential changes, by recommending that the refactor become an Issue that can be discussed over time and addressed as the code becomes more mature.
  • Give concrete suggestions whenever possible. It's better to say "Consider renaming this to molecule_to_string" than "this function name is confusing".
  • If you give open-ended feedback, be active in the discussion thread as the author asks for clarification or proposes solutions.
  • If there are three or fewer instances of the same concern, it's OK to point out each with a separate comment. However, if there are lots of repeats, just say "this issue appears throughout the proposed changes".
  • Say some nice things. I've learned a lot from doing code reviews, and it's cool when people point out a trick from my code that they'll use later, or something that they thought was particularly well-designed. As a dispersed team, we don't have the lab banter that usually clears up interpersonal tension, so be positive whenever you have the chance.
  • Every codebase has a different history and level of maturity, and each PR should ensure that the changes improve its overall quality a little bit. So adjust the level of the review based on what stage you see the software in.
  • Aim for around 10 to 15 comments. More than that is overwhelming.
  • Watch out for toolkit- or file-dependent behavior in tests. Just because the example molecule puts all the heavy atoms before the hydrogens doesn't mean that will always be the case.
  • You are certainly not limited by the above. If you'd like to learn more, consider reading Google's Code Review Guide and feel free to suggest changes to this message.
  • Keeping a checklist while you review might be useful!
Checklist example

- [X] This task has been finished
- [ ] This item is still pending

What am I?

I am the Open Force Field Dangerbot. You can find my code and installation instructions
at https://github.com/openforcefield/dangerbot.

This reviewer was selected out of a list of Open Force Field volunteers: ["j-wags", "jaimergp", "simonboothroyd", "trevorgokey", "vtlim", "dfhahn", "jthorton", "chayast", "dgasmith", "maxentile", "jchodera"]

@@ -33,18 +47,18 @@ Behavior changed
""""""""""""""""
- `PR #469 <https://github.com/openforcefield/openforcefield/pull/469>`_:
When running :py:meth:`Topology.to_openmm <openforcefield.topology.Topology.to_openmm>`, unique atom names
are generated (overriding any existing atom names) if the provided atom names are not unique. This
are generated if the provided atom names are not unique (overriding any existing atom names). This
Member Author

Changes to releasehistory below here are small grammar/formatting/syntax fixes and are not related to this PR.

# Prepare sqm.in file as if we were going to run charge calc
# TODO: Add error handling if antechamber chokes
subprocess.check_output([
"antechamber", "-i", "molecule.sdf", "-fi", "sdf", "-o",
Collaborator

I haven't used subprocess.check_output before, so I'm not sure if you can get the same effect using subprocess.run, but run is what's recommended since Python 3.5: https://docs.python.org/3/library/subprocess.html.

Member Author

@dgasmith recently recommended using check_output, but that was in the context of replacing several os.system calls.

The big difference between run and check_output [1] according to the docs is that check_output operates like run, except the keyword argument check=True is always set. This is good for our use case, because if some external dependency throws an error on the command line, we should raise an exception for the user [2]. I'm happy to standardize on either solution, but right now it will save us a few characters to use check_output everywhere.

[1] https://docs.python.org/3/library/subprocess.html#subprocess.check_output
[2] Except I've heard tales that tleap is shady about this -- apparently it can return a non-zero exit code but still finish successfully
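The equivalence described above is easy to demonstrate. A minimal sketch (using harmless shell commands, not the toolkit's actual antechamber invocation): both spellings raise CalledProcessError on a non-zero exit code, and check_output additionally captures stdout by default.

```python
# Sketch: subprocess.check_output behaves like subprocess.run with
# check=True and stdout captured. Not the toolkit's actual calls --
# "echo" and "false" stand in for antechamber/sqm here.
import subprocess

out1 = subprocess.check_output(["echo", "hello"])
out2 = subprocess.run(["echo", "hello"], check=True,
                      stdout=subprocess.PIPE).stdout
assert out1 == out2 == b"hello\n"

# A failing command raises CalledProcessError either way.
failed = False
try:
    subprocess.run(["false"], check=True)
except subprocess.CalledProcessError as e:
    failed = e.returncode == 1
assert failed
```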

raise (IOError("Antechamber not found, cannot run "
"AmberToolsToolkitWrapper.assign_fractional_bond_orders()"))

if len(molecule._conformers) == 0:
Collaborator

I understand this point, but this could be a pain for users. Isn't it better to assume we need to generate a conformer if one is not supplied, and just do this for them?

Member Author

Good call, and actually this parallels an API change that we'll make with compute_partial_charges. I'll make this change.

Member Author

Addressed in 8bd7fec

"molecule.generate_conformers() before calling molecule.assign_fractional_bond_orders"
)
if len(molecule._conformers) > 1:
logger.warning(f"Warning: In AmberToolsToolkitWrapper.assign_fractional_bond_orders: "
Collaborator

How slow would it be to generate the bond order for all of the supplied conformers and average over them in some way? Could this even be done in parallel?

Member Author

I think conformer averaging would be a nice thing to do. But, like ELF10, it's not clear that there's a single common-sense way to do it. So, I'd like to leave conformer averaging algorithms as something for the community to experiment with. Once someone wants to try it, they can invent a new keyword for bond_order_model, and implement support for it in a ToolkitWrapper.
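For anyone who wants to experiment, one naive averaging scheme might look like the following. This is a purely hypothetical sketch, not a toolkit feature; the function name and data layout are invented, and (as noted above) the right averaging scheme is an open scientific question.

```python
# Hypothetical sketch: compute WBOs per conformer, then take the
# arithmetic mean per bond. NOT a toolkit feature.

def average_bond_orders(per_conformer_wbos):
    """per_conformer_wbos: list of dicts mapping (i, j) atom-index
    pairs to Wiberg bond orders, one dict per conformer."""
    n = len(per_conformer_wbos)
    keys = per_conformer_wbos[0].keys()
    return {k: sum(c[k] for c in per_conformer_wbos) / n for k in keys}

conf_a = {(0, 1): 1.40, (1, 2): 1.10}
conf_b = {(0, 1): 1.44, (1, 2): 1.06}
avg = average_bond_orders([conf_a, conf_b])
assert abs(avg[(0, 1)] - 1.42) < 1e-9
assert abs(avg[(1, 2)] - 1.08) < 1e-9
```

Running each conformer's sqm calculation is independent, so the per-conformer step could indeed be parallelized, e.g. with concurrent.futures.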

Contributor

Definitely seems like a thing to do separately/unresolved science.

# Note that sqm calculates WBOs for ALL PAIRS of atoms, not just those that have
# bonds defined in the original molecule. So here we iterate over the bonds in
# the original molecule and only nab the WBOs for those.
for bond in molecule.bonds:
Collaborator

Is it possible this could go wrong and we pull out a bond order for the wrong pair of atoms due to file reordering somewhere in the workflow? Should we compare atom symbols as well to make sure the bond is correct?

Member Author

Is it possible this could go wrong and we pull out a bond order for the wrong pair of atoms due to file reordering somewhere in the workflow?

Yes.

Right now, AmberToolsToolkitWrapper.assign_fractional_bond_orders and compute_partial_charges assume that the atom indexing is not modified by antechamber. I don't know if this is guaranteed everywhere, but nobody has reported a bug so far. The toolkit makes several assumptions like this, and beyond our "throw a lot of molecules through it, occasionally reversing the atom ordering, and see if it's alright" tests, I doubt even the AMBER manual says anything about this.

So, this is something that occasionally keeps me up at night, but other than sifting through a bunch of fortran, I'm not sure that we can really prove this assumption.

Long story short: added in 43377fe
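The element-symbol sanity check discussed above can be sketched like this. All names here are hypothetical illustrations, not the toolkit's actual internals: the idea is just that when pulling a WBO for a bond out of sqm's all-pairs output, you verify the element symbols at those atom indices still match the source molecule, which would catch a silent reordering.

```python
# Hypothetical sketch of a defensive WBO lookup with an element-symbol
# check against silent atom reordering. Invented names throughout.

def lookup_wbo(bond, sqm_orders, source_symbols, sqm_symbols):
    """bond: (i, j) atom-index pair; sqm_orders: dict of sorted
    (i, j) pairs -> WBO; *_symbols: element symbols by atom index."""
    i, j = sorted(bond)
    if (source_symbols[i], source_symbols[j]) != (sqm_symbols[i], sqm_symbols[j]):
        raise ValueError(f"Atom ordering mismatch at bond {bond}")
    return sqm_orders[(i, j)]

symbols = ["C", "C", "O", "H"]
orders = {(0, 1): 1.41, (1, 2): 1.05, (0, 3): 0.94}
assert lookup_wbo((1, 0), orders, symbols, symbols) == 1.41
```

Symbol agreement can't prove the indexing is identical (two atoms of the same element could still be swapped), but it would catch many gross reordering bugs cheaply.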

@jthorton
Collaborator

jthorton commented Feb 12, 2020

Overall looks really good, with great test coverage! One thing to check is subprocess: I think you could get the same result using run with check=True.

@j-wags
Member Author

j-wags commented Feb 13, 2020

Thanks for the quick review, @jthorton. I've addressed your comments and will merge when tests pass

Collaborator

@jthorton jthorton left a comment

Looks great, excited to see the bond order interpolation now!

@j-wags j-wags merged commit 4eda341 into master Feb 13, 2020
@j-wags j-wags deleted the ambertools-wbo branch February 13, 2020 22:31
@j-wags j-wags added this to the 0.7.0 milestone Feb 14, 2020
Successfully merging this pull request may close these issues.

Implement and test compute_wiberg_bond_orders in AmberToolsToolkitWrapper