Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-conformer WBO Support #792

Open
SimonBoothroyd opened this issue Dec 16, 2020 · 4 comments
Open

Multi-conformer WBO Support #792

SimonBoothroyd opened this issue Dec 16, 2020 · 4 comments

Comments

@SimonBoothroyd
Copy link
Contributor

Is your feature request related to a problem? Please describe.

Currently both the AT and OE backends only use a single conformer to compute the Wiberg bond orders (WBOs). Should multiple conformers be used to compute the WBOs, and if so, how should the WBOs of multiple conformers be pooled / averaged.

This is a question which has been mentioned in several code comments and PR comments, but I think it's worth having a single issue to track this.

Given that Sage is slated to be the first OpenFF to include WBO interpolated parameters it is probably good to start revisiting this and thinking what the 'recommended approach' should be (both for fitting and for the end user).

Related comments this PR:

@j-wags

I think conformer averaging would be a nice thing to do. But, like ELF10, it's not clear that there's a single common-sense way to do it. So, I'd like to leave conformer averaging algorithms as something for the community to experiment with. Once someone wants to try it, they can invent a new keyword for bond_order_model, and implement support for it in a ToolkitWrapper.

@davidlmobley:

Definitely seems like a thing to do separately/unresolved science.

Describe the solution you'd like

Some possible solutions would be to:

  1. Just use a single conformer (current default). This would either be the first conformer provided by a user if multiple conformers are passed to use_conformers, or a single conformer generated in-place in assign_partial_charges. This would likely be the 'most' conformer dependant and input specific approach to computing the WBOs.

  2. Take a simple average of the WBOs for each provided (or in-place generated) conformer.

    a. Take the Boltzmann average of the WBOs for each provided (or in-place generated) conformer as is implemented in fragmenter.

@j-wags
Copy link
Member

j-wags commented Dec 16, 2020

I like the possible solutions listed above, and don't think they're mutually exclusive.

Option 1 is already in-place, and is triggered by the fractional_bondorder_method="AM1-Wiberg" attribute in the ProperTorsions or Bonds element of a SMIRNOFF force field. As you mention, it has a major shortcoming which is that it's heavily conformer-dependent.

Option 2 could be triggered by specifying fractional_bondorder_method="AM1-Wiberg-ELF10", which would need to be recognized by at least one ToolkitWrapper in a ToolkitRegistry. Ultimately it would need to be resolved in something like this if statement in the ToolkitWrapper stack.

The important decisions that we'll need to make are:

  • If boltzmann-weighting, how do we calculate conformer energies?
  • Do we need to make any considerations about how performance will vary across cheminformatics toolkits/versions of OFFTK? If we substantially change the implementation of our ELF10 algorithm after releasing a FF, how can this be tracked? Will we provide "legacy support" for previous versions or just recommend that folks install the old toolkit version?

thinking what the 'recommended approach' should be (both for fitting and for the end user).

I would think that the first generation of fitting and application will be single-conformer, since we're aiming for an early-2021 release. Once implemented and used in a fit, we could include multi-conformer averaging in a production FF whenever the data shows it's ready.

It's also worth mentioning that graph-based charge and partial bond order assignment is on the distant horizon, eg from espaloma, and I don't know that we've made a decision on how those will get merged into the main line of FFs.

@ChayaSt
Copy link

ChayaSt commented Jan 5, 2021

@SimonBoothroyd and @j-wags, I'm just chiming in here to say that openeye used to store the ELF10 WBOs in the bond data. That is what was used in fragmenter. Right now, fragmenter will only work as intended with older versions of openeye.

@j-wags
Copy link
Member

j-wags commented Jan 6, 2021

Thanks, @ChayaSt. Do you know the last version of OpenEye Toolkits that supported ELF10 WBOs? In the short term we can update the fragmenter conda package to ensure that it gets paired with a compatible version of OpenEye toolkits.

@j-wags
Copy link
Member

j-wags commented Mar 5, 2021

@ChayaSt Bumping this topic again -- it'll be super helpful if you have a record of any versions which did/didn't have ELF10 WBO capabilities so we can pin a good version range.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants