Drivers

Supervisor

Contributors

Additional Stakeholders

Simon Boothroyd

Owen Madin

Michael Gilson

Michael Shirts

Michael Schauperl (Deactivated)

David Mobley

Christopher Bayly

Objective

Develop a study to begin investigate which strategy to take when co-optimizing the non-bonded VdW and electrostatic force field parameters.

Timeline

The planning for this study (and especially which infrastructure is missing) will begin immediately, while the actual study will not be undertaken until the Binary Mixture Data Feasibility Study has been completed.

Problem Statement

We know that the electrostatic and VdW force field parameters are strongly coupled - only optimising the VdW parameters will likely only improve things so much, while also running the risk of encoded electrostatic phenomena into the VdW parameters. To produce the largest improvement / impact, the two should likely be co-optimised.

There are several approaches in which to refit the electrostatics -

Possible Approaches

Charge Templating

This refers to an approach where charges are assigned based on functional groups, which could be defined with SMIRKS strings. For example, every ether group looking like C-O-C might get charges 0.1/-0.2/0.1, and so one. Thus, one would look for groupings of atoms that fit a template and then assign those templated charges. This is how Momany-Rone charges worked in the commercial version of CHARMm in Quanta. On reflection, however, I think this may add complexity without much benefit.

BCC Refit

BCC Refit + dimensional reduction

From MKG: “Here is a suggestion for a relatively straightforward study. Feedback invited!

Maybe the lowest-hanging fruit would be to simultaneously optimize LJ parameters with the H2CON and H2CO3N typing models, along simultaneously with a similarly compact set of BCCs used in conjunction with regular AM1 charges or else a better but still fast QM alternative. The simplest BCC sets would have a single BCC type for each LJ-type pair. Thus, for H2CON, with five LJ types, we’d have 4+3+2+1= 10 BCCs (one for each pair); and for H2CO3N, with seven LJ types, we’d have 6+5+4+3+2+1 BCCs. If these BCC typings seemed insufficiently fine-grained, we could do something like what Michael Schauperl did in his LJ paper, allowing BCCs for various simple typing models (which could be decoupled from the LJ typing.) For example, we might have the H2CON LJ types, but H2CO3N2 BCCs. The whole thing could start simple and probe out to more complex schemes. By keeping GAFF v?.? and OpenFF v?.? as baselines, we could quickly get a sense for whether a given model is deficient vs promising.

Training Set Composition

Any optimisations will be done upon small training / test sets of C, O, H and limited functionalities to keep the scope of the study tight and attainable within a reasonable time frame. Limiting the number of functionalities should reduce the difficulty of the optimisations due to the reduction in the number of parameters (DoF) which need to be optimised.

Lorem ipsum.

Parameters to Retrain

Required Infrastructure / Software

Requirement

Description

Status

Driver

References