* Note: This post contains 'preliminary valence parameter fitting results’, which was carried out with currently available QM data from 2nd generation training data sets.
Description
This post contains benchmark of preliminary valence parameter fitting (v1.2.0-preliminary).
...
Fitting Data and Results
Fitting targets: 581 1-D torsion profiles; 2,974 optimized geometries; 278 vibrational frequencies
Input force field : same initial force field used in v1.1.0 fitting
The objective function decreased from
1.02809e+04
to3.21676e+03
in 28 steps.
...
X2 for primary(neighboring) set | X2 for full(diverse) set | |
---|---|---|
Initial force field | 1435 | 29,469 |
v1.0.0 | 948 | 20,672 |
v1.1.0 | 936 | 20,097 |
v1.2.0-preliminary | 766 | 16,939 |
To provide more intuitive insights on the benchmark results, we aggregated the resulting data and made the following plots.
...
y values in the plots(Δ WRMSE) are the difference in the WRMSE between different v1.2.0-pre and v1.1.0; Negative y value indicates better reproduction in v1.2.0-pre compared to v1.1.0. The average change in WRMSE is -1.248, indicating that overall the v1.2.0-pre better performs in reproducing QM optimized geometry than v1.1.0.
The major contribution of significant improvement in reproducing QM optimized geometry seems to be the inclusion of eMolecules discrepancy set( a set having geometries that are substantially different in smirnoff99Frosst
relative to the other force fields) in QM training set generation.
All geometries shown significant improvement with v1.2.0-pre( delta WRMSE < -0.25, blue-circled) are deprotonated phosphonates, RP(=O)(OH)(O^-). The input molecule sets(Roche set, Coverage set) used to generate the first generation optimization dataset for valence parameter fitting didn't have the phosphono group. And by using eMolecules discrepancy set during the second generation optimization dataset generation process, C=C(C(=O)O)OP(=O)(O)O
has been added to the new dataset, which enabled to properly fit the parameter related to phosphono group.
Here’s one example of the improved performance on phsphonates.
...
QM optimized geometry of ([P@@](=O)(O)[O-])[P@](=O)(O)[O-]
. ( transparent red: MM optimized geometry with v1.1.0 force field, transparent green: v1.2.0-pre force field)
In the optimized geometry from the v1.1.0 force field, the hydroxyl hydrogen is much closer to the negatively charged oxygen (1.10 Å) than in the QM optimized geometry (2.32 Å). This can be interpreted as forming an overly strong intramolecular hydrogen bond. The v1.2.0pre force field corrects this error.
2. Abinitio Targets
To investigate the improved performance of the new parameter set in reproducing QM relative energies between conformers, QM vs MM relative energies between conformers “at QM optimized geometries” were calculated.
...