Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

Here we compare the benchmarks of Sage 2.1.0 (unconstrained) between YAMMBS and the original benchmarking script used during the development of Sage 2.1.0.

The original benchmarks of Sage 2.1.0 were taken from here. These benchmarks use a geometry optimization convergence threshold of 5e-9 kJ/mol/nm^2. ddE is defined with respect to the lowest energy QM conformer, and ddE’s between the lowest energy conformers are excluded as they are definitionally zero.

The YAMMBS benchmarks were calculated as follows. The MoleculeStore object was generated with this commit of the YAMMBS package, which optimizes the molecules using a convergence threshold of 5e-9 kJ/mol/nm^2. The ddE’s, TFD’s, and ICRMSD’s were calculated using this commit of the YAMMBS package, which has revised the ddE definition to be consistent with the old benchmarking script (defined with respect to the lowest energy QM conformer, and ddE’s between the lowest energy conformers are excluded) and uses mapped SMILES to create molecules for TFD analysis. The RMSD’s have been calculated using the same commit as the ddE and TFD, but with a modified RMSD function that uses all atoms to calculate the RMSD, in order to be consistent with the RMSD definition in the old benchmarks. YAMMBS typically uses a heavy-atom-only RMSD, which is much faster to compute.

DDE

The overall DDE’s are nearly identical between YAMMBS and the old benchmark (“og” in the figures).

dde_ecdf.jpgdde.jpgdde_zoom.jpg

However, examining individual records reveals a number of cases where the ddE’s are very different. There are 564 records with the ddE difference > 5 kcal/mol

dde_dif.jpg

RMSD

When comparing the all-atom RMSD’s between the two codes, the overall RMSD benchmarks are nearly identical.

rmsd_bothallatom_ecdf.jpgrmsd_bothallatom.jpgrmsd_bothallatom_zoom.jpg

However, there are still large deviations when examining individual records. There are 222 records with the difference in RMSD > 0.4 A.

rmsd_aa_dif.jpg

TFD

The overall TFD metric is nearly the same between the two sets of scripts.

tfd_smiles_ecdf.jpgtfd_smiles.jpgtfd_smiles_zoom.jpg

However, there are a few records with large deviations in TFD between scripts. There are 56 records with a TFD difference > 0.1.

tfd_smiles_dif.jpg

ICRMSD

Bonds

bondrmse_smiles_ecdf.jpgbondrmse_smiles.jpgbondrmse_smiles_zoom.jpg

Actually, the bond RMSE is pretty consistent on an individual-record basis, with the exception of the one outlier molecule identified above.

bondrmse_smiles_dif.jpg

Angles

anglermse_smiles_ecdf.jpganglermse_smiles.jpganglermse_smiles_zoom.jpg

The record-by-record comparison for angle RMSE is also relatively good, though 82 records have an angle RMSE difference larger than 1 degree.

anglermse_smiles_dif.jpg

Dihedrals

dihedralrmse_smiles_ecdf.jpgdihedralrmse_smiles.jpgdihedralrmse_smiles_zoom.jpg

The dihedral angles show large differences when comparing individual records. There are 239 records with a difference in dihedral angle RMSE of > 10 deg

dihedralrmse_smiles_dif.jpg

Impropers

The overall behavior of the impropers is concerning, with yammbs showing significantly worse results. I couldn’t find a reason in the code for this.

improperrmse_smiles_ecdf.jpgimproperrmse_smiles.jpgimproperrmse_smiles_zoom.jpg

Same situation for impropers; 332 records have a difference of > 10 deg.

improperrmse_smiles_dif.jpg

Comparing individual records with large deviations

As seen above, many individual records have large deviations between the two codes for each metric. One might expect if a given record has bad agreement in RMSD, it would also have bad agreement in other metrics, and vice versa. However, there isn’t a lot of overlap between the records that have large deviations in different metrics. Below are the overlaps between different sets. For example, “overlap DDE/RMSD” is reporting the number of records that both have a large discrepency in DDE and a large discrepency in RMSD between the two codes.

Overlap DDE/RMSD 36
Overlap DDE/TFD 10
Overlap RMSD/TFD 29

Overlap DDE/bond 0
Overlap DDE/angle 0
Overlap DDE/proper 44
Overlap DDE/improper 4

Overlap RMSD/bond 0
Overlap RMSD/angle 1
Overlap RMSD/proper 106
Overlap RMSD/improper 5

Overlap TFD/bond 0
Overlap TFD/angle 0
Overlap TFD/proper 22
Overlap TFD/improper 2
  • No labels