Organometallic Tasks - Draft
"Include a small subset (~100 molecules) of small molecules relevant to OpenFF (for distinguishing levels of theory etc below)"
Need variety of metal centers and organic moieties to achieve this better to have large swath at low level of theory
Decide on Level of Theory
"Include a small subset (~100 molecules) of small molecules relevant to OpenFF (for distinguishing levels of theory etc below)"
Search through existing databases
Use Gemmi to handle dataset
figure to out what's going on with our PDB dataset, all issues are SCF convergence
Utilize this existing dataset to begin testing out NNP strategy regarding encoding spin state all spin states = 0.
Currently being worked with by Chodera lab, should get into QCArchive at some level
might need to discuss release of subset as open data
Look for structures neglected by tmQM of interest?
Simple augmentations [start working on in the background? Or clean existing and "batch" produce]
Switching within column is almost always okay.
[Will need in house tmQM for this]
Consider RDKit has "replacesubstructs" method
Conformal search:
CMILES Issue: Organometallics are difficult, not supported
SMILES All Around: Structure to SMILES conversion for Transition Metal Complexes
- not sure that's necessary because QCArchive has a hash to represent the molecules.
- Don't have to use CMILES as the name (i.e., index), can be arbitrary.
- QCArchive doesn't need CMILES, Prepare to pair program with JClark on cif --> QCA pipeline by bypassing openff molecule structures that are dependent on CMILES and directly compare cif files to QCArchive
- QCArchive has a metadata file field to add such information. Also openff
Molecule.conformer
is an attribute containing pint.Quality
class with coordinates and units. However, the list seems to usually only contain the coordinates of the molecule of main molecule.Computed/stored properties
energies
forces
other properties
atomic spin density
partial charges (multiple methods)
Dipole moment / polarizability
orbital energies (+/- 5 molecular orbitals around highest occupied molecular orbital)
Allow us to see electronic structure of complexes
relative contributions of each atom to each orbital
coefficients? - too large!
This level of theory was used to compute the following properties: electronic and dispersion energies, HOMO and LUMO energies, HOMO/LUMO gap, dipole moment, and metal center charge, which was derived from NBO