Todos from last time | Connor will see if there’s a way to expand RDKit conformer generation energy window to allow better candidate ELF conformers to be generated. CD – in 100% of cases, RDKIt and OE select the same conformer as being ELF1 from an OE-generated conformer ensemble CD – in ~95% of cases, RDKIt and OE select the same conformer as being ELF from an RDKit-generated conformer ensemble CD – (Showing notebook) CD – The only thing you have to do to get RDKit to generate way more conformers is to set the min RMSD to 0, and use max_conformers to control the number of outputs instead. JW – Do we know whether these output conformers are substantially distinct? CD – I suspect they are but haven’t verified. What you could do is generate more conformers than you want, and then prune them by diversity. LW – I know that RDKit generates a alrge umber of conformers internally and then prunes them, possibly using the provided RMSD, util the desired number of conformers is achieved. LW – Can we set an energy window in RDKit? JW – I think we’d looked into this a little last time, and I don’t think we found a way to set this. LW – I have some code that prunes conformers by an energy window that I specify (calculating energies using MMFF)
CD – Shows plot of ELF1 comparison now that RDKit is generating way more conformers
Image ModifiedJW – I look at the bottom plot in terms of “should we throw out the current random conformer selection and implement ELF1 immediately?”, and the data is making me think “no” Re: plotting for information density: seaborn jointplot can make a scatter + histograms from a pandas dataframe https://seaborn.pydata.org/generated/seaborn.jointplot.html LW – What’s up with the extreme outliers in the ELF1 plot? Eg, one atom was given 0.3 in OE but -0.6 in RDKit. JW – Maybe a chemical perception difference? I remember that there were certain points on the “same conformer computer with no geometry optimization in both engines” where most of the results were identical, but some molecules had slightly different outputs. CD – I don’t think that explains it.
Connor will evaluate whether OE vs. antechamber charge differences correlate with RMSD CD – I’m surprised that charge RMSE doesn’t correlate with geometric RMSD. LW – Which RMSD method? CD – RDKit.AllChem.GetConformerRMS with all default settings. LW – Should use GetBestRMS instead, since that will check over symmetries
LW – Symmetrization problem? Analagously to how we do symeetry-aware RMSD, should we do symmetry-aware partial charge RMSE? JW – Charge engines should already be symmetry-aware, so there shouldn’t be a need for multiple atom mappings. But some settings will disable this (liek openeyes nosymstopt ) LW – Are WBOs symmetrized by charge engines?
|
Next steps
| Why aren’t ELF1 charges agreeing? Are we doing valid RMSD calculations? How do we get RDKit to make the same conformers as OE? EmbedMultipleConfs' useRandomCoords kwarg?
Find a way to correct for heavy vs all atom RMSD settings (OE defaults to all, rdkit defaults to heavy)
|