| |
|---|
Update Dataset Tracking | Project Board; Slides |
QDS handling of non-QCSubmit dataset. | Scaffold Submission PR is completed and ready for review |
Clean benchmark releases | PR475 Industry Benchmark is almost ready to go but not all molecules are transferring to a new dataset… MLPepper Dataset is in process. Closes #465 #466 #473 Zenodo record for OpenFF ESP Fragment Conformers v1.0 is in progress singlepoint datasets aren’t covered in the existing ipynb for docker, and some datasets don’t follow standards with attributes. We have our notebook labeled as “v1.0” should I make a new version with: molecule final_molecule final_molecules[0] getattr(record, mol_attribute[ds_type]).extras[“canonical…“] and try:except with: getattr(record, mol_attribute[ds_type]).attributes[“canonical…“] should I make a “data_handling_singlepoint.ipynb” LW – Would one or the other be simpler for you? JC – The 1.0/1.1 thing would make the most sense for FF release work. But maybe I should make different notebooks for singlepoints, opts, or TDs. LW – Having a notebook for each dataset type sounds good to me.
There are no molecular statistics, do I need to generate those? LW: If it’s not any trouble, it would be good to include them JW – Might be good to set a threshold for like FTE-days (0.1?) LW – I’d just say go for it if it’s not much trouble/mostly just waiting.
|
MolSSI Info / Align Priorities on MolSSI Asks | No notes from June 24th meeting New from last QCAUM meeting: Communicated that adding the ability to copy records from a database on one server to a database on another server is not a priority for us, but Ben says it’s on his plan of work time horizon. We communicated that we would prefer to get the 5 TB copy, but not right now. Ben offered if we wanted to coordinate a backup that we have access to then that’s on the table.
Requests: |
Old Issue of the Week | Conformer generation should fall back to RDKit ETKDG on Omega failures John suggests that if Omega fails in generating initial conformers, RDKit should be the fallback. Should this be a QCSubmit ticket?
Bonus: Missing chemistry to (potentially) cover post-release-1 [#8]~[#35]: O-Br single bonds are present in GAFF2 but not present in our current datasets. We could port in a placeholder value from GAFF2, but there are no molecules with this chemistry in our current datasets.
[#7X3]~[#7X3]~([#8])~[#8]: Nitroamines
[#6:1]~[#6:2]=[#15:3]~[#6:4]: C=P double bond (potentially with adjacent singles)
|
| |