...
Where do CMILES go?
Are they attached to molecule records, or somewhat less coupled?
Are molecules in a new dataset submission accepted, and are the CMILES retained, if the molecules already exist in QCA?
Roadmap refresh: QCArchive
Trawl automation issues for additional functionality:
Github link macro link https://github.com/openforcefield/qca-dataset-submission/issues/53 Github link macro link https://github.com/openforcefield/openforcefield/issues/516
Review previous tasks, clear where possible.
What gaps in our current pipeline do we want to address next?
Submission is getting easier, but what about end-user use for:
fitting
benchmarking
scientific investigations
New datasets
First direct submission from David Cerutti
Datasets being processed
Rowley Biaryl appears stuck
ESP WFn storage
There is currently no way to specify the wavefunction protocol in basic datasets, we can do this manually or should we get this in fractal?
...
Item | Presenter | Notes |
---|---|---|
CMILES - where do they go? | Josh |
|
ESP Wfn storage | Josh |
|
Roadmap refresh | David | Reviewed roadmap refreshed by David and Jeff Wagner; changes and clarifications welcome at any time |
David Cerutti submission | David | David: I will push for creating the PR, DC is creating an SDF writer so that we can run the fragments through the toolkit to generate e.g. CMILES
|
Rowley Biaryl INCOMPLETE | David | David: we have 1 INCOMPLETE Optimization that appears to be stuck; looks visually like cases we’ve seen on older datasets, but shouldn’t be due to a schema change
A solution to this is adding task regeneration for any INCOMPLETE procedures with no attached task when attempting a |
Philosophical discussion | Ben | Ben: Two pillars at play in QCArchive:
These two pillars create a problem where when you really want to re-run a calculation on the same molecule, you can’t.
Considering changing these pillars, e.g. making deduplication opt-in instead of automatic and forced Immutability: calculations should probably remain immutable, questionable on other things (extras) Ben plans to start soliciting feedback from users on what their pain points are so these can be included in any major refactor |
Action items
- Joshua Horton will create a way to add the wavefunction protocol to the Dataset class so that only wavefunctions for the final configuration is stored
- Joshua Horton will add CMILES to the Molecule objects we are submitting to QCArchive in addition to the entries
- Ben Pritchard will gather pain points and needs from current QCArchive users to fold into refactor discussions
- David Dotson will reach out to David Cerutti to get PR up for his first direct submission
...