Skip to end of metadata
Go to start of metadata
You are viewing an old version of this page. View the current version.
Compare with Current
View Page History
« Previous
Version 3
Next »
Participants
Goals
Discussion topics
Item | Presenter | Notes |
---|
Updates from MolSSI | Ben | |
User issues, new submissions
|
| PB: #220 - need new qcsubmit JH: just have one PR on qcsubmit blocking release; working on this PB: have one compute spec on this submission DF-CCSD(T)/CBS that will take up to 150 GiB of memory for 16 heavy atoms PB: typically also use 48 cores
JH: working on #223, blocked by validation issues as well JH: ML stuff, adding HDF5 support for QCSubmit instead of a ton of SDFs, can use one file JW: what are the contents JH: conformers and mapped SMILES JW: is this file going to contain the same content as the other files, or is there something fundamentally different here? one thing that makes SDF safer is that readers and writers are not something we’re defining JH: there’s a lot of repeated info in the SDF JW: good point JH: understand concerns on future variability; would like to get a spec down as much as possible
JH: any feedback anyone has on this issue (
) appreciated
DD: concerned about collection size; will run into same issue as before SB+JH: not clear if it’s a single collection with a million conformers, or spread across several collections, or multiple million conformer collections BP: the metadata object for a collection gets very big as more and more objects are involved (molecules, specs), so this becomes an issue in the way collections are currently implemented SB: can see this taking another month for John and Peter to resolve; what is the timeline for next branch deployment? JW: DD, would you be willing to jump onto next OpenMM call to lay out constraints?
JH: is a test submission still in play? SB: Chapin’s dataset; what’s the status? DD: worked with him to set up manager on UCSD resources; can switch on and off at will; waiting for word on new submission status SB: think there may still be some ambiguity on what data, how it will be different from the Cerutti sets; will coordinate with Chapin and see where we’re at
|
Science support needs | | |
Infrastructure needs | | |
Action items
- David Dotson will turn managers back on for industry datasets, put them on their own compute tag to monitor behavior and progress
- Joshua Horton will cut a new release of
openff-qcsubmit
to unblock new users submissions - David Dotson will spin up workers specifically for Pavan’s high memory spec, Josh’s ANI submission once submitted
- David Dotson will chime on
openmm/qmdataset
on limits of large collection submissions for current QCArchive; get a sense for the numbers of entities involved and assess if this presents problems for collection metadata - Simon Boothroyd will push for a test submission from John/Peter for
openmm/qmdataset
to assess scientific value before pursuing larger sets - Ben Pritchard will prioritize Collection and
next
branch development on QCFractal; aiming tentatively for end-of-year deployment (cannot guarantee) - Simon Boothroyd will follow up with Chapin Cavender on status of dipeptide dataset, identify ambiguities and resolve if possible
- Ben Pritchard will include a fix for submission task duplication and slow queries in upcoming Fractal release and deployment/migration
- Joshua Horton will follow up with Pavan Behara on long form / keyword support in
openff-qcsubmi
for psi4
specs that include basis=None
Decisions
0 Comments