2022-06-07 QC meeting notes

Participants

@Jeffrey Wagner
@Pavan Behara
@David Dotson
@Chapin Cavender
BenPritchard

Discussion topics

Item	Notes

Item	Notes
Updates from MolSSI	BP: No major updates, working on documentation. Server is going good. DD – Last week I worked with Lorenzo to update the instructions for accessing the old industry datasets. This is using the test server, so that’s a vote of confidence! On the QCSubmit side, we want to use some test fixtures in QCFractal. Is that allowed/recommended? BP – Yes. The function signatures may change a little bit but the functionality will remain.
Infrastructure advances	DD – Now have MPI cluster working at full capacity. It took me a while to figure out how to set up jobs well - network file system isn’t very fast and there isn’t a ton of scratch space. So now I copy conda env to local disk, which takes about 15 min but really speeds up execution. I’m also reducing the frequency at which I dispatch these jobs so that we don’t cause trouble in the shared NFS.
Throughput status	New OpenFF sets from Jessica: OpenFF multiplicity correction optimization set v1.0 - 400 opts, complete. OpenFF multiplicity correction torsion drive data v1.0 - 92/99 TDs done, remaining persistent errors. PB will check with JM about next steps for this dataset. New OpenFF sets from Chapin: OpenFF Protein Capped 1-mers 3-mers Optimization Dataset v1.0 - 753/759 opts complete. OpenFF Protein Capped 3-mer Backbones v1.0 - 0/54 TDs complete. 19493 opts done. SPICE sets: around 25K calcs last week SPICE PubChem Set 4 Single Points Dataset v1.2: From 34 remaining to 26, most are persistent errors. SPICE PubChem Set 5 Single Points Dataset v1.2: 80892 from 55874, around 42K more remaining.
Science support needs	PB – Can we use intel MKL? PB – For QC workers? DD – I don’t see a legal problem with this. Will XTB work correctly with intel MKL? PB – I think so. See JW – It’s noteworthy that here even running with intel MKL still scales really poorly - Adding 8 or 16 cores only improves runtime by a factor of 2. DD – Is anyone familiar with the syntax in that discussion like `conda create --name xtb-mkl xtb "libblas==mkl"`? (General) - No DD – Do we have big XTB datasets that will need XTB workers? PB – Not now DD – Let’s punt on this until then. JW – For Bespokefit? DD – PB, could you ask TG to spin up workers tageting the `openff-tscc` tag? PB – Yes, I think he’s doing that now. 26 workers with 8 cores and 30GB each. I’m running 20 workers now from my user account. DD – Thanks for the update. Could I ask you to update the currently running info in qcfractal-compute?

Meetings

2022-06-07 QC meeting notes

Participants

Discussion topics

Action items

Decisions