Compute management | |
Update dataset tracking | https://github.com/orgs/openforcefield/projects/2/views/1 Update on in-flight sets by their compute owners BW – LipidMAPS Running slowly, would help to split it up, difficult to find the right size for the workers due to range of molecular size JW: Problem where errored molecules get added to the top of the stack to be re-submitted, but those errors were all OOM errors, so no small jobs could even start LW – Could we chunk this out now? JW: Seems to be working ok for now since these large jobs all got added to the front of the queue so the utilization is standardized now (maybe temporarily) BW – I’m ok keeping this one the way it is. It has stratified already so we’re handling the biggest jobs now. But for future sets I’m a fan of splitting. JW – Blocker now is some NRP-specific stuff, not having to do with dataset size binning
MLPepper BW – We don’t have workers going on this yet. I’ve been talking with CAdams, some confusion about what changed in the environment when installing offtk that made the QC jobs work. CAdams said thatr qcfractalcompute and qcportal DOWNgraded (from 0.56 to 0.54.1) when he installed toolkit, and this made jobs work. I’ll bring this up in MolSSI meeting. LW – Did other packages change (like numpy or pint?) pint went from 0.23 (works) to 0.24 (broken) numpy went from 1.26 to 2.1.3 (numpy 1 → 2 was a big technical change in some ways – can’t recall if we would expect it to affect defaults for e.g. allclose) dftd3 1.1 → 1.2?
BW – I haven’t done a complete diff of the envs that CAdams posted, so could be something else.
Remove tracking label from datasets that are done being computed Add unmerged dataset PRs to backlog/queued for submission. JW will remove all dataset cards from the project board except for acting/backlogged datasets, and will remove archivedcomplete column
|
Discussion of OpenFF QCA dataset standards
| |