MT – This is done, but not in the sense that it’s polished. There’s some filtering/error handling that’s in the old code that isn’t in the new. In the last step there are 100 plots that are generated (split out for different functional groups).
LW – It’d be great to help get the new hires running the QM benchmarks running regularly.
MT – Big blocker there is pulling down dataset from QCA - There are lots of issues with QCA datasets.
JW – use JSON blob and/or SDF in meantime
MT – I’m very unhappy about state of QCA/QC datasets.
LW – Starting from serialized inputs could be good if QCA continues being unreliable. Lexie and Brent have learned that using locally-saved files from QCA is more straightforward process than accessing QCF live.
MT – Doesn’t this conflict with the ethos that things need to be runnable by other people?
LW – Since we’re making this modular means that this will actually be good for scientists.
MT – I wonder if there’s an early starting point where there needs to be a structure for the data - Like QCA → (input state), on-disk → (input state), etc.
LW – Was hoping that this could be the same model as QCSubmit datasets - Basically a format defined by downloading from QCArchive.
MT – QCSubmit models are probably a good place to start for dataset storage.
(..JW walks through code changes, whether QCF datasets could replace QCS datasets…)
MT – Don’t get rid of QCSubmit datasets - QCF datasets don’t store OFFMols, they only store QCEl Molecules which aren’t suitable for our purposes.