| |
---|
Updates from team | JW: Conformer generation prototyped; can demo it working with the dataset from Thomas Fox as a test case. The dataset does not go in smoothly: of 600 molecules, 8 have stereochemistry issues; RDKit is a bit sloppy with stereochemistry
DD: Gave update on one-shot optimization usage for some, perhaps many, partners
JH: Deployment procedure document; gathering information from PRs as they develop
DH: Pushed analysis components; would like consolidation on read/write functions, seasons
For each method, a separate SDF with exactly the same molecules; have to be able to relate molecules of the same ID across results
Use pandas DataFrames primarily
For CLI, separate analysis from plots/report generation
Could still output SDFs with metrics relative to QM included?
JW: may not want to do this, since it depends on another SDF somewhere in the results
DD: let’s give it a play and see if there is a way to make this less confusing in the output
Still need unit conversion; dependent on units being present in input SDFs to this component
DD: I’ll review today, push to merge
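The requirement to relate molecules of the same ID across per-method results can be sketched with a pandas merge. This is only an illustration: the column names `mol_id` and `energy`, the helper `relative_to_reference`, and all values are hypothetical, and units are deliberately left unspecified since unit conversion is still open.

```python
import pandas as pd

# Hypothetical per-method tables: one row per molecule, keyed by a shared ID.
# In practice these would be read from the per-method SDFs.
qm = pd.DataFrame({"mol_id": ["m1", "m2", "m3"], "energy": [0.0, 1.2, 3.4]})
mm = pd.DataFrame({"mol_id": ["m1", "m2", "m3"], "energy": [0.1, 1.0, 3.9]})


def relative_to_reference(method: pd.DataFrame, reference: pd.DataFrame) -> pd.DataFrame:
    """Join two result tables on molecule ID and compute method minus reference."""
    merged = method.merge(
        reference,
        on="mol_id",
        suffixes=("_method", "_ref"),
        validate="one_to_one",  # fail loudly if an ID appears twice in either table
    )
    merged["delta_energy"] = merged["energy_method"] - merged["energy_ref"]
    return merged[["mol_id", "delta_energy"]]


result = relative_to_reference(mm, qm)
print(result)
```

Keeping the relative metric in a DataFrame rather than writing it back into an SDF avoids the dependency JW raised, since the reference table is explicit in the call.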
|
Dataset object? | Do we want to consolidate on a Dataset object (bundle of OpenFF mols pulled from SDFs, exports SDFs)?
For now, we’ll proceed with merging each PR, then search for places where we can consolidate read/write and handling of OFF mols
|
How to handle errored cases? | JW: 2 kinds: Could add to e.g. the validate command:
@click.option('--error-directory', default='1-errors')
@click.option('--error-out', default='1-errors.out')
DH: what do we do with “undefined stereochemistry”?
JW: these are opinionated parts of the toolkit, which may change/improve, but those improvements are tied to the release cycle of the toolkit, so a bit slower; will just have to use this experience to spin out issues for improvement
[decision] since warnings are loud, we’ll squelch warnings; errors still get raised
can make clear that in the validation step, some percentage are expected to be excluded (>1%)
the minimization step may also have some percentage of failures (>5%)
could also say that overall, up to 10% of your dataset may not make it end-to-end
[decision]: we’ll make a Slack channel for support; allows us to operate with low friction and loop in folks as needed for help understanding weird cases
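A rough sketch of how the error-handling decisions above might fit together, using only the standard library. Everything here is an assumption for illustration: `validate_all` and `toy_validator` are hypothetical names, the `error_directory` and `error_out` parameters simply mirror the proposed `--error-directory` and `--error-out` options, and the payloads are plain strings standing in for SDF records. In this sketch an error is caught per molecule and recorded so the run can continue; whether it should instead propagate is a separate choice.

```python
import tempfile
import warnings
from pathlib import Path


def validate_all(molecules, validate_one, error_directory, error_out):
    """Validate (mol_id, payload) pairs; divert failures to the error outputs."""
    error_dir = Path(error_directory)
    error_dir.mkdir(parents=True, exist_ok=True)
    passed, failed = [], []
    with open(error_out, "w") as log:
        for mol_id, payload in molecules:
            try:
                with warnings.catch_warnings():
                    warnings.simplefilter("ignore")  # squelch warnings only
                    validate_one(payload)
            except Exception as exc:  # errors are not silenced, they are recorded
                (error_dir / f"{mol_id}.txt").write_text(payload)
                log.write(f"{mol_id}: {exc}\n")
                failed.append(mol_id)
            else:
                passed.append(mol_id)
    return passed, failed


def toy_validator(payload):
    # Stand-in check, not a toolkit call.
    warnings.warn("loud but harmless")
    if "undefined stereo" in payload:
        raise ValueError("undefined stereochemistry")


with tempfile.TemporaryDirectory() as tmp:
    passed, failed = validate_all(
        [("mol-0", "ok"), ("mol-1", "undefined stereo")],
        toy_validator,
        error_directory=Path(tmp) / "1-errors",
        error_out=Path(tmp) / "1-errors.out",
    )
    error_log = (Path(tmp) / "1-errors.out").read_text()
```

Returning the passed and failed IDs makes it straightforward to report what fraction of a dataset was excluded at this step.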
|
Structures from the PDB | |
Basis set choice | |