Discovery

Workflow components

Each workflow component from the diagram above is numbered below.
Options for software components indicated for each.

Separability of required workflow components will allow for parallelism in development activity. The label on each workflow component indicate qualitative development required for each.

  1. Identifier assignment

  2. Conformer generation (~10 conformers per molecule)

  3. Parameterization of molecules

  4. FF coverage report

  5. Energy minimization with Psi4 (QM), OpenMM (MM)

  6. Analysis and report generation

Available software components for implementation

  1. QCSubmit

  2. QCFractal

  3. QCEngine {vital}

  4. GeomeTRIC {vital}

  5. benchmarkff

  6. openff-toolkit {vital}

  7. openmmforcefields

  8. openff-spellbook

Restricted components

  1. OpenEye Toolkit

Packaging Options

openff-benchmark

Library components and entry points can be placed in openff.benchmark.geometry_optimizations.

openff-cli

Could introduce an entrypoint in this package for distribution. (optional, and for later)

Proposal

Interface

Command-line interface executable from any shell preferable.

Identifier assignment

Conformer generation

Parameterization of molecules

Forcefield coverage report

Remaining questions

  1. Should reports be aggregated? How possible is it to back-calculate a molecular structure based on the parameters used to to parameterize it?

Energy minimization with Psi4 (QM), OpenMM (MM)

Proposing a three-pronged approach.

  1. High-throughput (primary)

  2. High-throughput debug approach (secondary)

  3. Fully-local execution (alternative)

In principle, (2) and (3) could be served via the same entrypoint.
(1) would make use of QCFractal with a persistent server to handle most of the compute orchestration.

Analysis and report generation