Sage RC2 Benchmarking Protocol - Season 1
This document details the execution procedure for adding Sage RC2 (openff-2.0.0-rc.2
) optimizations to datasets produced with the Optimization Benchmarking Protocol - Season 1 .
All steps assume the deployed conda environment is active, e.g. with:
conda activate openff-benchmark-optimization
4. Optimization execution
Using your chosen approach for executing optimizations in Optimization Benchmarking Protocol - Season 1 | 4. Optimization execution , you will run MM optimizations using the 1:4
season specification.
Multi-node approaches
A. Multi-node, persistent server, long-lived manager
To submit and execute the MM stage using the QM stage data as input:
openff-benchmark optimize submit-molecules --fractal-uri "myserver.corp.local:7777" --dataset-name "Benchmark Optimization S1 - MM-SageRC2" --season "1:4" 4-compute-qm/b3lyp-d3bj/dzvp/
You can export the data from this dataset with:
openff-benchmark optimize export --fractal-uri "myserver.corp.local:7777" --dataset-name "Benchmark Optimization S1 - MM-SageRC2" -o 4-compute-mm
B. Multi-node, persistent server, short-lived managers
Same approach as (A) above.
C. Multi-node, no server/manager setup
If your cluster uses e.g. SLURM, create a submission script like the following, called submit_molecules.sh
:
For the MM stage, using 1 thread and 6GiB for each:
You could batch multiple files into each submission with e.g. xargs
:
D. Single-node, persistent server
Same approach as (A) above.
E. Single-node, no server/manager setup
To execute the MM stage using the QM stage data as input:
Because thread-parallelism may not scale as well as simply performing many optimizations at once with your machine, consider running multiple execute
commands at once, giving each a batch of files to work on. You can do this in one shot with xargs
. For example, for the MM step, we might choose 1 thread per command, and 8 commands: