2021-01-25 Core Devs meeting

Roundtable updates

DH – Worked on benchmarking.
- Push backt o benchmarking discussion afterewards)
- DD – Something tricky with analysis commands?
- JW – Match-minima?
SB –
- Worked with MT on conda-forge migration. Realized that we have to move openmmforcefields as well. Updated to have correct namespace and fix CI. Turns out I also had to fix QCEngine.
- Made new library called constructure –
  - Can be used for molecule generation, eg automated chemical space exploration.
  - DD – Would be cool to automatically find gaps in current sets/parameter coverage and send them into QCArchive.
MT –
- Been working on conda-forge migration. SB and JRG helped a lot. OFF Toolkit, forcefields repos are on conda-forge.
- Last phase of migration is tidying up downsteams (eg openmmforcefields, qcengine, forcebalance).
- Updated lots of packages from Travis to GHA.
- Hunting down loose ends in documentation – Things that would fly under the radar of a dedicated documentation writer.
- Worked on System, worked out interactions with Vanderbilt. VU has been slow on deliverables/noncommital to project, so I’m tkaing more of a “take it or leave it approach”.
- Made GMX topology writer
- Will be working on a neutral Topology representation, that can either handle OFF-style “cheminformatics” topologies, or other ecosystems' atom-type-based topologies.
  - JW – This will be helpful for me to know how to do a OFF Toolkit Topology refactor
  - SB – Will be helpful for biopolymers, meeting scheduled for mid-Feb.
CD –
- Continuing on toolkit equivalence testing. I’ve been loading the same molecules using the cheminformatics backends. Found issues regarding bond orders around aromatic rings. Differences in stereochemistry detection/assignment. OE promiscuously adding protons to sulfates and phosphates. Some differences about SMILES outputs.
DD –
- Working on benchmarking. Did lots of iteration on the software, gathered documentation and instructions for kickoff meeting last Friday.
- Will be having partners deploy QC* infrastructure on their computing resources, will send them ~600 molecule dataset to set stability/performance expectations.
- Will be supporting deployment/science issues on the #benchmark-support channel.
- Announcement and scientific discussion will take place on #benchmarks-partners channels.
- Will be deploying new conda environments for our QCArchive workers.
- Putting together new submission set for pepconf.
- Worked on changing QC jobs to DLC instead of TRIC, and reset=True option. appears to result in much faster optimizations.
JW –
- Worked on benchmarking.
- Made single file installers, worked on hardening/refactoring validation step.
PB –
- Worked on WBO fitting.
- Tweaked forcebalance setup on UCI clusters. Now running it doesn’t make 40GB output.
- Looked at how priors+other settings affect outputs. Worked on testing out some chagnes to QCEngine. (ran an errored pepconf job and reran with new settings – Still running, can report later)
- This week I’ll be continuing WBO work and continuing preparing Genentech data.
  - JW – Is Genentech dataset the same as Swope’s benchmarking dataset?
  - (General) – No
- SB – Are changes to QC input files, is it changes to code, or input files?
  - PB – input files
  - SB – Great. I’d like to make sure we’re not doing “production science” with locally-modified code.
- PB – Are QCFractal task objects immutable? Can I change parameter values for coordsys from tric to dlc? I was unable to change it so I had to make a copy of the task.
  - DD – This is where pydantic can make thigs difficult. I’d call task.dict and then modify the dict and pull in back in with something that constructs from dict.
  - PB – Is there a friendlier way to get the output?
  - DD – I’d use the package “JQ” – It lets you treat json like a dict, and allows grep/awk-style modification to JSON.
- Break for 5 minutes – Return for sprint planning at 25 past the hour