2021-01-25 Core Devs meeting
Roundtable updates
DH – Worked on benchmarking.
Push backt o benchmarking discussion afterewards)
DD – Something tricky with analysis commands?
JW – Match-minima?
SB –
Worked with MT on conda-forge migration. Realized that we have to move openmmforcefields as well. Updated to have correct namespace and fix CI. Turns out I also had to fix QCEngine.
Made new library called constructure –
Can be used for molecule generation, eg automated chemical space exploration.
DD – Would be cool to automatically find gaps in current sets/parameter coverage and send them into QCArchive.
MT –
Been working on conda-forge migration. SB and JRG helped a lot. OFF Toolkit, forcefields repos are on conda-forge.
Last phase of migration is tidying up downsteams (eg openmmforcefields, qcengine, forcebalance).
Updated lots of packages from Travis to GHA.
Hunting down loose ends in documentation – Things that would fly under the radar of a dedicated documentation writer.
Worked on System, worked out interactions with Vanderbilt. VU has been slow on deliverables/noncommital to project, so I’m tkaing more of a “take it or leave it approach”.
Made GMX topology writer
Will be working on a neutral Topology representation, that can either handle OFF-style “cheminformatics” topologies, or other ecosystems' atom-type-based topologies.
JW – This will be helpful for me to know how to do a OFF Toolkit Topology refactor
SB – Will be helpful for biopolymers, meeting scheduled for mid-Feb.
CD –
Continuing on toolkit equivalence testing. I’ve been loading the same molecules using the cheminformatics backends. Found issues regarding bond orders around aromatic rings. Differences in stereochemistry detection/assignment. OE promiscuously adding protons to sulfates and phosphates. Some differences about SMILES outputs.
DD –
Working on benchmarking. Did lots of iteration on the software, gathered documentation and instructions for kickoff meeting last Friday.
Will be having partners deploy QC* infrastructure on their computing resources, will send them ~600 molecule dataset to set stability/performance expectations.
Will be supporting deployment/science issues on the #benchmark-support channel.
Announcement and scientific discussion will take place on #benchmarks-partners channels.
Will be deploying new conda environments for our QCArchive workers.
Putting together new submission set for pepconf.
Worked on changing QC jobs to DLC instead of TRIC, and
reset=True
option. appears to result in much faster optimizations.
JW –
Worked on benchmarking.
Made single file installers, worked on hardening/refactoring validation step.
PB –
Worked on WBO fitting.
Tweaked forcebalance setup on UCI clusters. Now running it doesn’t make 40GB output.
Looked at how priors+other settings affect outputs. Worked on testing out some chagnes to QCEngine. (ran an errored pepconf job and reran with new settings – Still running, can report later)
This week I’ll be continuing WBO work and continuing preparing Genentech data.
JW – Is Genentech dataset the same as Swope’s benchmarking dataset?
(General) – No
SB – Are changes to QC input files, is it changes to code, or input files?
PB – input files
SB – Great. I’d like to make sure we’re not doing “production science” with locally-modified code.
PB – Are QCFractal task objects immutable? Can I change parameter values for coordsys from tric to dlc? I was unable to change it so I had to make a copy of the task.
DD – This is where pydantic can make thigs difficult. I’d call
task.dict
and then modify the dict and pull in back in with something that constructs from dict.PB – Is there a friendlier way to get the output?
DD – I’d use the package “JQ” – It lets you treat json like a dict, and allows grep/awk-style modification to JSON.
Break for 5 minutes – Return for sprint planning at 25 past the hour