2022-07-25 Core Developers meeting notes

Participants

  • @Diego Nolasco (Deactivated)

  • @Matt Thompson

  • @Pavan Behara

  • @David Dotson

  • @Jeffrey Wagner

Discussion topics

Item

Notes

Item

Notes

General updates

  • DD offline Thurs - Thurs

    • DD – I can run F@H call on Tuesday morning, but will be between two trips so unavailable before/after

  • DN – Feeling unwell, family is sick.

    • JW – We don’t need anything from you in the lead team meeting today. Feel free to call out.

  •  

Individual updates

  • DN

    • We delivered the annual report to industry partners! I’m really appreciative for the help that everyone gave when pulling this information together.

    • This week, I’ll be updating the website. Previously I was making updates to align our public-facing material with the new annual report. I’d love ideas about how to improve the website

      • JW – Blog posts/ad board meeting minutes are kinda hard to find. If we bring in a web developer to refresh the website that could be a good thing to restructure

    • JSetiadi is working on his blog post, I expect to review that in the next month.

  • MT

    • Likely offline Thursday

    • OpenMM import - Working on this for a few reasons - No RC feedback so far. Also this is a highly requested feature - This will be an important structural component for other importers (AMBER, GROMACS, CHARMM, probably others). So the exciting thing is that this could enable usage of externally-parameterized components in OpenFF workflows. Though I expect this to be a lot of work, so I’ll be on this for a few weeks.

      • JW –

    • Pymbar release with breaking changes, minimal impact so far but tag me if any issues arise.

      • Semi-related, our stack works on Python 3.10 except for Yank (needed for Evaluator), which they are currently working on.

    • Progress on EPs based on OpenFE feedback. Currently stalled, however, and I don’t see it as my responsibility to un-stall it.

    • Virtual site position math stalling, could use 1-2 hours of pair programming with somebody. Maybe I’m setting particle positions incorrectly?

      • JW – I’d love to work on this. Let’s do this at our working session wednesday.

      • DN – I’m happy that this is progressing - The ad board really wants vsites

      • JW – Two things to keep in mind

        • We’re talking about the infrastructure to apply vsites, there’s also the issue of having infrastructure for a force field trained with vsites, which is the second half f what the ad board wants, and now something we’re guarnateeing

        • The OpenFE infrastructure doesn’t support vsites, and won’t support them in the near future. So we won’t be able to run PL benchmarks on Rosemary if it has vsites.

      • DN – To clarify, our plans haven’t changed for Rosemary - It won’t have vsites, even though everyone keeps asking.

      • DD – From my perspective, I think we’d just need to add vsite handling to the atom mapper. Then everything might just fall into place.

    • Otherwise quiet week and/or forgot what happened

  • DD

    • Protein-ligand benchmarks

      • Early last week made decisions on specific technologies to use for each component of fah-alchemy

      • Spent a lot of time on the tail end of the week working on gufe

        • felt a bit stuck on our approach to serialization in gufe#30, trying to build on Swenson's approach while taking into account Richard and my perspectives on it

        • met with OpenFE folks and Swenson on Friday for long working session to determine path forward, find agreement; executing this week

        • we want to take the same approach if possible in gufe#36, giving us a consistent approach to serialization / stubbing across all gufe data model objects

        • for fah-alchemy, this is less important since I'm planning to use a graph database for storage/serialization of AlchemicalNetwork data

      • Going to be a hard slog through August to meet September deadline

        • JW – OpenFF deadline is flexible. Also, we have Nolasco and Chodera involved in these deliverables, who should be able to gain cooperation from other contributors if things get stuck in discussion.

          • DN – JC isn’t on gov board of OpenFE, he’s on the TAC.

        • DN + JW – And you’re doing a great job working from a super difficult position.

      • Currently stalled on protein-ligand-benchmark#52; Irfan is working as evaluator for John, Melissa to get this to finish

    • QCArchive

      • PRP chokes on new RNA set (qca-dataset-submission#301); currently running through Lilac only due to high memory. Default basis set evaluations worked just fine, but there are other jobs with a more detialed basis that encounter major memory issues.

        • JW –

        • PB – Thanks for looking into this. I’ll run a test manager at UCI and see if UCI can complete jobs, and how many resources it’s taking.

          • DD – Thanks! Expecially interested to see if it respects the cores_per_worker arg.

        • PB – I’ve seen the CPU oversubscription issue before. In general, if you’re running on a node and you queue 10/40 cores, it still considers the full 40 to be available.

        • PB + JW – We’ve seen CPU oversubscription issues for QCEngine/psi4/xtb before but can’t remember details.

        • DD – I don’t see this on all the managers, maybe it’s only a specific circumstance where this happens.

  • PB

    • Mostly fitting related work

      • helping JM setup her FB runs for torsion multiplicity work, debugging failures

      • dug a bit into creep in some angle parameters over generations of force fields and my hypothesis is ignoring Gen1 sets completely post v1.2.0 fits might’ve caused this. There were some torsion parameters, coupled with those angle parameters that had discrepancies, that didn't have any torsion profile training targets but were optimized just with optgeo targets. Some smaller fits with additional targets from Gen 1 sets seem to work better, going to do a larger fit to see if there would be any improvement. One example, a40 and t123, for phosphorous related molecules. Another, a31, a32 and t155 for hypervalent sulfur. t123, t155 (and a few other parameters) are completely missing from Gen2 sets post filtering out strongly H-bonded molecules.

        • PB – After filtering the 800+ torsiondrives, there are only 700 left, and so some parameters are trained to only 1 or 2 torsiondrives, and other don’t have any targets at all.

    • Sage manuscript revisions based on DM's feedback, have few more sections to overhaul.

    • Valence parameter optimization for Double-exponential fits

      • had a problem with few targets failing consistently and Josh found the source of error to be the OH vdW parameter, which is supposed to be a non-zero small value (a fix used in LJ versions)

      • also, we decided to reduce the scope to trained vdW parameters, and excluded S, P, F and I from the valence training set

  • JW –

    • We got the annual report out the door!

      • DN – I think we invested this year in making a process for putting out the report, and that next year we’ll have this ready to go.

      • DD – Could I take a look at it?

      • DN – Sure. I’ll share the PDFs.

    • Working on fixes for biopolymer rc. Think I fixed OE atom indexing in PDB.

    • This week will go full steam towards biopolymer, working with MT on interchange, reviewing+approving interchange regression testing



 

 

Action items

Decisions