Technical overview
| David Dotson
| Overall architecture – High-level details
DD / JC – Advantages of colocation would be that the Chodera lab could cover hosting expenses for OpenFF (both technical and personnel).
JC – I’d like to share as much as possible with OpenFE: molecule transformations, network mapping, etc.
JC – Can make data available on AWS for redownload, for people to download at their own discretion.
DD – Could save data on AWS S3. Different tiers of speeds/schedules; could be accessed by a public URI.
DD – Interested in getting output spec information from OpenFE.
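As a sketch of what the public-URI idea above could look like: a result object stored on S3 is addressable by a predictable HTTPS URL. The bucket and key names here are assumptions for illustration only; a real deployment would also use boto3 (or similar) for the upload itself.

```python
# Hypothetical sketch: building the public HTTPS URI for a result object
# stored on S3. Bucket and key layout are illustrative assumptions, not
# the project's actual storage scheme.

def public_s3_uri(bucket: str, key: str) -> str:
    """Virtual-hosted-style public URL for an object in an S3 bucket."""
    return f"https://{bucket}.s3.amazonaws.com/{key}"


uri = public_s3_uri("openff-results", "benchmarks/set-1/edge-42.json.gz")
# → "https://openff-results.s3.amazonaws.com/benchmarks/set-1/edge-42.json.gz"
```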
JC – What’s missing here is info about how inputs are specified.
JW – How do we avoid versioning hell on the stuff inside the green box?
JC – First thing is to standardize what goes in and out of the green box. Could freeze an env.
JW – How do you do this when there aren’t any automation details for free energy calculations in the green box?
JC – What you get is a person doing all the work of generating inputs, which manifests as a slow and expensive feedback loop. We want to be able to make iterations fast and cheap; this is scalability. Being able to get information about failures is key for FF development, drug discovery, and advancing the free energy calculation infrastructure.
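A minimal sketch of what a standardized green-box input record might look like, with engine versions pinned explicitly to avoid the versioning hell raised above. All field names here are illustrative assumptions, not an agreed spec.

```python
# Hypothetical sketch of a standardized "green box" input: everything the
# box consumes is declared up front, including pinned engine versions, so
# runs are reproducible. Field names are illustrative assumptions.
from dataclasses import dataclass


@dataclass(frozen=True)
class GreenBoxInput:
    network_file: str     # serialized transformation network
    engine: str           # e.g. "openmm" or "gromacs"
    engine_version: str   # pinned core version
    protocol: str         # free energy protocol identifier


spec = GreenBoxInput("network.json", "openmm", "7.7", "relative-binding")
```

Freezing the whole environment (e.g. via a pinned conda environment file) would complement a record like this, but the record alone already makes the input/output contract explicit.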
JC – would like us to come up with inputs that are a starting point, then we can iterate on them to build out downstream components
Current plans/functionality
JC – Ideally we are in a position where in 3–6 months we have a functional green box that supports one engine.
JW – Would like to see free energy benchmarking results; want things to be reproducible and isolatable (network planner, system generator, etc.). Need to be able to keep everything constant except one component, plus the ability to run locally with only the suspected problem inputs.
JW – Do we get to specify which version of Gromacs / OpenMM we want to run on a client machine?
JC – To some extent, yes; within a few recent releases you can specify core versions.
JC – Public benchmarking data will be the only way for OpenFE to identify failures; we won’t be able to operate on proprietary data.
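The “within a few recent releases” constraint above could be enforced with a simple compatibility check like this sketch; the release list and window size are made-up illustrations, not actual supported versions.

```python
# Hypothetical sketch: accept a requested core version only if it falls
# within the few most recent supported releases. Version strings and the
# window size are illustrative assumptions.

RECENT_OPENMM = ["8.0.0", "7.7.0", "7.6.0", "7.5.1"]  # newest first, illustrative


def is_supported(requested: str, recent: list[str], window: int = 3) -> bool:
    """True if the requested version is within the supported window."""
    return requested in recent[:window]


is_supported("7.7.0", RECENT_OPENMM)   # → True
is_supported("7.5.1", RECENT_OPENMM)   # → False (outside the window of 3)
```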
JW – How do we want visibility into errors? How do we delete or invalidate previous errors? Are we planning on having the ability to delete?
JC – We’ve definitely needed to trash datasets/results before.
JC – It’s become helpful to be able to pause or delete individual runs/edges.
JC – An adaptive supervisory process is possible later on that can add or delete edges in a network graph and propagate those changes into the work server.
JW – Generally agree. This could become complex, so I’d be flexible on this.
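The pause/delete capability discussed above might be modeled minimally as follows; the class name, edge identifiers, and statuses are illustrative assumptions, not fah-alchemy’s actual API.

```python
# Hypothetical sketch of pausing and deleting individual edges in an
# alchemical network, as a supervisory process might. Class name, edge
# ids, and statuses are illustrative assumptions.

class AlchemicalNetwork:
    def __init__(self):
        self.edges = {}                    # edge id -> status

    def add_edge(self, edge_id: str):
        self.edges[edge_id] = "running"

    def pause(self, edge_id: str):
        self.edges[edge_id] = "paused"     # stop scheduling new work units

    def delete(self, edge_id: str):
        self.edges.pop(edge_id, None)      # drop edge; invalidate its results


net = AlchemicalNetwork()
net.add_edge("ligand-A->ligand-B")
net.add_edge("ligand-A->ligand-C")
net.pause("ligand-A->ligand-B")
net.delete("ligand-A->ligand-C")
# net.edges == {"ligand-A->ligand-B": "paused"}
```

An adaptive supervisor would call operations like these, then propagate the resulting graph changes to the work server.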
Collect use cases/user stories
DD – Let’s populate the issue tracker of the following repo: issue tracker
JS – How black-box/agnostic would this be?
JW – I think JS may have multiple individual requests – things like “make the inputs agnostic enough to take host and guest systems” and “allow people to use pAPRika as a workflow component”.
JS – Could this run ForceBalance optimizations internally?
JW – I don’t think so, but this could be a backend for optimizations if we engineer it right.
DD – Yes, if we know more details about the sorts of calcs that ForceBalance will submit, then we can look at whether it’s easy to make that compatible with our input format.
JC – I’d talked to RGowers about shared object models/APIs; we’ll keep in touch about that.
Identify functionality gaps
Use cases need to eventually be written, in the fah-alchemy GH repo issue tracker.
DD will make an issue template: aspirational API, edge cases (must support / should support / shouldn’t support).
|