Item | Presenter | Notes | Notes |
---|
protein-ligand-benchmark - 0.3.0 milestone review
| |
fah-alchemy project backlog prioritization
| fah-alchemy : Phase 1 - MVP identify appropriate place for Executor, ProtocolDAGResult versioned schema DD – I was hoping that we could generalize the F@H executor with OpenFE, but it seems like OpenFE’s goals are to have a smaller in-process executor, whereas the F@H executor may need larger-scale things like a RESTful API. RG – We’re not sure which executor architecture we’ll use at this time. JC – It seems like this is a clear target for standardization. So the F@H executor could be a subclass of the executor API. Are there reasons this couldn’t happen? RG – I think the nature of the implementation of the executor is separate from the chemistry of the problem. So this would go outside GUFE. JC – This does seem like a critical part of the “grand unified” part. Like, having a serial implementation would be in scope for living in GUFE. RG – We’re pretty focused on something DASK or DASK-like. DD – For F@H alchemy, we need to have a concept of access permissions, users, roles, etc. That’s not something that OpenFE will need. RG – Right, that’s not chemistry. JC – The user roles+auth stuff can live above the executor, though DD – JC – Executor takes in… DD – Multiple compute hosts could subscribe to the same executor. For fah-alchemy, I’m interested in moving forward with something that meets our known needs. JC – … Is there an example of how the code would be calling each other? What will the API and component cross-talk look like? DD + RG – Let’s meet later this week to better understand this, and whether it can fit in OpenFE. JC – I do think that the separation of chemistry and technical details is reasonable… RG – …. RG and DD will meet later this week to get a better sense of these details.
DD – Results storage+format. What can we plan to do now, how can we load old results? JC - I wouldn’t worry too much about this, final results are what really matter, next most useful data are the snapshots, and then the final tier (and my not be able to be maintained) is the raw data from the intermediate steps
DD – Are you advocating for storage of intermediate results? JC – No, I think results should be ephemeral, and after some time it’s reasonable to require regeneration. DD – This is a similar conclusion to a discussion I had with DSwenson earlier. There’s a lot of trust placed on protocol authors to handle the complexity between the input and output. JC – Sounds great. Just make sure there’s a way for important error information/detailed run data to be recoverable. DD – Sounds good.
DD – Prioritization of items on this board? Should any of these come first?
|
ResultStore working group update
| DS – I’ll send out a note on slack to find a time for the ResultsStore working group meeting. Right now, OpenFE has a PR up that addresses the raw files that come out of simulations. So this is something that needs to be agreed upon between different projects. We need a way that addresses information, probably like a file path, that could map to a local file, S3 store, hdf5 file content, etc. This is something that should be agreed upon beforehand but can change over time.
|
MH : ProtocolSettings taxonomy working group update | |
| RG – What limits a protocol from being used on F@H? If that were clearer I’d be able to understand how much of the project board can be done outside of F@H. DD – One concept right now is that we have to solve how we store a registry of protocols. This will be important to ensure that we get the same thing when serializing and deseriailizing. This is an important component of storing the DAG. RG – This would basically be monkeypatching the namespace of the protocol. JC – This is ultimately related to the science of how we decompose the science into pieces. I’d love to see a world where we decouple the protocol from the engine being used to compute. DD – … RG – I’m looking at these protocols from the perspective of: how can I present this to our board in a way that makes the overlap clear. DD – The big question is where to have the separation between protocols and schedulers.
|
| |