Skip to end of metadata
Go to start of metadata
You are viewing an old version of this page. View the current version.
Compare with Current
View Page History
« Previous
Version 6
Next »
Participants
Goals
alchemiscale.org
DD : alchemiscale
roadmap 2025: possible components
additional FAH protocols in alchemiscale-fah
parallel execution of ProtocolDAG
s on conventional compute, GPU saturation e.g. for feflow.NonEquilibriumCyclingProtocol
merging and copying AlchemicalNetwork
s in alchemiscale
server
additional Strategy
implementations beyond NetBFE, e.g. CBayesMBAR
compute autoscaling for HPC, Kubernetes clusters
support for result file retention and retrieval?
user-facing dashboard, visualizer?
admin-facing dashboard, management interface (e.g. adding/removing users, scopes, permissions)
automated, online FE estimates for transformations, whole-network estimators?
compute cost estimate for planned networks based on previously-executed networks?
others?
DD : proposal: reorganize alchemiscale
project coordination
migrate alchemiscale
repo under OpenFreeEnergy Github org
create alchemiscale
channel in OpenFE Slack for developer communication
create alchemiscale.org
channel for users of that production instance, announcements related to instance issues, deployments
use Github Discussions as a hub for user questions that fall outside of issues, are more to do with usage questions
host working group under OpenFE org, use infrastructure for tracking meeting notes
DD : alchemiscale-fah
live test performed with FAH volunteers
executed 190 ProtocolDAG
s on FAH using FahNonEquilibriumCyclingProtocol
on ['tyk2', 'mcl1', 'hif2a', 'shp2']
without additional human interaction, corresponding to 1900 FAH work units
results on FAH comparable to equivalent compute performed on NRP for each edge:
Discussion topics
Notes |
---|
alchemiscale.org
DD : alchemiscale roadmap 2025: possible components additional FAH protocols in alchemiscale-fah parallel execution of ProtocolDAG s on conventional compute, GPU saturation e.g. for feflow.NonEquilibriumCyclingProtocol merging and copying AlchemicalNetwork s in alchemiscale server additional Strategy implementations beyond NetBFE, e.g. CBayesMBAR compute autoscaling for HPC, Kubernetes clusters JW – Selfishly, this would be really nice for me, since this is exactly what I do manually JS – Agree. Helps because of the difficulty of orchestrating compute over time zone difference, so this can help a lot with turnaround. DD – This would be handy for me too. If we always have a backlog this won’t be that important, but if we don’t it’ll help a lot.
support for result file retention and retrieval? user-facing dashboard, visualizer? admin-facing dashboard, management interface (e.g. adding/removing users, scopes, permissions) automated, online FE estimates for transformations, whole-network estimators? compute cost estimate for planned networks based on previously-executed networks? others? JS – Pulling down final snapshots from simulation output DD – This would be somewhat protocol-dependent. So alchemiscale-adjacent, but would need to be implemented in the protocol. IP – There’s also a question of how to handle paths in protocols. DD – Currently, if a protocol returns paths as part of its return values, we don’t do anything with them. So there would need to be some new functionality to handle actual files. Would be a really big refactor though. IA – Is this related to DS’s storage management stuff? DD – It is, we have an open PR on alchemiscale (#104) that stalled out because of coordination costs and other priorities. So we could implement something about storage in the GUFE layer (GUFE #186) , that alchemiscale later uses. There’s also a possibly-simpler alternate proposal in alchemiscale #180. IA – Last I heard from DS, this was “working in protocols” but I don’t know exactly what this means. DD – There were a few levels of storage being implemented - this added a “permanent” storage layer that long-lived files could be added to, but nothing interfaced with this.
IP – Is extending protocols in scope? DD – Alchemiscale already supports extends , but the protocol also needs to implement it. IA – From the OpenFE standpoint, this MAY happen. Not this year but possibly future years. DD – There’s kinda a difference in philosophy between noneqcyc and repex, where, if you ruin a bunch of NECs, you can combine them to get an uncertainty that narrows. But with repex, you don’t get convergence, you get finer resolution on the distribution of the outputs. Not sure if this is planning to change, and if extends support would play into this. IA – Short version - Industry partners want to be able to restart sims. I’m thinking that restarting and extending may be neighbors. DD – Is this in the case of a sim in the edge failing or something else? IA – A sim in the edge failing midway through.(?) DD – (see recording ~42 mins) .
DD : proposal: reorganize alchemiscale project coordination migrate alchemiscale repo under OpenFreeEnergy Github org create alchemiscale channel in OpenFE Slack for developer communication create alchemiscale.org channel for users of that production instance, announcements related to instance issues, deployments use Github Discussions as a hub for user questions that fall outside of issues, are more to do with usage questions host working group under OpenFE org, use infrastructure for tracking meeting notes JW – This all sounds A-OK by me. Maybe also share the recording with the notes. JE – I think this is a great idea and am in support. Might be some logistical hurdles we have to clear in terms of taking advantage of OpenFE’s resources (eg public/private split on OpenFE’s confluence). So there may be some logistics to the migration. DD – That’s fine, we can do this in stages. JE – We will also need to have some internal OpenFE discussion about other ramifications.
DD – We’d also migrate the project board to openfe - Will make the tagging/work assignment more straightforward.
DD : alchemiscale-fah live test performed with FAH volunteers executed 190 ProtocolDAG s on FAH using FahNonEquilibriumCyclingProtocol on ['tyk2', 'mcl1', 'hif2a', 'shp2'] without additional human interaction, corresponding to 1900 FAH work units results on FAH comparable to equivalent compute performed on NRP for each edge (General) – There’s something really weird about this range. DD will follow up with IA. But overall the consistency between NRP and FAH is very promising.
|
|
Action items
Decisions
Add Comment