Notes |
---|
|
New datasets JR – I’ve been talking to PEastman and somne of the other SPICE folks to try and come up with a version 2.0. Happy to outline ideas for how to expand if folks here are interested. A couple of things I’d like to do in version 2: Build a massive dataset along the lines of “ML Big Data” Include solvation effects on all molecules in the dataset (not just AAs) Keep including more chemical space (bring in more of pubchem and other sources) Generate more conformers
Interested in two-level approach: I’ve been interested in OpenCatalyst work (similar project to develop a DB of QM calcs for materials, to develop catalysts). They host public challenges to evaluate models and update a leaderboard. So it’d be neat to have the QM community move past QM9 and have a challenge.
BP – This could fit on the ML instance - That will have SPICE and would fit the mission of other ML datasets DD – Where would the compute power come from/what resources could be marshalled? OpenFF helped with V1 but we may not be able to do a V2. JR – Can pull in a hodgepodge from genentech, also from NVidia and LightningAI. Also facebook folks (who do opencatalyst) were interested. So if we come up with a solid plan for what’ll be in the dataset we can likely marshall resources. My bigger concern is storage, and I’ve spoken with BP about this. I think it’ll be important to store densities and wavefunctions for at least a subset of the data. On the resources side, it’ll be good/necessary who have one person be the point person for SPICE2.0/openconformer intiative. Organizationally this will basically be necessary, and genentech could kick in to fund this role, or if there’s someone interested they could lead it. DD – Do you have suggestions for folks who could be a point person? JR – Not really. PEastman is interested in contributing technically but not managing the project. I’m not sure about OMSF/MolSSI. BP – At MolSSI I’m not sure we have someone with the expertise and the time. I’d love to have a dataset leader at MolSSI (ML or otherwise) since that’d take work off my shoulders. DD – Would SPICE2.0 be a big enough draw to reprioritize folks at MolSSI? BP – No, I don’t think so. JR – I’ll ask about folks at Genentech. JW – re: POSE grant; may be an avenue for at least a host org, funding; doesn’t solve finding the right person DD – Probably the right next step is to talk to JC about whether there’s a good person we could get. JW – Happy to have OpenFF contribute compute via PRP/NRP. Or to help MolSSI get on those platforms. So feel free to ask me for an introduction.
LW – We have some datasets we want to get in, but not urgently. Looking at diverse fragments containing iodine. Also hypervalent sulfurs. And a DNA dataset. All are relevant for vsites. But we don’t need them for the initial vsite FFs. Was hoping to submit once QCSubmit is updated, and once the new hires are up to speed, so in the next few months. The datasets aren’t finalized but I’ve been looking at some ideas. PB – Would these be for ESPs? LW – Iodine dataset and hypervalent sulfur yes, but DNA no. And these would be geometry optimizations. (General) – Not actually storing the ESPs, but instead storing wavefunctions and reconstructing ESPs locally. TG – Does it still store wavefunctions for the whole trajectory? Is there some way to just get it for first and last frames? BP – There’s an option to store the initial and final structure in a trajectory. But I don’t think there’s a way to store wavefuncs for the initial and final and then also store the trajectory without wavefunctions.
|
|
QCFractal development : sprint begins …
BP – I think we’re basically feature-complete here. And I’m tracking my work in Asana, so I don’t think I’ll open this up. Right now my “sprint” is basically bugfixes. I’m not adding any major new features, and I’ll prune the issue tracker and PR list once the next branch comes out. DD – This meeting could be a good way to pull in new devs
|
Additional business DD – Is there a MolSSI QCArchive Working Group start date? BP – Could be the week of the trainings. JW – Could go in the off-week from the current semiweekly meeting. DD – OpenFF could host the current meeting through the end of august? DD – I’ll mark this though Aug 29. BP – There’s a MolSSI user group meeting in November, like a mini symposium. We’re planning on MOPAC as a topic.
|