Achira dataset proposals | LM – Current proposal is to expand parts of the SPICE2 dataset, and we don’t have compute/infra up yet. Wondered if we could collaborate and use QCSubmit/NRP. First idea is to do an opt dataset with a very limited number of steps on some or all of SPICE2. Would love to collaborate as much as you’re able to, datasets would be open source and hosted on QCA. LW – Some technical Qs SPICE1 hit some storage space limits on QCArchive - For this dataset, would it be all of SPICE2? LM – Currently unsure, wanted to generate more data close to thermally accessible structures, but haven’t decided yet what fraction of structures we’d want to start with. CR – Basically same understanding here. LW – This would be helpful, would bring back to lead team
LW – Re: your own runners - are these for certain coming/is there a rollout date planned? LW – For dataset management, would the goal be to use Q-D-S, or just directly submit to MolSSI QCA, etc?
LW – We’re generally open to work with you, but filling in more details would be great. Unfortunately we’re shutting down next week so wouldn’t be able to get things going until Jan if we don’t submit this week. What’s your level of urgency? LM – I think we’d like to get things going before the break, but will get back to you. We’re also off the next two weeks. CR – We’re meeting with folks to talk about compute resources later this week, unsure about exactly how urgent/what target dates are.
JW – BW do you recall BW – Under 300 Da is easy for us to run. 300-600 gets complicated. But probably if the whole dataset is under 400Da it’s easy for us at our default level of theory. LW: SPICE2 is ~2m conformations JW: could probably to between 10-100k opts over the holidays JW: may have to sort out storage directly with molssi for entire SPICE2 dataset BW (in chat): For 300-600, 4 CPUs/32 GB RAM has worked well without really any intervention. 4/20 GB for <300 Da LM – Sounds good, we’ll get more info and get back to you
|