BP: Running tests for qcf-next and everything seems good!
DD:
BP: I will cut a release earlier than I think and may run simultaneously old and new servers. Still contemplating options to host the server long term.
JW: Yeah, I started the conversation within OMSF. If we host our own we’ll start EOLing datasets, happy to start this conversation sooner.
BP – New version has more support for views, you can dump datasets into a SQLite file. Forward compatibility will still be complicated. New version will also allow for deleting datasets.
DD: Do you have granular permissions system for users?
BP: Not exactly, but deletions, etc., are restricted.
DD: I would prefer to have additional userid to just take care of dataset modifications like deletions, so that we won’t mess by accident.
BP – QCEl will also need updates. Not many other maintainers there, there’s LBurns and one other person who sometimes helps. The new postdoc, Mars, should also be working on it. We’ve got both postdocs now, working on onboarding. One will help with QCSchema, other will be using QCA on some applications.
BP – Was there a complaint about server performance?
PB – We’re comparing dipole moments from the benchmarking molecules.
DD – Ah, that’s right, but the exporter is slow. So there are 150k optimizations and it takes a long time to download. But we’re not committed to maintaining/updating openff-benchmark, so it took some configuring. Then BSwope asked whether there’s a faster way to get this and I said yes, but I don’t have the time to make custom solutions for them. So I looped PB in to this since he has some familiarity with the area.
PB – Right, I need to write a script for this, will aim to do it this morning.
DD – …
PB – Does the molecule data contain the company identifier?
BP – It may be in the entry data.
DD – So for each molecule, take the entry on the collection, and it will point toward the other objects that may have the name.
BP – Could be cool to offer a method that just pulls down all the final molecules. I could do this server-side pretty efficiently.
Opts: 310894 → 311229 → stuck at this, might be a longer run
TDs: 20 → 22 → stuck at this
OpenFF Protein Capped 1-mer Sidechains v1.2
TDs: 44 → 45 (remaining 1)
Stuck on 155215 for a week - might be a longer run or not
CC – I’ll take a look at these
DD – Should I leave lilac workers on? Or would it be better to have everything running on TSCC?
PB – Agree, let’s shut down all workers on lilac.
RNA Trinucleotide Single Point Dataset v1.0 - Almost complete!
57299 → 24362 → 7 remaining (might be longer runs, no status updates for the last three days)
DD – Let’s bump this to high-priority since we may be able to finish it off
User questions/issues
BP – Refresh my memory? In geometry optimizations, for every gradient step, you’re storing various indices, and they take up a lot of space (partly because they’re duplicated - One set has underscores and the other doesn’t)
PB – This is mayer and wiberg bond order, I think.
BP – I’d like to delete one of these copies. Should I delete the one with or without underscores? As I’m migrating the server I’d like to clean this up.
JW – Could we check which one qcsubmit pulls? That’s probably the one that we use.
0 Comments