Compute management | JW – I think I’ve left responsibilities vague/undefined while going on/offline a lot the past 2 weeks. I’d be happy with a range of outcomes, including me managing everything or lily managing everything. Status quo seems to be me running old workers and lily handling pyddx, which I’m also fine with, though this may be wasteful of two people’s attention. LM – Was running my own workers since JW was offline. LW – I think it could make sense for dataset owners to be ultimately responsible for the computation of their dataset, either by managing their own workers or communicating with central person. JW will take over Lipid MAPS worker management and continue running workers for the phosphate torsiondrive dataset
|
What happened to qc_record.extras ? | |
Discussion of OpenFF QCA dataset standards
|
LW: This is a public facing document, so we should decide what we do want to follow, and remove anything we don’t want to follow. Luckily, we have been following one part, which is accurately marking our datasets as not following the guidelines JW: This basically got dropped after Simon left, I have no particular attachment to these standards but we should have some standards Specific discussion points LW: Not sure what it means by meaning of molecule names? We don’t name the molecules JW: Changelog doesn’t really make sense, we don’t change our datasets JW: Blacklist doesn’t make sense to me, need more info. If there were problems with our dataset, we’d submit a new one with a new version LM: Maybe bad QCA IDs, to filter out after the fact? LW: Maybe, that would have to be added after the fact Let’s ask trevor
Dataset status: LW/JW: These are a pain and no one does it, let’s get rid of it LW: We could use the same tags as the project board JW: Could be a way to automate it with the project board, but I don’t know it, the updates to projects are confusing
Will continue discussion offline and ping trevor
|