Problem statement and permanent links should be on the top page
Meeting notes should be on child pages
Research data + code should go in a dedicated repo, with major figures/conclusions as release assets
Proton transfer dataset generation
CD sent SB his scripts, SB will run those on a large dataset on Lilac and report back failures.
CD’s automation indicated some non-isomorphisms in the tripeptides-with-oppositely-charged-AAs sets, but he hasn’t looked closely at them.
PDB connectivity guesses are really bad
CD – Previously, JW had mentioned using QCElemental as an alternative to OpenEye
General – The initial molecule dataset (Minidrugbank.sdf) was such garbage that we can’t really use those results. So CD will round-trip all of those molecules through OpenEye and back to SDF so that we at least have a valid molecule for each.
Today, we’ll work on reading molecules from PDB using RDKit, guessing their bonds using QCElemental, and running the isomorphism checks from that.
(We made a code snippet to do pdb connectivity comparison that doesn’t use OpenEye, just rdkit)
CD – This is already removing a lot of the error cases that were spuriously coming up
Add Comment