Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Item

Notes

Organizing project pagesCollecting a larger dataset

  • Problem statement and permanent links should be on the top page

  • Meeting notes should be on child pages

  • Research data + code should go in a dedicated repo, with major figures/conclusions as release assets

Proton transfer dataset generation

  • CD sent SB his scripts, SB will run those on a large dataset on Lilac and report back failures.

  • CD’s automation indicated some non-isomorphisms in the tripeptides-with-oppositely-charged-AAs sets, but he hasn’t looked closely at them.

PDB connectivity guesses are really bad

  • CD – PReviouslyPreviously, JW had mentioned using QCElemental as an alternative to OpenEye

  • General – The initial molecule dataset (Minidrugbank.sdf) was such garbage that we can’t really use those results. So CD will round-trip all of those molecules through OpenEye and back to SDF so that we at least have a valid molecule for each.

  • Today, we’ll work on reading molecules from PDB using RDKit, guessing their bonds using QCElemental, and running the isomorphism checks from that.

  • (We made a code snippet to do pdb connectivity comparison that doesn’t use OpenEye, just rdkit)

    • CD – This is already removing a lot of the error cases that were spuriously coming up

Code Block
from rdkit import Chem
from openff.toolkit.topology import Molecule

rdmol = Chem.MolFromPDBFile('sqm_original.pdb', removeHs=False)
mol_from_pdb = Molecule.from_rdkit(rdmol, 
                                   allow_undefined_stereo=True,
                                   hydrogens_are_explicit=True)

input_mol = Molecule.from_file('input_original.sdf')

mol_from_pdb.is_isomorphic_with(input_mol, 
                                bond_order_matching=False,
                                formal_charge_matching=False,
                                aromatic_matching=False,
                               )

Action items

  •  

Decisions