2020-08-11 QCFractal User Meeting notes
Date
Aug 11, 2020
Participants
@Jeffrey Wagner
@David Dotson
@Joshua Horton
@David Cerutti (Deactivated)
Discussion topics
Item | Notes |
---|---|
Updates from MolSSI |
|
Queue/Manager status |
|
User questions |
|
CMILES for disaccharide set |
|
| Path 1 (OE reading PDBs): python -c "from openforcefield.topology import Molecule; import sys; molecule = Molecule.from_file(sys.argv[1]); print(molecule.to_string()) " |
| Path 2: antechamber -fi pdb -i glu.pdb -fo mol2 -o test.mol2
antechamber -i 1_ac.mol2 -fi mol2 -o 1_ac_sy.mol2 -fo mol2 -at sybyl -dr no
obabel -imol2 1_ac_sy.mol2 -osdf -O new.sdf
python -c "from openforcefield.topology import Molecule; import sys; molecule = Molecule.from_file(sys.argv[1]); print(molecule.to_smiles()) " new.sdf
On structure with carbonyl and sulfate, OE interprets PDB correctly, antechamber loses track of bond orders + charge on sulfate (off-dev) jeffreywagner@JW-MBP$ diff out_ac out_pdb
1c1
< [H][C@]1([C@@]([C@](O[C@@]([C@]1([H])OS([O])([O])[O])([H])O[C@@]2([C@]([C@@]([C@](O[C@]2([H])C([H])([H])O[H])([H])OC([H])([H])[H])([H])OC(=O)C([H])([H])[H])([H])O[H])[H])([H])C([H])([H])O[H])([H])O[H])O[H]
---
> [H][C@]1([C@@]([C@](O[C@@]([C@]1([H])OS(=O)(=O)[O-])([H])O[C@@]2([C@]([C@@]([C@](O[C@]2([H])C([H])([H])O[H])([H])OC([H])([H])[H])([H])OC(=O)C([H])([H])[H])([H])O[H])[H])([H])C([H])([H])O[H])([H])O[H])O[H]
|
Protein dataset CMILES | JH- After the call I have found that the CMILES in the initial protein dataset optimizations are not correct due to the input mol2 files having all of the bond orders set to 1. The optimizations are still correct however as the net charge of the molecule was still 0. The v2.0 dataset will fix all CMILES strings and fully complete the dataset. |