Strategy for protein FF torsion fits | Chapin Cavender | |
SMIRKS for protein-specific torsions | Chapin Cavender | Backbone torsions (written assuming that lower SMIRKS overwrite upper SMIRKS) General protein backbone: [#6X4]-[#6X3](=O)-[#7X3]-[#6X4]-[#6X3](=O)-[#7X3]-[#6X4] Glycine: [#6X4]-[#6X3](=O)-[#7X3]-[#6X4H2]-[#6X3](=O)-[#7X3]-[#6X4] Proline: [#6X4]-[#6X3](=O)-[#7R1X3]-[#6X4R1]-[#6X3](=O)-[#7X3]-[#6X4] Beta-branched (Ile, Thr, Val): [#6X4]-[#6X3](=O)-[#7X3]-[#6X4H1](-[#6X4H1])-[#6X3](=O)-[#7X3]-[#6X4] CC – Right now I use the X decorator instead of counting Hs in a lot of cases. Do we think this is safe? This would mean that a chlorinated amino acid would get these parameters applied instead of getting general small molecule parameters. PB – I think it’d be fine to use the same parameters for chlorinated AAs JW – It will depend on how generalizable the parameters would be. If the protein-specific torsion k s end up similar in magnitude to the generic torsions, we'll have some confidence that they'll generalize. DM – Agree, let’s fit using the current SMIRKS and then see how much they deviate from the “General” parameter to determine how safe it would be use allow for their more general use.
Sidechains torsions General protein chi1: [#7X3]-[#6X4](-[#6X3]=O)-[#6X4]-[!#1X4] General protein chi2: [#7X3]-[#6X4](-[#6X3]=O)-[#6X4]-[#6]~[!#1X4] Beta-branched chi1: [#7X3]-[#6X4](-[#6X3]=O)-[#6X4H1]-[!#1X4] Beta-branched chi2: [#7X3]-[#6X4](-[#6X3]=O)-[#6X4H1]-[#6]~[!#1X4] CC – For aromatic side chains, should we refit the CB - CG torsion? Or will it be sufficient to use the general CX4-CX3 torsion for that? JW – That’s a really hard question. The aromatic sidechain rotation landscape is probably dominated by sterics anyway MT – Probably best to keep it simple at this stage, since that would be a really hard question to investigate. So I’d be in favor of not trying to add new torsion parameters for that. CC – Agree. Some modern FFs are looking at fitting unique terms for each sidechain, but older FFs have sidechains sharing a lot of torsions. So right now we’ll try to use a few strategies for fitting/including sidechains, including The ILDN approach (4 unique sidechain treatments) The 14sb approach (12 unique sidechain treatments) Treating everything generically as discussed above (2 unique sidechain treatments, discussed in this meetings)
CC – Are we OK with not extending the sidechain SMRIKS over the adjacent peptide bond? This means that we’ll treat terminal residue side chains the same as main chain residue side chains.
JW – How are the benchmark datasets looking? JW – Also, FYI, we’re deciding whether to have the F@H interface replicate DHahns work precisely (like, using the exact same input structures and edges, which will be hard), or whether to just rerun the previously benchmarked FFs using the new infrastructure. The new numbers If we use the new infrastructure, a likely outcome is that all results may be systematically worse, since the earlier sims had a lot of expert intervention, and the F@H infrastructure will do everything in an automated way. So we’re looking at how hard it would be to force DHahn’s exact settings into the OpenFE instructure, but if it’s difficult I’m going to propose that we just rerun everything with the understanding that an automated solution may return worse results but that at least we’ll be comparing apples to apples.
|