OM – Showed that vsites improved performance in many cases.
CB – Did the vsite refitting include an LJ refit as well? Or are the before- and after- plots generated using different LJ parameters as well?
OM – I’m not sure, will need to look into this
CB – Do we have mixture data involving thiophene? Some earlier work had shown the importance of this group.
OM – I don’t think so. We didn’t have any sulfur compound data in the Sage training set - For the sulfur-containing data we didn’t have at least 5 data points so they didn’t meet our inclusion criteria.
CB – Could we include aromatic sulfur in future training?
OM – Right now it’s a data issue. We could relax criteria or find more data to get suitable inputs.
CB – Could we move forward without experimental data? Using QC or something else?
OM – I don’t think I can make that call.
PB – I can get the electrostatic energy decomposition from SAPT, and that could be a good source of theory training data.
CB – I do agree with the preference to train on experimental data. But if our FF is going to work on druglike chemistry, we should let theory help guide how we prioritize the search for experimental data.
MS – If we have vsites that get tuned to experiment, we’ll have to decide which experiments to do in the first place. And if we only have enough experimental data to use for benchmarking/validation…
CB – If, in the context of doing all kinds of chemistry, we’re missing experimental data in one area, would we leave that part of the FF untouched?
MS –
OM – If we don’t have experimental data, we’d have trouble adjusting LJ values using just throey data
CB – People have done that before, though. But we should identify areas where using pure theory is valid (without experimental data). There’s existing work showing that this is valid, but it’s a bit indirect (it works in discovery projects/hit selection)
LW – Charges are one area where I’m not sure that high-level theory data translates into usable FFs. …
MS – We could say “for sulfur, we’ll just fit to ESPs”. Is there liquid property/experimental data we can use to validate?
CB – These off-center charges on S won’t have a big consequence in most cases. Where they’ll have a consequence is when there’s a strong hbond acceptor in the vicininty of the thiophene. So that would be a good way to look for data - basically the neighboring group would be in or out of plane. Like thiophene and pyrrole or another donor-only solvent. So even if you had neat liquid thiophene, it probably wouldn’t be very useful for thiophene fitting - You’d need short range, directional interactions.
MS – So maybe we can do stuff with S without a validation set? But I’m really uncomfortable moving ahead with no experimental validation.
OM – To be fair, our current LJ params are from OPLS and are intended for use with a different combining rule. So it’s a low bar to clear.
CB – The theory comparison I’d do would be a thiophene-water dimer. I’d scan over translation and rotation of the water, in particular around the thipohene group. Then I’d train to ESPs and dimer energies. But this still wouldn’t obviously be a replacement for experimental data. We’d need to make the argument for why we can move forward without experiment.
DM – If we already have vsites that are justifiable, maybe we should take that gain and go with it, and then deal with the more complex ones later.
MS – Agree
CB – Sounds good
CB – Did we put “rabbit ears” go on carbonyl oxygens?
CB – Are we using xtal data?