CIB – I like this direction. Three questions/comments: You are going to have to make a representation of bonds. One key advance that we want to bring into the FF is to move away from integer bond orders into widespread use of WBOs. Floating points will be hard to represent in a bit vector. TG – There’s a bit vector for the bond as well. DM – We would plan for more of these to be replaces by WBOs CB – Is there a representation for “any” bond order? TG – Yes – Bond bit vector “11111…”
How will you handle an atom which is described by what it’s bound to? SMARTS can be recursive, which could increase complexity. Bit vectors overdefine things which are mutually exclusive: An atom can be X1, X2, X3, or X4, but not many of them. Will this affect representation? Is this unnecessarily complicated/high dimensional?
CIB- how to analyze the data? There are several methods out there you can choose. within data in bit vector, using some grouping scheme, like random forest, to figure out what is common among them. DT has a feature selection: what the recurring theme is in the tree. Action items postponed to next week
View file |
---|
name | OpenFF_Gokey_ChemPer_call_2020-09-11.pdf |
---|
|
|