Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Date

Participants

Goals

  • Determine new procedure for selecting QM datasets for fitting (potentially for May meeting release, if ready in time)

  • Divide up work to accomplish that

Notes:

Coverage of current QM dataset (Su)

...

Lee-Ping: Possible way to deal with coverage: Start with very big list of molecules. Build a list of molecules which use each parameter (being careful that for torsions they are exocyclic torsions) then you do clustering within that list.  Chemical similarity clustering for all moelcules molecules which use each parameter, then pick diverse molecules which use that parameter (e.g. five most diverse). 

...

Talk with Jeff: Class structure? script? Where does it live? 

Action items

Decisions

  • Decided to make systematic approach for selecting molecules for QM data generation & fitting given a target dataset; this will be applied dataset-by-dataset to select new molecules for use in fitting
  • Will attempt to select/redesign a new QM dataset for fitting rather than simply extending our prior QM dataset
  • Decided on tentative algorithm for molecule selection approach