Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

While it has been hypothesized that we should only be interested in choosing molecules so as to achieve a uniform coverage of SMIRKS patterns, this may be problematic. The same SMIRKS pattern (especially the ones without specificity) can match multiple different chemical environments within molecules, and may require different data to be more effectively constrained. This would likely not be an issue in the regime of an excess of data, but given the scarcity of data (especially diverse data) it would be good to remove the possibility of this having an effect.

Alcohol-Ester (+Acid)

In general, the The initial data set selection has involved selecting a set of six common alcohols (methanol, ethanol, propanol, isopropanol, isobutanol and tert-butanol), and for each alcohol selecting approximately two esters, one larger and one smaller, for which both

Mathinline
host9e5865a8-c37e-3de7-b41a-1ad417a001db
body--uriencoded--H_%7Bmix%7D
and
Mathinline
host9e5865a8-c37e-3de7-b41a-1ad417a001db
body--uriencoded--V_%7Bexcess%7D
data is obtainable. The components selected according to this criteria are then intended to be used as the components in the pure data only study.

...