...
Lee-Ping: Possible way to deal with coverage: start with a very big list of molecules. Build a list of the molecules that use each parameter (being careful that, for torsions, they are exocyclic torsions), then do chemical similarity clustering within each list and pick diverse molecules that use that parameter (e.g. the five most diverse).
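A minimal sketch of the selection idea, not an agreed implementation. Assumptions: molecules are identified by plain strings; `param_usage` (hypothetical) maps each molecule to the parameter ids it uses, with non-exocyclic torsions already filtered out upstream; `fingerprints` (hypothetical) maps each molecule to a set of feature ids standing in for a real chemical fingerprint. Instead of a full clustering step it uses greedy MaxMin picking on Tanimoto distance, which is one common way to get a diverse subset; a real version would likely use a cheminformatics toolkit for both fingerprints and clustering.

```python
from collections import defaultdict


def tanimoto_distance(a, b):
    """Return 1 - Tanimoto similarity between two feature sets."""
    union = len(a | b)
    return 1.0 - (len(a & b) / union if union else 1.0)


def group_by_parameter(param_usage):
    """Invert {molecule: [parameter ids]} into {parameter id: [molecules]}."""
    by_param = defaultdict(list)
    for mol, params in param_usage.items():
        for p in set(params):  # count each parameter once per molecule
            by_param[p].append(mol)
    return by_param


def pick_diverse(mols, fingerprints, n_pick=5):
    """Greedy MaxMin: repeatedly add the molecule farthest from those already picked."""
    mols = sorted(mols)  # deterministic starting point
    if len(mols) <= n_pick:
        return mols
    picked = [mols[0]]
    while len(picked) < n_pick:
        best = max(
            (m for m in mols if m not in picked),
            key=lambda m: min(
                tanimoto_distance(fingerprints[m], fingerprints[p]) for p in picked
            ),
        )
        picked.append(best)
    return picked


def select_coverage_set(param_usage, fingerprints, n_pick=5):
    """For every parameter, pick up to n_pick diverse molecules that use it."""
    groups = group_by_parameter(param_usage)
    return {p: pick_diverse(ms, fingerprints, n_pick) for p, ms in groups.items()}
```

With `n_pick=5` this corresponds to the "five most diverse" suggestion; parameters used by fewer than five molecules simply keep all of them.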
...