...
Setting the max conformers was the most effective method in maintaining chemical diversity when selecting molecules from clusters and reducing the number of clusters. Other methods, such as selecting the smallest molecule in the cluster, might reduce # of conformers but also reduce chemical diversity. The main goal of the training data set selection is to increase chemical diversity to parameterize increase the chemical space our force field covers.