This tool can
be used to split training set and test set by picking a subset of diverse molecules. The similarity of ECFP6 fingerprints based on
'DiceSimilarity' is employed to calculate distances between molecular objects, which guarantees the molecular diversity.