Derify/augmented_canonical_pubchem_13m
• 13.3M • 37
A set of SMILES datasets canonicalized with RDKit, with 33% random augmentation, for robust and diverse molecular ML training.
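The description above can be illustrated with a minimal sketch of canonicalization plus partial random augmentation. This is not the dataset authors' pipeline: the helper names (`canonicalize`, `randomize`, `pick_augmented_indices`, `build`) are hypothetical, and reading "33%" as "a randomized variant is added for a random 33% of the canonical records" is an assumption. Only RDKit's `Chem.MolFromSmiles` and `Chem.MolToSmiles` are real library calls.

```python
import random


def canonicalize(smiles):
    """Return RDKit's canonical SMILES, or None if the input cannot be parsed."""
    # Imported lazily so the sampling helper below works without RDKit installed.
    from rdkit import Chem

    mol = Chem.MolFromSmiles(smiles)
    return Chem.MolToSmiles(mol) if mol is not None else None


def randomize(smiles):
    """Return a randomly ordered (non-canonical) SMILES for the same molecule."""
    from rdkit import Chem

    mol = Chem.MolFromSmiles(smiles)
    if mol is None:
        return None
    # doRandom=True asks RDKit for a random atom ordering instead of the canonical one.
    return Chem.MolToSmiles(mol, canonical=False, doRandom=True)


def pick_augmented_indices(n, fraction=0.33, seed=0):
    """Choose which of n records receive a randomized variant (assumed scheme)."""
    rng = random.Random(seed)
    k = round(n * fraction)
    return set(rng.sample(range(n), k))


def build(smiles_list, fraction=0.33, seed=0):
    """Canonicalize all parsable inputs, then append randomized variants for a subset."""
    canonical = [c for c in map(canonicalize, smiles_list) if c is not None]
    chosen = pick_augmented_indices(len(canonical), fraction, seed)
    augmented = [randomize(canonical[i]) for i in sorted(chosen)]
    return canonical + augmented
```

For example, `build(["OCC", "c1ccccc1", "CC(=O)O"])` would return the three canonical strings followed by one randomized variant. Fixing `seed` makes the choice of augmented records reproducible, although the random atom ordering itself is drawn from RDKit's own random state.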