Sorting_molecules_script.ipynb provides a walkthrough of the task, with brief background on the data and the objective of this workflow.
The data.xlsx file contains conjugated polymer data. Only the SMILES strings are used for this task; the other columns can be ignored.
The remaining .py modules provide helper functions that are used in Sorting_molecules_script.ipynb.
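
As a rough illustration of using only the SMILES strings from data.xlsx, the sketch below pulls a single column out of a pandas DataFrame and drops everything else. The column name `smiles` and the helper `extract_smiles` are assumptions made for this example; they are not taken from the notebook or the helper modules, so check the actual header in the spreadsheet first.

```python
import pandas as pd


def extract_smiles(df: pd.DataFrame, column: str = "smiles") -> list[str]:
    """Return the non-empty SMILES strings from one column.

    `column` is a hypothetical header name; adjust it to match data.xlsx.
    All other columns in the DataFrame are ignored.
    """
    # Drop missing cells, coerce to str, and trim surrounding whitespace.
    smiles = df[column].dropna().astype(str).str.strip()
    # Discard cells that were blank after trimming.
    return [s for s in smiles if s]


# Typical use (reading .xlsx files requires the openpyxl package):
# df = pd.read_excel("data.xlsx")
# smiles = extract_smiles(df)
```

Keeping the extraction separate from the file read makes it possible to check the column handling on a small in-memory table before running it on the full spreadsheet.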