Start¶
The Start modules define how to determine which calculations to perform before there is enough data available to train a machine learning model.
Available Methods¶
ExaMol provides a few different start methods, each with a maximum recommended search space size.
Starter  | 
Category  | 
Maximum Search Size  | 
|---|---|---|
Fast  | 
100M  | 
|
KMeans  | 
100K  | 
Using a Starter¶
Simply provide an iterator over the names of molecules to consider:
starter = RandomStarter()
starting_pool = starter.select(['C', 'O', 'N'], 2)  # Will generate two choices
The starter will provide a list of SMILES strings from those that were provided.
Increase the speed of selection by setting the max_to_consider option of the Starter,
which will truncate the list of molecules strings at a specific size before running the selection algorithm.