Genetic Programming for Automated Machine Learning
We discussed TPOT, a tool that uses genetic programming to optimize machine learning pipelines and produce accurate models. TPOT can be configured to find accurate models on large datasets. A data scientist could do the same work by hand, but exploring as many candidate pipelines manually could take months or even years.
Automated machine learning (or simply AutoML) refers to automating the construction of a data analysis pipeline. AutoML can cover data pre-processing, feature selection, and feature engineering, along with the choice of machine learning method and parameter settings, all optimized for your data. The biggest benefit of AutoML is that it automates algorithm selection, which can then take hours instead of the months that manual selection might require.
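To make the idea of an evolutionary pipeline search concrete, here is a minimal sketch in plain Python. It is not TPOT's implementation: TPOT evolves tree-shaped pipelines with genetic programming, while this sketch runs a simpler genetic algorithm over a fixed set of stages. The search space, the stage names, and the toy_fitness function (a made-up stand-in for cross-validated accuracy) are all assumptions chosen for illustration.

```python
import random

# Hypothetical search space: one choice per pipeline stage.
SEARCH_SPACE = {
    "scaler": ["none", "standard", "minmax"],
    "selector": ["none", "variance_threshold", "select_k_best"],
    "model": ["logistic_regression", "decision_tree", "random_forest"],
    "max_depth": [2, 4, 8, 16],
}

# Made-up per-choice bonuses; a real system would train and cross-validate.
TOY_BONUS = {
    "scaler": {"none": 0.00, "standard": 0.10, "minmax": 0.05},
    "selector": {"none": 0.00, "variance_threshold": 0.03, "select_k_best": 0.08},
    "model": {"logistic_regression": 0.10, "decision_tree": 0.05, "random_forest": 0.15},
    "max_depth": {2: 0.00, 4: 0.04, 8: 0.06, 16: 0.02},
}


def random_pipeline(rng):
    """Sample one option for every stage."""
    return {stage: rng.choice(options) for stage, options in SEARCH_SPACE.items()}


def toy_fitness(pipeline):
    """Stand-in score: a base value plus the bonus of each chosen option."""
    return 0.5 + sum(TOY_BONUS[stage][choice] for stage, choice in pipeline.items())


def crossover(a, b, rng):
    """Build a child by taking each stage from one of the two parents."""
    return {stage: rng.choice([a[stage], b[stage]]) for stage in SEARCH_SPACE}


def mutate(pipeline, rng):
    """Re-sample the option of one randomly chosen stage."""
    child = dict(pipeline)
    stage = rng.choice(list(SEARCH_SPACE))
    child[stage] = rng.choice(SEARCH_SPACE[stage])
    return child


def evolve(generations=20, population_size=20, seed=0):
    """Keep the better half each generation and refill with mutated offspring."""
    rng = random.Random(seed)
    population = [random_pipeline(rng) for _ in range(population_size)]
    for _ in range(generations):
        population.sort(key=toy_fitness, reverse=True)
        parents = population[: population_size // 2]
        children = []
        while len(parents) + len(children) < population_size:
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        population = parents + children
    return max(population, key=toy_fitness)


best = evolve()
print(best, round(toy_fitness(best), 3))
```

Because the better half of each generation is carried over unchanged, the best score never gets worse from one generation to the next. In a real AutoML run the fitness call is the expensive part, since every candidate pipeline has to be trained and validated, which is why the number of generations and the population size largely determine how long the search takes.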