automated machine learning (AutoML) (original) (raw)

Automated machine learning (AutoML) is the process of applying machine learning models to real-world problems using automation. More specifically, it automates the selection, composition and parameterization of ML models. Automating the machine learning process makes it more user-friendly and often provides faster, more accurate outputs than hand-coded algorithms.

AutoML software platforms make machine learning more user-friendly and give organizations without a specialized data scientist or ML expert access to machine learning. These platforms can be built in house or acquired from a third-party vendor and accessed through open source repositories such as GitHub.

How does the AutoML process work?

AutoML is typically a platform or open source library that simplifies each step in the machine learning process, from handling a raw data set to deploying a practical ML model. In traditional machine learning, models are developed by hand, and each step in the process must be handled separately.

A diagram depicting the AutoML process.

AutoML automatically locates and uses the optimal type of machine learning algorithm for a given task. Two concepts help achieve this:

Users with minimal machine learning and deep learning knowledge can then interface with the models through a coding language such as Python.

More specifically, here are some steps in the machine learning process that AutoML can automate, in the order they occur:

Why is AutoML important?

AutoML is important because it represents a milestone in machine learning and artificial intelligence. AI and ML have been subject to the "black box" criticism -- meaning machine learning algorithms can be difficult to reverse engineer. Although they improve efficiency and processing power to produce results, it can be difficult to track how the algorithm delivered that output. Consequently, this also makes it challenging to choose the correct model for a given problem, because it can be difficult to predict a result if a model is a black box.

AutoML helps to make machine learning less of a black box by making it more accessible. It automates parts of the ML process that apply the algorithm to real-world scenarios. A human performing this task would need an understanding of the algorithm's internal logic and how it relates to the real-world scenarios. AutoML, however, learns and makes choices that are more time-consuming or resource-intensive for humans to do with efficiency at scale.

Fine-tuning the end-to-end machine learning process -- or machine learning pipeline -- through meta learning has been made possible by AutoML.

On a wider scale, AutoML also represents a step toward artificial general intelligence.

Pros and cons of AutoML

The main benefits of AutoML are as follows:

The main challenge of AutoML is the temptation to view it as a replacement for human knowledge.

Like most automation, AutoML is designed to perform rote tasks efficiently with accuracy and precision, freeing up employees to focus on more complex or novel tasks. Things that AutoML automates -- such as monitoring, analysis and problem detection -- are rote tasks that are faster if automated. A human should still be involved to assess and supervise the model. AutoML should help, not replace, data scientists and other employees, especially those with expert knowledge.

Another challenge is that AutoML is a relatively new field, and some of the most popular tools are not yet fully developed.

Different ways to use AutoML

AutoML shares common use cases with traditional machine learning. Some of these include the following:

AutoML tool features

The following are some popular AutoML platforms:

Tools such as Auto-sklearn and AutoKeras are open source and can be run on local infrastructure, meaning users can avoid the costs of proprietary cloud services. They rely strongly on known architectures and data they've already seen, and support classification and regression techniques, among other tasks.

Tools such as Google AutoML and Azure ML, by contrast, are proprietary cloud platforms that offer scale, but also incur costs associated with cloud services. They use recurrent neural networks, convolutional neural networks, long short-term memory and other ML models.

Continue Reading About automated machine learning (AutoML)

Dig Deeper on AI technologies