Tabular data overview (original) (raw)

Vertex AI lets you perform machine learning with tabular data using simple processes and interfaces. You can create the following model types for your tabular data problems:

For an introduction to machine learning with tabular data, seeIntroduction to Tabular Data. For further information about Vertex AI solutions, seeVertex AI solutions for classification and regressionand Vertex AI solutions for forecasting.

A note about fairness

Google is committed to making progress in followingresponsible AI practices. To this end, our ML products, including AutoML, are designed around core principles such as fairnessand human-centered machine learning. For more information about best practices for mitigating bias when building your own ML system, see Inclusive ML guide - AutoML.

Vertex AI solutions for classification and regression

Vertex AI offers the following solutions for classification and regression:

Tabular Workflow for End-to-End AutoML

Tabular Workflow for End-to-End AutoML is a complete AutoML pipeline for classification and regression tasks. It is similar to theAutoML API, but allows you to choose what to control and what to automate. Instead of having controls for the whole pipeline, you have controls for every step in the pipeline. These pipeline controls include:

Benefits

To learn more about Tabular Workflows, see Tabular Workflows on Vertex AI. To learn more about Tabular Workflow for End-to-End AutoML, see Tabular Workflow for End-to-End AutoML.

Tabular Workflow for TabNet

Tabular Workflow for TabNet is a pipeline that you can use to train classification or regression models. TabNet uses sequential attention to choose which features to reason from at each decision step. This promotes interpretability and more efficient learning because the learning capacity is used for the most salient features.

Benefits

To learn more about Tabular Workflows, see Tabular Workflows on Vertex AI. To learn more about Tabular Workflow for TabNet, see Tabular Workflow for TabNet.

Tabular Workflow for Wide & Deep

Tabular Workflow for Wide & Deep is a pipeline that you can use to train classification or regression models. Wide & Deep jointly trains wide linear models and deep neural networks. It combines the benefits of memorization and generalization. In some online experiments, the results showed that Wide & Deep significantly increased Google store application acquisitions compared with wide-only and deep-only models.

Benefits

To learn more about Tabular Workflows, see Tabular Workflows on Vertex AI. To learn more about Tabular Workflow for Wide & Deep, see Tabular Workflow for Wide & Deep.

Classification and regression with AutoML

Vertex AI offers integrated, fully managed pipelines for end-to-end classification or regression tasks. Vertex AI searches for the optimal set of hyperparameters, trains multiple models with multiple sets of hyperparameters and then creates a single, final model from an ensemble of the top models. Vertex AI considers neural networksand boosted trees for the model types.

Benefits

For further information, seeClassification and Regression Overview.

Vertex AI solutions for forecasting

Vertex AI offers the following solutions for forecasting:

Tabular Workflow for Forecasting

Tabular Workflow for Forecasting is the complete pipeline for forecasting tasks. It is similar to theAutoML API, but allows you to choose what to control and what to automate. Instead of having controls for the whole pipeline, you have controls for every step in the pipeline. These pipeline controls include:

Benefits

To learn more about Tabular Workflows, see Tabular Workflows on Vertex AI. To learn more about Tabular Workflow for Forecasting, see Tabular Workflow for Forecasting.

Forecasting with AutoML

Vertex AI offers an integrated, fully managed pipeline for end-to-end forecasting tasks. Vertex AI searches for the optimal set of hyperparameters, trains multiple models with multiple sets of hyperparameters, and then creates a single, final model from an ensemble of the top models. You can choose between Time series Dense Encoder (TiDE),Temporal Fusion Transformer (TFT), AutoML (L2L), and Seq2Seq+ for your model training method. Vertex AI considers only neural networks for the model type.

Benefits

For further information, seeForecasting Overview.

Forecasting with BigQuery ML ARIMA_PLUS

BigQuery ML ARIMA_PLUS is a univariate forecasting model. As a statistical model, it is faster to train than a model based on neural networks. We recommend training a BigQuery ML ARIMA_PLUS model if you need to perform many quick iterations of model training or if you need an inexpensive baseline to measure other models against.

Like Prophet, BigQuery ML ARIMA_PLUS attempts to decompose each time series into trends, seasons, and holidays, producing a forecast using the aggregation of these models' predictions. One of the many differences, however, is that BQML ARIMA+ uses ARIMA to model the trend component, while Prophet attempts to fit a curve using a piecewise logistic or linear model.

Google Cloud offers a pipeline for training a BigQuery ML ARIMA_PLUS model and a pipeline for getting batch predictions from a BigQuery ML ARIMA_PLUS model. Both pipelines are instances ofVertex AI Pipelines fromGoogle Cloud Pipeline Components (GCPC).

Benefits

For further information, seeForecasting with ARIMA+.

Forecasting with Prophet

Prophet is a forecasting model maintained by Meta. See the Prophet paperfor algorithm details and the documentation for more information about the library.

Like BigQuery ML ARIMA_PLUS, Prophet attempts to decompose each time series into trends, seasons, and holidays, producing a forecast using the aggregation of these models' predictions. An important difference, however, is that BQML ARIMA+ uses ARIMA to model the trend component, while Prophet attempts to fit a curve using a piecewise logistic or linear model.

Google Cloud offers a pipeline for training a Prophet model and a pipeline for getting batch predictions from a Prophet model. Both pipelines are instances ofVertex AI Pipelines fromGoogle Cloud Pipeline Components (GCPC).

Integration of Prophet with Vertex AI means that you can do the following:

Although Prophet is a multivariate model, Vertex AI supports only a univariate version of it.

Benefits

For further information, seeForecasting with Prophet.

What's next