Este curso ensina você a construir modelos Vertex AI AutoML sem escrever uma única linha de código; criar modelos BigQuery ML com conhecimentos básicos de SQL; configurar jobs de treinamento personalizados no Vertex AI usando contêineres (com pouco conhecimento em Docker); utilizar o Feature Store para gerenciamento e governança de dados; aplicar engenharia de atributos para melhorar os modelos; escolher as opções adequadas de pré-processamento de dados para seu caso de uso; desenvolver modelos de machine learning distribuídos e escaláveis com TensorFlow; e adotar as melhores práticas para implementar soluções de ML no Google Cloud.
Aprenda tudo isso e muito mais!
Objetivos
• Criar, treinar e implantar um modelo de machine learning sem escrever uma única linha de código usando o Vertex AI AutoML.
• Entender quando usar AutoML e BigQuery ML.
• Criar conjuntos de dados gerenciados no Vertex AI.
• Adicionar atributos ao Feature Store.
• Descrever o Analytics Hub, Dataplex e Data Catalog.
• Explicar o ajuste de hiperparâmetros com o Vertex Vizier e como ele pode melhorar o desempenho do modelo.
• Criar um Notebook gerenciado pelo usuário no Vertex AI Workbench, construir um job de treinamento personalizado e implantá-lo usando um contêiner Docker.
• Explicar predições batch e online, além de monitoramento de modelos.
• Descrever como melhorar a qualidade dos dados.
• Realizar análise exploratória de dados.
• Criar e treinar modelos de aprendizado supervisionado.
• Otimizar e avaliar modelos com funções de perda e métricas de desempenho.
• Criar conjuntos de dados de treino, validação e teste de forma escalável e reproduzível.
• Implementar modelos de ML com TensorFlow/Keras.
• Explicar como representar e transformar atributos.
• Compreender os benefícios da engenharia de atributos.
• Explicar os Vertex AI Pipelines.
Público-Alvo
Neste curso, os participantes aprenderão as seguintes habilidades:
- Aspirantes a analistas de dados, cientistas de dados e engenheiros de dados em machine learning.
- Profissionais que desejam se familiarizar com ML utilizando Vertex AI AutoML, BQML, Feature Store, Workbench, Dataflow, Vizier (para ajuste de hiperparâmetros) e TensorFlow/Keras.
Pré-requisitos
Para aproveitar ao máximo este curso, os participantes precisam atender aos seguintes critérios:
- Familiaridade com conceitos básicos de machine learning.
- Proficiência básica em uma linguagem de script, de preferência, Python.
Duração
Investimento
Resumo do curso
What are best practices for implementing machine learning on Google Cloud? What is
Vertex AI and how can you use the platform to quickly build, train, and deploy AutoML
machine learning models without writing a single line of code? What is machine
learning, and what kinds of problems can it solve?
Google thinks about machine learning slightly differently: it’s about providing a unified
platform for managed datasets, a feature store, a way to build, train, and deploy
machine learning models without writing a single line of code, providing the ability
to label data, create Workbench notebooks using frameworks such as TensorFlow,
SciKit Learn, Pytorch, R, and others. Our Vertex AI Platform also includes the ability
to train custom models, build component pipelines, and perform both online and batch
predictions. We also discuss the five phases of converting a candidate use case to be
driven by machine learning, and consider why it is important to not skip the phases. We
end with a recognition of the biases that machine learning can amplify and how
to recognize them.
• Describe the Vertex AI Platform and how it is used to quickly build, train, and deploy
AutoML machine learning models without writing a single line of code.
• Describe best practices for implementing machine learning on Google Cloud.
• Develop a data strategy around machine learning.
• Examine use cases that are then reimagined through an ML lens.
• Leverage Google Cloud Platform tools and environment to do ML.
• Learn from Google’s experience to avoid common pitfalls.
• Carry out data science tasks in online collaborative notebooks.
The course begins with a discussion about data: how to improve data quality and
perform exploratory data analysis. We describe Vertex AI AutoML and how to build,
train, and deploy an ML model without writing a single line of code. You will understand
the benefits of Big Query ML. We then discuss how to optimize a machine learning
(ML) model and how generalization and sampling can help assess the quality of ML
models for custom training.
• Describe Vertex AI AutoML and how to build, train, and deploy an ML model without
writing a single line of code.
• Describe Big Query ML and its benefits.
• Describe how to improve data quality.
• Perform exploratory data analysis.
• Build and train supervised learning models.
• Optimize and evaluate models using loss functions and performance metrics.
• Mitigate common problems that arise in machine learning.
• Create repeatable and scalable training, evaluation, and test datasets.
The modules cover designing and building a TensorFlow input data pipeline, building
ML models with TensorFlow and Keras, improving the accuracy of ML models, writing
ML models for scaled use, and writing specialized ML models.
• Create TensorFlow and Keras machine learning models.
• Describe TensorFlow key components.
• Use the tf.data library to manipulate data and large datasets.
• Build a ML model using tf.keras preprocessing layers.
• Use the Keras Sequential and Functional APIs for simple and advanced model
creation. Understand how model subclassing can be used for more
customized models.
• Use tf.keras.preprocessing utilities for working with image data, text data, and
sequence data.
• Train, deploy, and productionalize ML models at scale with Cloud AI Platform.
Want to know about Vertex AI Feature Store? Want to know how you can improve
the accuracy of your ML models? What about how to find which data columns make
the most useful features? Welcome to Feature Engineering, where we discuss good
versus bad features and how you can preprocess and transform them for optimal use
in your models. This course includes content and labs on feature engineering using
BigQuery ML, Keras, and TensorFlow.
• Describe Vertex AI Feature Store.
• Compare the key required aspects of a good feature.
• Combine and create new feature combinations through feature crosses.
• Perform feature engineering using BigQuery ML, Keras, and TensorFlow.
• Understand how to preprocess and explore features with Dataflow and Dataprep
by Trifacta.
• Understand and apply how TensorFlow transforms features.
This course encompasses a real-world practical approach to the ML Workflow: a case
study approach that presents an ML team faced with several ML business
requirements and use cases. This team must understand the tools required for data
management and governance and consider the best approach for data preprocessing:
from providing an overview of Dataflow and Dataprep to using BigQuery
for preprocessing tasks.
The team is presented with three options to build machine learning models for two
specific use cases. This course explains why the team would use AutoML, BigQuery
ML, or custom training to achieve their objectives.
A deeper dive into custom training is presented in this course. We describe custom
training requirements from training code structure, storage, and loading large datasets
to exporting a trained model.
You will build a custom training machine learning model, which allows you to build
a container image with little knowledge of Docker.
The case study team examines hyperparameter tuning using Vertex Vizier and how it
can be used to improve model performance. To understand more about model
improvement, we dive into a bit of theory: we discuss regularization, dealing with
sparsity, and many other essential concepts and principles. We end with an overview
of prediction and model monitoring and how Vertex AI can be used to manage
ML models.
• Understand the tools required for data management and governance.
• Describe the best approach for data preprocessing: from providing an overview of
Dataflow and Dataprep to using SQL for preprocessing tasks.
• Explain how AutoML, BigQuery ML, and custom training differ and when to use
a particular framework.
• Describe hyperparameter tuning using Vertex Vizier and how it can be used
to improve model performance.
• Explain prediction and model monitoring and how Vertex AI can be used to manage
ML models.
• Describe the benefits of Vertex AI Pipelines.