Data Engineering with AWS Machine Learning
The whole field of machine learning revolves around data. This course will teach you how to properly choose between the various AWS data repositories, ingestion services, and transformation services in a cost-effective, best-practice manner.
Storing data for machine learning is challenging due to the varying formats and characteristics of data. Raw ingested data must first be transformed into the format necessary for downstream machine learning consumption, and once the data is ready to be used, it must be ingested from storage to the machine learning service. In this course, Data Engineering with AWS Machine Learning, you’ll learn to choose the right AWS service for each of these data-related machine learning ML tasks for any given scenario. First, you’ll explore the wide variety of data storage solutions available on AWS and what each type of storage is used for. Next, you’ll discover the differing AWS services used to ingest data into ML-specific services and when to use each one. Finally, you’ll learn how to transform your raw data into the proper formats used by the various AWS ML services. When you’re finished with this course, you’ll have the skills and knowledge of how to properly provide data solutions for storing, preparing, and ingesting data needed to architect data engineering solutions on AWS for Machine Learning, and be prepared to take the AWS Machine Learning Certification exam.
Author Name: Kim Schmidt
Author Description:
Kim Schmidt has been working in the technology industry for more than 12 years with an extremely broad array of titles and using very diverse technologies. She holds many industry certifications and awards. Companies Kim has worked for or with include Microsoft, Dun & Bradstreet, Google, Amazon Web Services, and a couple of Augmented Reality companies. Kim is the Founder and CEO of DataLeader, an AWS Partner and Vendor company. DataLeader focuses on AWS Big Data Architecture, Data Solutions,… more
Table of Contents
- Course Overview
1min - Important Data Characteristics to Consider in a Machine Learning Solution
8mins - Typical Data Flow for Machine Learning on AWS
8mins - Data Storage Options for Machine Learning on AWS
22mins - Database Options for Machine Learning on AWS
13mins - Using a Data Warehouse or a Data Lake as a Machine Learning Repository
18mins - Streaming Data Ingestion Solutions on AWS for Machine Learning
23mins - Batch Data Ingestion Solutions on AWS for Machine Learning
10mins - Data Transformation Overview on AWS for Machine Learning
10mins - Data-driven Workflows: The AWS Data Pipeline
6mins - Data Transformation Using Apache Spark on Amazon EMR
10mins - Data Transformation Using Serverless AWS Glue and Serverless Amazon Athena
40mins
There are no reviews yet.