Data Engineering
Showing 277–288 of 758 results
Getting Started with the Databricks Lakehouse Platform
This course will teach you how the Data Lakehouse architecture brings you the best of both Data Lakes and Data Warehouses allowing you to meet your data needs for big data processing, SQL analytics, and machine learning in a single platform.
Getting Started with Your First SQL Server Instance
This course will teach you the basics of SQL Server architecture, installation, tools, and configuration in order to help you understand SQL Server administration.
Google Cloud Big Data and Machine Learning Fundamentals
This course introduces the Google Cloud big data and machine learning products and services that support the data-to-AI lifecycle.
Google Cloud Platform Big Data and Machine Learning Fundamentals
This course introduces participants to the big data capabilities of Google Cloud. Through a combination of presentations, demos, and hands-on labs, participants get an overview of Google Cloud and a detailed view of the data processing and machine learning capabilities. This course showcases the ease, flexibility, and power of big data solutions on Google Cloud.
GoogleCloud: Google Cloud Computing Foundations: Data, ML, and AI in Google Cloud
This final course in the Google Cloud Computing Foundations Professional Certificate reviews managed big data services, machine learning and its value, and how to demonstrate your skill set in Google Cloud.
GoogleCloud: Modernizing Data Lakes and Data Warehouses with Google Cloud
This course is intended for developers who are responsible for: Querying datasets, visualizing query results, and creating reports. Specific job roles include: Data Engineer, Data Analyst, Database Administrators, Big Data Architects.
Handling Batch Data with Apache Spark on Databricks
This course will teach you how to transform and aggregate batch data using Apache Spark on the Azure Databricks platform using selection, filter, and aggregation queries, built-in and user-defined functions, and perform windowing and join operations on batch data.
Handling Streaming Data with Apache Pulsar 2
Apache Pulsar is a highly scalable, high throughput system that handles both queuing as well as streaming data with incredible ease. This course will teach you all the necessary concepts and tools to adopt Apache Pulsar to your projects.
Handling Streaming Data with AWS Kinesis Data Analytics Using Java
Kinesis Data Analytics is a service to transform and analyze streaming data with Apache Flink and SQL using serverless technologies. You'll learn to use the Amazon Kinesis Data Analytics service to process streaming data using Apache Flink runtime.
Handling Streaming Data with Azure Databricks Using Spark Structured Streaming
In this course, you will deep-dive into Spark Structured Streaming, see its features in action, and use it to build end-to-end, complex & reliable streaming pipelines using PySpark. And you will be using Azure Databricks platform to build & run them.
Handling Streaming Data with GCP Dataflow
Dataflow is a serverless, fully-managed service on the Google Cloud Platform for batch and stream processing.
HarvardX: CS50’s AP® Computer Science Principles
This is CS50 AP, Harvard University's introduction to the intellectual enterprises of computer science and the art of programming for students in high school, which satisfies the College Board's new AP CS Principles curriculum framework.