Data Engineering
Showing 37–48 of 829 results
Analytics Acceleration for Data Lakes
In this session, we will be exploring The Database of Now from MemSQL and show how it provides real-time analytic performance across several data sources with scalable SQL for an integrated hybrid platform.
Analyze Data with SQL
Learn to analyze data with SQL and prepare for technical interviews.
Analyzing Business Data in SQL
Learn to write SQL queries to calculate key metrics that businesses use to measure performance.
Analyzing SQL Server Query Plans
Every database server runs different workloads and queries. You will learn how to analyze query plans and troubleshoot SQL Server performance problems.
Apache Kafka and KSQLDb in Action: Let’s Build a Streaming Data Pipeline
In this talk, we’ll build a streaming data pipeline using nothing but our bare hands, Kafka Connect, and ksqlDB.
Apache Pig 101
Get introduced to Apache Pig, a high-level platform for processing large datasets. Learn Pig Latin scripting to simplify data transformation tasks on Hadoop.
Apache Spark Fundamentals
This course will teach you how to use Apache Spark to analyze your big data at lightning-fast speeds; leaving Hadoop in the dust! For a deep dive on SQL and Streaming check out the sequel, Handling Fast Data with Apache Spark SQL and Streaming.
Applying SQL to Real-World Problems
Find tables, store and manage new tables and views, and write maintainable SQL code to answer business questions.
Applying the Lambda Architecture with Spark, Kafka, and Cassandra
This course introduces how to build robust, scalable, real-time big data systems using a variety of Apache Spark's APIs, including the Streaming, DataFrame, SQL, and DataSources APIs, integrated with Apache Kafka, HDFS and Apache Cassandra.
Approaches to Requirement Analysis for Efficient Data Storage and Processing
Welcome to Approaches to Requirement Analysis for Efficient Data Storage and Processing. This course will teach you how to understand and identify steps to take your data to the right location based on its use case.
Architecting Big Data Solutions Using Google Bigtable
Google Bigtable is a sophisticated NoSQL offering on the Google Cloud Platform with extremely low latencies. By the end of this course, you'll understand why Bigtable is much more powerful offering than HBase, with linear scaling of your data.
Architecting Big Data Solutions Using Google Dataproc
Dataproc is Google’s managed Hadoop offering on the cloud. This course teaches you how the separation of storage and compute allows you to utilize clusters more efficiently purely for processing data and not for storage.