Data Engineering
Showing 37–48 of 758 results
Apache Kafka and KSQLDb in Action: Let’s Build a Streaming Data Pipeline
In this talk, we’ll build a streaming data pipeline using nothing but our bare hands, Kafka Connect, and ksqlDB.
Apache Spark Fundamentals
This course will teach you how to use Apache Spark to analyze your big data at lightning-fast speeds; leaving Hadoop in the dust! For a deep dive on SQL and Streaming check out the sequel, Handling Fast Data with Apache Spark SQL and Streaming.
Applying SQL to Real-World Problems
Find tables, store and manage new tables and views, and write maintainable SQL code to answer business questions.
Applying the Lambda Architecture with Spark, Kafka, and Cassandra
This course introduces how to build robust, scalable, real-time big data systems using a variety of Apache Spark's APIs, including the Streaming, DataFrame, SQL, and DataSources APIs, integrated with Apache Kafka, HDFS and Apache Cassandra.
Approaches to Requirement Analysis for Efficient Data Storage and Processing
Welcome to Approaches to Requirement Analysis for Efficient Data Storage and Processing. This course will teach you how to understand and identify steps to take your data to the right location based on its use case.
Architecting Big Data Solutions Using Google Bigtable
Google Bigtable is a sophisticated NoSQL offering on the Google Cloud Platform with extremely low latencies. By the end of this course, you'll understand why Bigtable is much more powerful offering than HBase, with linear scaling of your data.
Architecting Big Data Solutions Using Google Dataproc
Dataproc is Google’s managed Hadoop offering on the cloud. This course teaches you how the separation of storage and compute allows you to utilize clusters more efficiently purely for processing data and not for storage.
Architecting Schemaless Scalable NoSQL Databases Using Google Datastore
This course is about Datastore, a schemaless, serverless NoSQL service that fills a specific niche on the GCP. Datastore offers fast lookups virtually independent of the dataset size and is optimized for hierarchical queries on document data.
Architecting Serverless Big Data Solutions Using Google Dataflow
Dataflow represents a fundamentally different approach to Big Data processing than computing engines such as Spark. Dataflow is serverless and fully-managed, meaning that provisioning resources and scaling can be transparent to the data architect.
Architecting the Global Real-time Fraud Prevention with a Performant Data Platform
In this session, Nick Blievers will discuss risk-based authentication leveraging digital identities, real-time customer trust decisions, selecting the right high performance data platform, and machine learning powered at the data layer.
AutoCAD 2017 Essentials: Rendering Interior and Exterior Scenes
This AutoCAD rendering course will teach you rendering interior and exterior scenes with new AutoCAD tools and enhancements. Software Required: AutoCAD 2017.
Automate Data Pipelines
Learn techniques for automating data pipelines for faster data processing.