Developing Spark Applications Using Scala & Cloudera
Apache Spark is one of the fastest and most efficient general engines for large-scale data processing. In this course, you’ll learn how to develop Spark applications for your Big Data using Scala and a stable Hadoop distribution, Cloudera CDH.
At the core of working with large-scale datasets is a thorough knowledge of Big Data platforms like Apache Spark and Hadoop. In this course, Developing Spark Applications Using Scala & Cloudera, you’ll learn how to process data at scales you previously thought were out of your reach. First, you’ll learn all the technical details of how Spark works. Next, you’ll explore the RDD API, the original core abstraction of Spark. Then, you’ll discover how to become more proficient using Spark SQL and DataFrames. Finally, you’ll learn to work with Spark’s typed API: Datasets. When you’re finished with this course, you’ll have a foundational knowledge of Apache Spark with Scala and Cloudera that will help you as you move forward to develop large-scale data applications that enable you to work with Big Data in an efficient and performant way.
Author Name: Xavier Morera
Author Description:
Xavier Morera is driven by one passion: taking on the challenge of understanding complex topics and sharing that knowledge with others. He’s currently focused on the transformative fields of AI, machine learning, generative AI, search, and big data. As an entrepreneur, project manager, technical author, and trainer, Xavier brings a diverse set of skills and deep expertise to every project he takes on. He holds multiple certifications with Cloudera, Microsoft, and the Scrum Alliance and has been… more
Table of Contents
- Course Overview
2mins - Why Spark with Scala and Cloudera?
13mins - Getting an Environment and Data: CDH + StackOverflow
34mins - Refreshing Your Knowledge: Scala Fundamentals for This Course
24mins - Understanding Spark: An Overview
27mins - Getting Technical with Spark
45mins - Learning the Core of Spark: RDDs
42mins - Going Deeper into Spark Core
47mins - Increasing Proficiency with Spark: DataFrames and Spark SQL
37mins - Continuing the Journey on DataFrames and Spark SQL
35mins - Working with a Typed API: Datasets
19mins - Final Takeaway and Continuing the Journey with Spark
11mins
There are no reviews yet.