×

Spark Fundamentals I

Add to wishlistAdded to wishlistRemoved from wishlist 0
Add to compare+
level

Intermediate

Rating

4.4

Review

1.97k+ Reviews

Enrolled

23k+ Enrolled

Dive into Apache Spark basics. Learn how this distributed computing framework handles big data processing efficiently, with an introduction to its key concepts and operations.

Add your review

At a Glance

Ignite your interest in Spark with an introduction to the core concepts that make this general processor an essential tool set for working with Big Data.


About This Course
Learn the fundamentals of 

Spark, the technology that is revolutionizing the analytics and big data world! Spark is an open source processing engine built around speed, ease of use, and analytics. If you have large amounts of data that requires low latency processing that a typical MapReduce program cannot provide, Spark is the way to go. 
  • Learn how it performs at speeds up to 100 times faster than Map Reduce for iterative algorithms or interactive data mining.
  • Learn how it provides in-memory cluster computing for lightning fast speed and supports Java, Python, R, and Scala APIs for ease of development.
  • Learn how it can handle a wide range of data processing scenarios by combining SQL, streaming and complex analytics together seamlessly in the same application.
  • Learn how it runs on top of Hadoop, Mesos, standalone, or in the cloud. It can access diverse data sources such as HDFS, Cassandra, HBase, or S3.

Course Syllabus
  • Module 1 – Introduction to Spark – Getting started

    1. What is Spark and what is its purpose?
    2. Components of the Spark unified stack
    3. Resilient Distributed Dataset (RDD)
    4. Downloading and installing Spark standalone
    5. Scala and Python overview
    6. Launching and using Spark’s Scala and Python shell ©
  • Module 2 – Resilient Distributed Dataset and DataFrames

    1. Understand how to create parallelized collections and external datasets
    2. Work with Resilient Distributed Dataset (RDD) operations
    3. Utilize shared variables and key-value pairs
  • Module 3 – Spark application programming

    1. Understand the purpose and usage of the SparkContext
    2. Initialize Spark with the various programming languages
    3. Describe and run some Spark examples
    4. Pass functions to Spark
    5. Create and run a Spark standalone application
    6. Submit applications to the cluster
  • Module 4 – Introduction to Spark libraries

    1. Understand and use the various Spark libraries
  • Module 5 – Spark configuration, monitoring and tuning

    1. Understand components of the Spark cluster
    2. Configure Spark to modify the Spark properties, environmental variables, or logging properties
    3. Monitor Spark using the web UIs, metrics, and external instrumentation
    4. Understand performance tuning considerations

General Information
  • This course is self-paced.
  • It can be taken at any time.
  • It can be audited as many times as you wish.

Recommended skills prior to taking this course
  • Basic understanding of Apache Hadoop and Big Data.
  • Basic Linux Operating System knowledge.
  • Basic understanding of the Scala, Python, R, or Java programming languages.

Requirements

Course Staff


Henry L. Quach
Henry L. Quach is the Technical Curriculum Developer Lead for Big Data. He has been with IBM for 9 years focusing on education development. Henry likes to dabble in a number of things including being part of the original team that developed and designed the concept for the IBM Open Badges program. He has a Bachelor of Science in Computer Science and a Master of Science in Software Engineering from San Jose State University.


 


Alan Barnes
Alan Barnes is a Senior IBM Information Management Course Developer / Consultant. He has worked in several companies as a Senior Technical Consultant, Database Team Manager, Application Programmer, Systems Programmer, Business Analyst, DB2 Team Lead and more. His career in IT spans more than 35 years.

User Reviews

0.0 out of 5
0
0
0
0
0
Write a review

There are no reviews yet.

Be the first to review “Spark Fundamentals I”

Your email address will not be published. Required fields are marked *

Spark Fundamentals I
Spark Fundamentals I
Edcroma
Logo
Compare items
  • Total (0)
Compare
0
https://login.stikeselisabethmedan.ac.id/produtcs/
https://hakim.pa-bangil.go.id/
https://lowongan.mpi-indonesia.co.id/toto-slot/
https://cctv.sikkakab.go.id/
https://hakim.pa-bangil.go.id/products/
https://penerimaan.uinbanten.ac.id/
https://ssip.undar.ac.id/
https://putusan.pta-jakarta.go.id/
https://tekno88s.com/
https://majalah4dl.com/
https://nana16.shop/
https://thamuz12.shop/
https://dprd.sumbatimurkab.go.id/slot777/
https://dprd.sumbatimurkab.go.id/
https://cctv.sikkakab.go.id/slot-777/
https://hakim.pa-kuningan.go.id/
https://hakim.pa-kuningan.go.id/slot-gacor/
https://thamuz11.shop/
https://thamuz15.shop/
https://thamuz14.shop/
https://ppdb.smtimakassar.sch.id/
https://ppdb.smtimakassar.sch.id/slot-gacor/
slot777
slot dana
majalah4d
slot thailand
slot dana
rtp slot
toto slot
slot toto
toto4d
slot gacor
slot toto
toto slot
toto4d
slot gacor
tekno88
https://lowongan.mpi-indonesia.co.id/
https://thamuz13.shop/
https://www.alpha13.shop/
https://perpustakaan.smkpgri1mejayan.sch.id/
https://perpustakaan.smkpgri1mejayan.sch.id/toto-slot/
https://nana44.shop/
https://sadps.pa-negara.go.id/
https://sadps.pa-negara.go.id/slot-777/
https://peng.pn-baturaja.go.id/
https://portalkan.undar.ac.id/
https://portalkan.undar.ac.id/toto-slot/
https://penerimaan.ieu.ac.id/
https://sid.stikesbcm.ac.id/