Machine Learning with PySpark for Big Data

₹24,000 + taxes

Learn Spark, the unified analytics engine for large-scale data processing, and its components, with a special focus on Spark MLlib (the Machine Learning library), using Python or Scala. Spark, or PySpark for Python users, is an essential addition to your skill set if you want to process Big Data and apply Machine Learning algorithms at scale.
Learning Mode:  Online (Self-paced)
Certification:  Jigsaw Academy
Course Duration:  3 months
Access Duration:  6 months

What you get

Faculty & Technical Support
Mobile App Access
Case Studies
UChicago Certification
Guaranteed Internships
IBM Certification
In-Person Faculty Support
Placement Support
Capstone Project
Online Classes
Online Q&A Sessions
IoT Hardware Kit
* T&C apply

Tools & Curriculum Covered

Introduction to Spark

Introduction to Python

Resilient Distributed Dataset (RDD) and Operations

Case Study - Bank Transaction Dataset Analysis in PySpark

Case Study - Find the Blockbuster Movie in the Given Dataset in PySpark

Spark SQL and DataFrames

Case Study - Find Average Output in the Given Logfiles Generated

Case Study - New York Stock Exchange Data Analysis in PySpark

Spark Streaming

Case Study - PySpark Streaming Analysis using Apache Kafka

Introduction to Machine Learning

Spark MLlib - Machine Learning Library

Case Study - Network Intrusion Detection with the Classification Model

Case Study - Retail Store Recommendation System using PySpark MLlib

Case Study - Power System Anomaly Detection in PySpark MLlib

Case Study - Wikipedia Data Analysis to Produce Ten Most Relevant Terms in Data

Case Study - Correlation Study of Stocks using US Stock Market Data in PySpark MLlib

Set up and Getting Started

Introduction to Spark

Internals of Spark

Case Study - Bank Transaction Dataset Analysis in Apache Spark

Quiz

Quiz

Spark Architecture

Case Study - Find the Blockbuster Movie in the Given Dataset in Spark

Case Study - Find Average Output in the Given Logfiles Generated

Case Study - New York Stock Exchange Data Analysis in Apache Spark

Quiz

Quiz

Spark Components

Case Study - Analyse the Yelp Academic Dataset in Spark

Case Study - Spark Streaming Stock Market Dataset Analysis using DStream

Case Study - Spark Streaming Analysis using Apache Kafka

SparkR

Spark MLlib

Case Study - Network Intrusion Detection with the Classification Model

Case Study - Retail Store Recommendation System using Spark MLlib

Case Study - Power System Anomaly Detection in Spark MLlib

Case Study - Wikipedia Data Analysis to Produce Ten Most Relevant Terms in Data

Case Study - Correlation Study of Stocks using US Stock Market Data in PySpark MLlib

FAQs

You can start by browsing through our Explore Analytics section. You could also visit our Analytics Training blog for more reading material, reports and in-depth articles on other topics such as data science, machine learning, artificial intelligence, Big Data and IoT. If you’d prefer watching videos, head over to our YouTube channel.

For any of the Big Data courses, past exposure to SQL and to any programming language or programming concepts is desirable. Prior knowledge of Linux is also helpful, but not mandatory. For advanced Big Data courses, exposure to SQL and programming, preferably in Python, is important. Familiarity with Hadoop, specifically Hive, would also be helpful. If you have specific questions regarding eligibility or prerequisites for any program, please contact us at +91 9019217000.

Participants can pay by online transfer, debit card or credit card (Visa, Mastercard and Amex). Foreign students can pay by credit card or PayPal.

Jigsaw Academy has been at the forefront of analytics in India for the last eight years. In that time, the institute has been ranked #1 in the country by Analytics India Magazine multiple times, including for the last two years running. With our expert faculty and carefully designed courses, we have helped students build towards a career in analytics. We also have ongoing collaborations with prestigious institutes such as the University of Chicago and Bocconi University in Milan. In addition to training individuals, we provide high-quality analytics training at the corporate level.