Top Spark Training In Bangalore - Alice Springs

Friday, 16 October 2015

Item details

City: Alice Springs, Northern Territory
Offer type: Offer

Contacts

Contact name Nithin
Phone 9164161200

Item description

Spark Training
Prerequisite:
Candidate attending the training should have a basic knowledge on Java or scala.
Big Data Conceptuals
What is Big Data ?
The need for Big Data
Why Big Data now ?
Myths of Big Data
Tabular representation of data unit measurement.
Is one petabyte big data ?
Types of Architectures in Big Data
What is a CAP theorem ?
Problems with large-scale systems
HDFS
Why HDFS ?
HDFS Architecture
Spark
JDK 8 - Quick Introduction
Functional Programming with Java
Lambda expressions and Functional Interfaces in Java
Core Spark
Introduction to Apache Spark
What is Spark ? Explain about the modules in spark
Spark-Shell - scala and python REPL
Spark Internals - The Driver program, master, workers, executors and the tasks
Running spark in a standalone mode
Spark UI and monitoring a job
RDD
What is an RDD ?
Laziness in RDD Evaluation
Different ways of creating an RDD
Types of RDD’s - PairRDD, DoubleRDD
Partitions - The core of RDD
Operations in Spark
Spark Configuration and the Spark Context
RDD Operations - Transformation and Actions
Map, filter, distinct, collect, take operations
Storage levels supported in spark
Caching and persistence
Programming with a partition and use of custom partitioners
Accumulators and Broadcast variables
Checkpointing an RDD
Spark deployment plans
SparkSQL
The DataFrame Abstraction
Elucidate on SparkSQL
Spark Streaming
Kafka and the need
Reading and writing data to kafka cluster
Basic read from a socket
Spark Streaming from kafka
Developing streaming applications
Spark Machine Learning
Decision Trees
Linear Regression
Bayesian Classification
Spark Performance Tuning
Various strategies to adopt to performance tune your spark application
Project: A live project of how each of the API’s are used in the industry.
Note to participants:
All content in this course will be a hands-on session.
All slides of the course will be given to candidates.
Source code of all examples tried out in the session will be provided.