, ,

Big Data with Hadoop and Spark

Duration : 4 Days

This course will introduce developers to the Hadoop ecosystem, focus on multiple programming models including MapReduce, Pig, Hive, and Apache Spark.

SKU: TXBDHPSRK Categories: , ,

Prerequisites

Basic knowledge of programming in Java

Intended audience

Developers who wish to obtain the skills and knowledge to develop big data solutions with Hadoop and Spark.

Objectives

• Developing MapReduce programs in Java
• Analyzing Data using Pig and Hive
• Distributed programming with Spark
• Using Machine learning algorithms

Course Outline

Hadoop Introduction
• Big Data Analytics
• What is and Why Hadoop
• Comparing Hadoop with Other Technologies
• Hadoop Architecture
• Hadoop Ecosystem
• Hadoop Usage Examples

Hadoop Installation
• Hadoop Operational Modes
• Hadoop Installation Options

Hadoop HDFS
• What Is and Why HDFS
• HDFS Architecture
• HDFS Features
• HDFS Commands
• HDFS Web UI
• Hue Web UI

Hadoop MapReduce
• Hadoop MapReduce Operational Architecture
• MapReduce General Concepts
• Hadoop MapReduce Programming
• Hadoop MapReduce Framework
• Hadoop MapReduce Example: Word Count
• Executing Hadoop MapReduce Application
• Input File and Input Format

Hadoop Pig
• What Is and Why Pig?
• Executing Pig
• Pig Latin Concepts
• Pig Latin Script: Operators and Functions
• Pig Latin Script: Structure
• Pig Latin Relational Operators
• Pig Examples

Hadoop Hive
• What Is and Why Hive?
• Hive Architecture
• HiveQL
• Physical Layout
• Loading Data into Hive Tables
• Partitions
• Joining
• Buckets

Spark Workshop
• Why Spark?
• Basic Concepts
• Running on Clusters
• Spark SQL
• Spark Streaming

Reviews

There are no reviews yet.

Be the first to review “Big Data with Hadoop and Spark”

Your email address will not be published. Required fields are marked *