Product Description

9 Course Modules

Complete all the Modules and achieve the best in Big Data

Projects

Designed to help you practice and apply the skills you learn.

Certificates

Highlight your new skills on your resume or LinkedIn

Course Description

This program will help you understand what insights big data can provide through hands-on experience with the tools and systems. The basic of Hadoop with MapReduce, Pig, Hive, HBase, Spark, and Cassandra will help you to do basic exploration of large, complex datasets. We also provide multiple real world practice projects those projects will help you apply the skills to do professional big data analyses.

Course Outline

Chapter One

Section Outlines

BigData Intro

What is Big data in Application and Systems
The facts about big data
What makes big data valuable
Big Data characteristics-volume, variety, velocity, veracity, valence
Big Data Science processing -acquiring data, exploring data
Programming Models for Big data

Project + Case Study

Data Analysis on Marketing (Sears) data

Chapter Two

Section Outlines

Hadoop Intro

Introduction to Hadoop and Hadoop ecosystem
Compare Hadoop to traditional large-scale system
Core components of Hadoop
Hadoop Master-Slave Architecture
Deal with Data-Storage, processing, analysis and Exploration
Understanding HDFS architecture
Deal with Node-Name node, Data node, Secondary Node
Hadoop backup, recovery and maintenance (administration tasks)

Project + Case Study

Hadoop Installation on Windows, Mac, Linux
Hadoop on cloud, learn benefits of working with Hadoop at cloud, how to install, manage and scale at cloud

Chapter Three

Section Outlines

MapReduce Essential

MapReduce introduction
MapReduce Architecture and Components
Input Splits in MapReduce
Introduce YARN
MapReduce and YARN Workflow-Execution on YARN
General MapReduce Program

Project + Case Study

Algorithm Design －－Search Engine

Chapter Four

Section Outlines

MapReduce and Machine Learning

MapReduce in real-world
Advance MapReduce– counters, distributed Cache, MRunit
Deal with complex MapReduce programs
Design Pattern in MapReduce
MapReduce Pattern – Filtering, Data Organization Patterns
JUnit and MRUnit Testing Frameworks
Machine Learning – with big data, history of predictive modeling, taxonomies
Data Mining Model Evaluation and validation
Statistical Modeling and testing
Classification Algorithms- Naive Bayes
Decision Tree – Induction, construction, overfitting
The K-Means Algorithm.

Project + Case Study

Predictive Design

Chapter Five

Section Outlines

HDFS And Hadoop Management

Basic HDFS commands
Components of HDFS
HDFS Read and Write files
Storage – HDFS architecture, Using HDFS
HDFS -high availability, federation

Project + Case Study

Design HDFS

Chapter Six

Section Outlines

Database I

Introduce to NoSql Database
Introduction, Installation and Configuration about MongoDB
Begin with Cassandra
Understand Cassandra Data Model and Architecture
Reading and Writing Data in Cassandra
Integrating Cassandra with Hadoop
Understand Spark and Big Data
Spark Common Operations
Introduction to Scala
Spark Streaming
GraphX, SparkSQL and Performance in Spark

Project + Case Study

Design a system to replay of transactions in HDFS

Chapter Seven

Section Outlines

Database II

Introduction to Pig, Hive, Hbase
Installation – Pig ,Hive, Hbase
Pig with MapReduce
Pig Latin- relational operators, Join and CoGroup, Group and Union
Hive and HiveQL – Joining Tables, Dynamic Partitioning
Hive querying language and Hive UDFs
Hbase with Java Client API, Java Admin API
Hbase Key Design-storage model, querying, table design

Software Engineer (Big Data) Training

Product Description

9 Course Modules

Projects

Certificates

Course Description

Course Outline

Chapter One

Section Outlines

BigData Intro

Chapter Two

Section Outlines

Hadoop Intro

Project + Case Study

Chapter Three

Section Outlines

MapReduce Essential

Project + Case Study

Chapter Four

Section Outlines

MapReduce and Machine Learning

Project + Case Study

Chapter Five

Section Outlines

HDFS And Hadoop Management

Project + Case Study

Chapter Six

Section Outlines

Database I

Project + Case Study

Chapter Seven

Section Outlines

Database II

Project + Case Study

Chapter Eight

Section Outlines

Case Study

Chapter Nine

Section Outlines

Final Project

REGISTRATION FORM

Take this Course

Reviews

Be the first to review “Software Engineer (Big Data) Training” Cancel reply

Related Products

Big Data Seminar

Big Data Essential Training

u offer