Course Detail

Big Data - Hadoop

Big Data - Hadoop - GAMAKA AI SOLUTIONS


Course Detail


Course Description

Big Data - Hadoop & Spark (AWS) (Python Track)

Mode: Online / Offline (Classroom) / Blended

*Prerequisite: Knowledge of Python, Machine Learning, SQL

Research suggests that by the end of 2021 India alone will face a shortage of about two lac data scientists. The probable growth of Big Data in India is because of the awareness of the benefits that insights from unstructured data can impact businesses.
Jobs for Hadoop developers in on the rise as organizations from different verticals such as e-commerce, retail, automobile, telecom are adopting analytics to gain an advantage over their competitors.
Data volumes will continue to increase and with such an exponential increase in usage of data analytics, the Global Big Data Market will grow with an anticipated CAGR of 18.68% during the forecast period & will reach revenue of $183.62 billion by 2027.
The average salary for Big Data Hadoop Analysts ranges from $68,465 to $138,808

Program Structure

  • Big data introduction
    • What is big data?
    • V’s of Big data
    • (Volume,Velocity,Variety,Veracity)
    • Data types
    • Distributed System
    • Single system vs distributed system
    • Solution for Big data : Hadoop
  • Hadoop core components
    • Diff v1 &v2
    • Overview of Hadoop eco system
    • Map reduce
  • Introduction to AWS & Cloud
    • Cloud computing
    • AWS basics
    • AWS services
    • Setting up AWS freetier Account
    • big data computation on AWS
    • Access Permissions with S3
    • SQL vs. NoSQL Databases
    • Databases and Big Data on AWS
    • Working on EMR with Hive
  • Spark overview
    • Spark Architecture
    • RDD
    • Ml lib
    • Linear Regression on spark
    • logistic regression on spark 
    • decision tree on spark
    • naive bayers on spark
    • Xgboost On Spark
  • AWS ML tools
    • Amazon Sagemaker

Duration: 2 Months / 30+ hours

Blended – Combination of Online & Offline(classroom)

Projects/Case Studies

Any 2 Case Studies (T & C apply)

Project 01: Working with MapReduce, Hive and Sqoop

Industry: General

Problem Statement: How to successfully import data using Sqoop into HDFS for data analysis

Topics: As part of this project, you will work on the various Hadoop components like MapReduce, Apache Hive and Apache Sqoop. You will have to work with Sqoop to import data from relational database management system like MySQL data into HDFS. You need to deploy Hive for summarizing data, querying and analysis. You have to convert SQL queries using HiveQL for deploying MapReduce on the transferred data. You will gain considerable proficiency in Hive and Sqoop after the completion of this project.

Highlights:

  • Sqoop data transfer from RDBMS to Hadoop
  • Coding in Hive Query Language
  • Data querying and analysis

Project 02: Work on MovieLens data for finding the top movies

Project 03: Hadoop YARN Project; End-to-end PoC

Project 04: Table Partitioning in Hive

Project 05: Connecting Pentaho with Hadoop Ecosystem

Project 06: Multi-node Cluster Setup

Project 07: Hadoop Testing Using MRUnit

Project 08: Hadoop Web Log Analytics

Project 09: Hadoop Maintenance

Project 10: Twitter Sentiment Analysis

Project 11: Analyzing IPL T20 Cricket

Advantages of joining Gamaka AI

  • Instructor led online & classroom interactive sessions
  • One-To-One online problem-solving sessions
  • Complete Soft Copy of Notes & Latest Interview Preparation Set
  • Trainers are working IT professional with top IT MNC’s
  • 100% Placement Assistance
  • Resume Building & Mock Interview Sessions
  • 100% Hands-on Training with Live Projects/Case Studies
  • Internship & Course Completion Certificate
  • 1 Year free subscriptions to portal for updated guides, notes, poc, projects & interview preparation set.
  • Extensive training programs with Recorded Sessions
  • 24*7 Support on enquiry@gamakaai.com

Download Brochure

Download PDF

Fee Structure

BIG DATA HADOOP

Fees: ₹20,000/-

50% OFF

₹ 15,500/-

2 installments

₹ 7,500/-
(10 days gap)
Down Payment 

₹ 14,000/-

BIG DATA HADOOP + DATA SCIENCE WITH PYTHON

Fees: ₹60,000/-

60% OFF

₹ 24,500/-

2 installments

₹ 12,000/-
(10 days gap)
Down Payment 

₹ 23,000/-

BIG DATA HADOOP + DATA SCIENCE WITH PYTHON + TABLEAU

Fees: ₹80,000/-
60% OFF

₹ 32,500/-

2 installments

 16,000/-

(10 days gap)

Down Payment 

₹ 30,000/-

  • Registration – ₹ 500/-
  • Weekdays & Weekends Batches – Flexible Timings

Will I get certified?

Upon successful completion of this data science course, you’ll earn a Certificate. The certificate adds the required weight in any portfolio.

Internship Certificate

This certificate will be issued to those pursuing internships with our development team or clients with whom we have tie-ups. Data Science Internship gives opportunity to learn from professionals, gain practical experience in this field, and build a robust professional network.

Enquire Now





All Courses

Blogs

PYTHON BASICS INTERVIEW QUESTIONS

By Admin

Take Your Career on Pinnacle with Job Oriented Professional Courses

By Admin

Why Python is a Necessity for the IT Sector and Its Advantages

By Admin

Institute Overview

Pune, Maharashtra, India

Why GAMAKA AI?     GAMAKA AI SOLUTIONS is an advanced training center which conducts various professional training courses for various technologies. Institute is led by experienc... Read More

Related Courses

Google Map