Big Data and Hadoop

Managing Big Data using Hadoop tools like MapReduce, Hive, Pig, HBase and m

Instructed by Saheb Singh

It Includes
  • Get access to this course only
  • Lifetime Course Access
  • Play & Pause Videos
  • Get Certificate of Completion
  • High Quality Recorded Lectures
  • Learn Online from Mobile/PC/Tablet
  • Download Course for Offline Viewing
  • Includes Real Projects
  • Free Instructor Support

In the late 1990s, developers and programmers generated data through coding. In the late 2000s, everyone on social media began generating data on Facebook, Twitter, Instagram and so on. These days machines generate data as well. Together this creates a huge volume of data that traditional databases cannot handle easily. After completing this course you will be able to handle it using Hadoop as your platform, and you will also be able to pass the Cloudera CCA 175 certification.

This course is an interactive lecture series from one of my Big Data and Hadoop classes, where everything is covered from scratch. You will also see students asking questions, so you can clear up the same doubts here as well.

Students will be able to pass the Cloudera CCA 175 certification after successfully completing the course and with a little practice.

Tools covered :

1. Sqoop

2. Flume

3. MapReduce

4. Hive

5. Impala

6. Beeline

7. Apache Pig

8. HBase


10. Project on a real data set.

  • A laptop with at least 6 GB of RAM and 100 GB of free hard disk space
  • Students who want to step into Big Data and learn how to analyse, work with and manage it.

Section 1 : All about BIG DATA

  • Lecture 1 :
  • Big Data and Hadoop Introduction
  • Lecture 2 :
  • Hadoop framework
  • Lecture 3 :
  • Hadoop Ecosystem
  • Lecture 4 :
  • HDFS
  • Lecture 5 :
  • Magic Boxes, Sqoop and Flume
  • Lecture 6 :
  • NameNode, DataNode, JournalNode
  • Lecture 7 :
  • Input/output operations, RAM and HDD, pros and cons
  • Lecture 8 :
  • MapReduce Theory 1.1
  • Lecture 9 :
  • MapReduce Theory 1.2
  • Lecture 10 :
  • MapReduce Theory 1.3
  • Lecture 11 :
  • Combiner Approach in MapReduce
  • Lecture 12 :
  • Coding : Sqoop with SQL
  • Lecture 13 :
  • Visit to Cloudera Machine
  • Lecture 14 :
  • Sqoop commands with introduction to Linux commands as well
  • Lecture 15 :
  • Sqoop commands
  • Lecture 16 :
  • Basics of core Java, introduction to Eclipse, MapReduce coding
  • Lecture 17 :
  • Coding : MapReduce
  • Lecture 18 :
  • Hive Theory
  • Lecture 19 :
  • Hive: connecting, loading, defining delimiters
  • Lecture 20 :
  • Coding : Hive
  • Lecture 21 :
  • Hive to Impala and Beeline
  • Lecture 22 :
  • Hive : Partitioning
  • Lecture 23 :
  • Hive Bucketing
  • Lecture 24 :
  • Lecture 25 :
  • Lecture 26 :
  • Assignment on DataNodes
  • Introduction

    Let's assume that you have 100 TB of data to store and process with Hadoop. The configuration of each available DataNode is as follows:

      • 8 GB RAM
      • 10 TB HDD
      • 100 MB/s read-write speed

    You have a Hadoop cluster with replication factor = 3 and block size = 64 MB. In this case, the number of DataNodes required to store the data would be:

      • Total amount of data * Replication factor / Disk space available on each DataNode
      • 100 * 3 / 10
      • 30 DataNodes

    Now, let's assume you need to process this 100 TB of data using MapReduce. Reading 100 TB of data at a speed of 100 MB/s using only 1 node would take:

      • Total data / Read-write speed
      • 100 * 1024 * 1024 / 100
      • 1048576 seconds
      • 291.27 hours

    So, with 30 DataNodes you would be able to finish this MapReduce job in:

      • 291.27 / 30
      • About 9.71 hours

    1. Problem Statement

    How many such DataNodes would you need to read 100 TB of data in 5 minutes in your Hadoop cluster?
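
The arithmetic in this assignment can be sketched in a few lines of Python. This is only an illustration, not part of the course material: the function names are made up for this example, 1 TB is taken as 1024 * 1024 MB as in the figures above, and the read is assumed to split evenly across nodes that all work fully in parallel, which a real cluster will only approximate.

```python
import math

# Binary convention used in the assignment's arithmetic: 1 TB = 1024 * 1024 MB.
MB_PER_TB = 1024 * 1024


def storage_nodes(data_tb, replication_factor, disk_tb_per_node):
    """DataNodes needed to hold every replica of the data set."""
    return math.ceil(data_tb * replication_factor / disk_tb_per_node)


def single_node_read_seconds(data_tb, speed_mb_per_s):
    """Seconds for one node to read the whole data set on its own."""
    return data_tb * MB_PER_TB / speed_mb_per_s


def nodes_for_deadline(data_tb, speed_mb_per_s, deadline_s):
    """Nodes needed to finish the read within deadline_s seconds.

    Assumes (hypothetically) that the work divides evenly and that
    every node reads at full speed in parallel.
    """
    total_s = single_node_read_seconds(data_tb, speed_mb_per_s)
    return math.ceil(total_s / deadline_s)


if __name__ == "__main__":
    nodes = storage_nodes(100, 3, 10)
    one_node_hours = single_node_read_seconds(100, 100) / 3600
    print("DataNodes for storage:", nodes)
    print("Hours on one node:", round(one_node_hours, 2))
    print("Hours on", nodes, "nodes:", round(one_node_hours / nodes, 2))
```

Calling `nodes_for_deadline(100, 100, 5 * 60)` applies the same reasoning to the problem statement, under the idealised assumptions listed above.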
  • How do I access the course after purchase?

    Once you purchase a course (Single course or Subscription), you will be able to access the courses instantly online by logging into your account. Use the user name & password that you created while signing up. Once logged in, you can go to the "My Courses" section to access your course.
  • Are these video based online self-learning courses?

    Yes. All of the courses come with online video-based lectures created by certified instructors. Instructors have crafted these courses with a blend of high-quality interactive videos, lectures, quizzes & real-world projects to give you in-depth knowledge of the topic.
  • Can I play & pause the course as per my convenience?

    Yes, absolutely, & that's one of the advantages of self-paced courses. You can pause or resume the course at any time, move back & forth from one lecture to another, play the videos multiple times & so on.
  • How do I contact the instructor for any doubts or questions?

    Most of these courses have general questions & answers already covered within the course lectures. However, if you need any further help from the instructor, you can use the built-in Chat with Instructor option to send a message to the instructor & they will reply to you within 24 hours. You can ask as many questions as you want.
  • Do I need a PC to access the course or can I do it on mobile & tablet as well?

    Brilliant question, isn't it? You can access the courses on any device like a PC, mobile, tablet & even on a smart TV. For a mobile or a tablet you can download the Learnfly Android or iOS app. If the mobile app is not available in your country, you can access the course directly by visiting our website, which is fully mobile friendly.
  • Do I get any certification after completing the course?

    Yes. Once you successfully complete any course on the Learnfly marketplace, you get a certificate of course completion emailed to you within 24 hours with your name & the Learnfly badge. You can definitely brag about it & share it on your social media or with friends as one of your achievements. Click here to view the sample certificate.
  • For how long can I access my course after the purchase?

    If you buy a single course, that course is accessible to you for a lifetime. If you go for a premium subscription, you can access all the courses on the Learnfly marketplace for as long as your subscription is active.
  • What's the difference between the Single Course Purchase & Go Premium options?

    With Single Course Purchase, you only get access to one single course, whereas with a premium monthly or annual subscription you can access all the existing and new courses on the Learnfly marketplace. You can decide which option suits you best & make your purchase accordingly.
  • Is there any free trial?

    Currently, we don't have a free trial, but one may be available in the near future.
  • What is the refund policy?

    We would hate to see you leave. However, if you are not satisfied, you can ask for a full refund within 30 days & we will be happy to assist you further.

Saheb Singh

Hello guys, how are you all doing? A quick introduction: I am a Big Data expert and data scientist with 3 years of experience in these domains, both in companies and as a trainer. My teaching method is quite different from others'. I am not a slide reader; I use analogies and examples a lot to explain things and try to be as practical as possible, which you'll see in the course. Hope to see you all in the lectures. Good luck!