Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Agenda
➢ What Is Artificial Intelligence?
➢ What Is Machine Learning?
➢ Limitations Of Machine Learning
➢ Deep Learning To The Rescue
➢ What Is Deep Learning?
➢ Deep Learning Applications
Agenda
Hadoop Introduction
Hadoop Ecosystem
Hadoop Use-cases
Demo
Hadoop Introduction
Hadoop is a framework that lets us store and process large data sets in a parallel and distributed fashion.
➢ HDFS (Storage): lets us dump any kind of data across the cluster
➢ YARN (Processing): allows parallel processing of the data stored in HDFS
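
The store-then-process-in-parallel idea can be sketched on a single machine. This is a minimal illustration, not Hadoop code: the function names, the block size, and the sample lines are all assumptions, and a thread pool stands in for the worker nodes that YARN would schedule across a real cluster.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_blocks(lines, block_size):
    """Split the input into fixed-size blocks (a stand-in for HDFS blocks)."""
    return [lines[i:i + block_size] for i in range(0, len(lines), block_size)]


def count_words(block):
    """Map step: count the words inside one block."""
    counts = Counter()
    for line in block:
        counts.update(line.lower().split())
    return counts


def parallel_word_count(lines, block_size=2, workers=4):
    """Process every block concurrently, then merge the partial counts."""
    blocks = split_into_blocks(lines, block_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = list(pool.map(count_words, blocks))
    total = Counter()
    for partial in partial_counts:  # reduce step
        total.update(partial)
    return total


# Hypothetical input, invented for illustration.
sample_lines = [
    "hadoop stores data",
    "hadoop processes data",
    "yarn schedules work",
    "hdfs stores blocks",
]
word_totals = parallel_word_count(sample_lines)
```

Each block is counted independently, so adding more workers (or, on a cluster, more nodes) does not change the merged result.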
Hadoop Ecosystem
Use-Cases
➢ Recommendations
➢ Managing Reviews using NLP
➢ ISIS Tweet Network Analysis
Netflix Use-Case
Netflix
Recommendation Engine
➢ 80% of views come from recommendations
➢ Recommendations are driven by machine learning algorithms
➢ Continuous A/B testing
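
To make "recommendations driven by machine learning" concrete, here is a minimal sketch of item-to-item collaborative filtering. It is a generic textbook technique, not Netflix's actual algorithm, and the viewing data, titles, and function names are all invented for illustration.

```python
import math

# Hypothetical viewing history: user -> set of watched titles.
views = {
    "ana": {"Drama A", "Thriller C"},
    "ben": {"Drama A"},
    "cho": {"Comedy B", "Comedy D"},
    "dev": {"Drama A", "Thriller C", "Comedy D"},
}


def viewers_of(title):
    """Return the set of users who watched the given title."""
    return {user for user, watched in views.items() if title in watched}


def similarity(title_a, title_b):
    """Cosine similarity between two titles, based on shared viewers."""
    a, b = viewers_of(title_a), viewers_of(title_b)
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


def recommend(user, top_n=2):
    """Rank unseen titles by their total similarity to what the user watched."""
    watched = views[user]
    catalog = set().union(*views.values())
    scores = {
        candidate: sum(similarity(candidate, seen) for seen in watched)
        for candidate in catalog - watched
    }
    # Sort by score (highest first), breaking ties alphabetically.
    ranked = sorted(scores, key=lambda title: (-scores[title], title))
    return ranked[:top_n]


suggestions = recommend("ben")
```

Here `ben` has only watched "Drama A", so the titles most often co-watched with it rank first. In an A/B test, one group of users would see suggestions from a variant of `recommend` and their engagement would be compared against a control group.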
Transformers
The Item Transformer
➢ Extends Spark ML Transformer
➢ Accepts DMC-12 DataFrame with contextual information
➢ Transforms DataFrame at the item level
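
The transformer pattern described above can be sketched without a Spark cluster. This is only an illustration of the idea: plain lists of dictionaries stand in for a DataFrame, and the class, column, and feature names are hypothetical. In PySpark, a custom transformer would instead subclass `pyspark.ml.Transformer` and implement its `_transform` method.

```python
class Transformer:
    """Minimal stand-in for the transformer idea: rows in, rows out, no fitting."""

    def transform(self, rows):
        raise NotImplementedError


class ItemTransformer(Transformer):
    """Hypothetical item-level transformer: adds one feature to every row."""

    def __init__(self, popularity):
        # item_id -> play count (made-up lookup data)
        self.popularity = popularity

    def transform(self, rows):
        # Build new rows rather than mutating the input, mirroring how
        # DataFrame transformations return a new DataFrame.
        return [
            {**row, "item_popularity": self.popularity.get(row["item_id"], 0)}
            for row in rows
        ]


# Each row pairs contextual information (user, country) with one item.
rows = [
    {"user_id": 1, "country": "US", "item_id": "t1"},
    {"user_id": 1, "country": "US", "item_id": "t2"},
]
features = ItemTransformer({"t1": 120}).transform(rows)
```

Because the transformer only maps rows to rows, several of them can be chained, each adding its own feature columns.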
Processes
➢ Using DataFrames
➢ Multithreaded Model Training
➢ Distributed Model Training
TripAdvisor Use-Case
TripAdvisor
➢ Covers almost all parts of the world
➢ One of the best platforms for hotel reviews
TripAdvisor
1. Dataset Generation
2. Training
3. Application
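
The three stages named on this slide (dataset generation, training, application) can be sketched with a tiny review classifier. This is an assumption-laden illustration: the reviews and labels are invented, and multinomial naive Bayes is a common baseline chosen here for brevity, not necessarily the model TripAdvisor uses.

```python
import math
from collections import Counter

# Stage 1, dataset generation: hypothetical labelled hotel reviews.
dataset = [
    ("clean room and friendly staff", "positive"),
    ("great location and great breakfast", "positive"),
    ("dirty room and rude staff", "negative"),
    ("noisy location and cold breakfast", "negative"),
]


def train(examples):
    """Stage 2, training: count words per label for a naive Bayes model."""
    word_counts = {}
    label_counts = Counter()
    for text, label in examples:
        label_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(text.split())
    vocabulary = {word for counts in word_counts.values() for word in counts}
    return word_counts, label_counts, vocabulary


def classify(text, model):
    """Stage 3, application: pick the label with the highest log-probability."""
    word_counts, label_counts, vocabulary = model
    total_examples = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total_examples)
        label_total = sum(word_counts[label].values())
        for word in text.split():
            if word not in vocabulary:
                continue  # ignore words never seen during training
            # Add-one (Laplace) smoothing avoids log(0) for unseen pairs.
            score += math.log(
                (word_counts[label][word] + 1) / (label_total + len(vocabulary))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(dataset)
prediction = classify("friendly staff and clean room", model)
```

With real data, stage 1 would collect and label far more reviews, and the trained model would be applied to incoming reviews to route or summarise them.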
ISIS Tweet Use-Case
ISIS Tweets
Goals
➢ Social Network Cluster Analysis
➢ Keyword Analysis
➢ Data Categorization of Links
➢ Sentiment Analysis
➢ Timeline View
ISIS Tweet Analysis
Transforming Data
Filtration
Visualizations
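
The transform and filter steps listed here can be sketched as a small keyword-analysis pipeline. The deck does not show its dataset or code, so the tweet records, field names, stopword list, and filtering rules below are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical tweet records, invented for illustration.
tweets = [
    {"user": "a", "text": "Breaking news from the city #news", "lang": "en"},
    {"user": "b", "text": "RT @a: Breaking news from the city #news", "lang": "en"},
    {"user": "c", "text": "Nouvelles de la ville", "lang": "fr"},
    {"user": "a", "text": "More news and analysis today #analysis", "lang": "en"},
]

STOPWORDS = {"the", "from", "and", "rt", "de", "la"}


def keep(tweet):
    """Filtration: keep English tweets that are not retweets."""
    return tweet["lang"] == "en" and not tweet["text"].startswith("RT ")


def tokens(text):
    """Transformation: lowercase, drop mentions, split into words and hashtags."""
    text = re.sub(r"@\w+", "", text.lower())
    return [word for word in re.findall(r"#?\w+", text) if word not in STOPWORDS]


def keyword_counts(items):
    """Count keywords across all tweets that pass the filter."""
    counts = Counter()
    for tweet in filter(keep, items):
        counts.update(tokens(tweet["text"]))
    return counts


counts = keyword_counts(tweets)
top_keyword = counts.most_common(1)[0]
```

The resulting counts are the kind of table a visualization step would consume, for example as a bar chart of the most frequent keywords.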
DEMO
Travel Sector Use-Case
Travel Sector
➢ Find the top 20 most frequently travelled destinations
➢ Find the top 20 locations people travel from
➢ Find the top 20 destinations by air revenue
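
The three questions in this demo are all top-N aggregations. The sketch below answers them in plain Python over a handful of made-up booking records; the tuple layout, airport codes, and function names are assumptions, and on a cluster the same logic would typically be written as group-by queries in a tool such as Hive or Spark.

```python
from collections import Counter, defaultdict

# Hypothetical booking records: (origin, destination, air_revenue).
bookings = [
    ("DEL", "BOM", 120.0),
    ("BLR", "BOM", 95.0),
    ("DEL", "GOI", 200.0),
    ("BOM", "DEL", 110.0),
    ("DEL", "BOM", 130.0),
]


def top_destinations(records, n=20):
    """Most frequently travelled destinations, as (destination, trips) pairs."""
    return Counter(dest for _, dest, _ in records).most_common(n)


def top_origins(records, n=20):
    """Locations people most often travel from, as (origin, trips) pairs."""
    return Counter(origin for origin, _, _ in records).most_common(n)


def top_revenue_destinations(records, n=20):
    """Destinations ranked by total air revenue, highest first."""
    revenue = defaultdict(float)
    for _, dest, amount in records:
        revenue[dest] += amount
    return sorted(revenue.items(), key=lambda pair: pair[1], reverse=True)[:n]


busiest = top_destinations(bookings, n=1)
```

With `n=20` left at its default, each function returns up to twenty rows, matching the "top 20" questions on the slide.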
Data Warehouse
➢ A data warehouse is like a relational database designed for analytical needs.
➢ It functions on the basis of OLAP (Online Analytical Processing).
➢ It is a central location where consolidated data from multiple locations (databases) is stored.
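
The consolidate-then-analyse idea in the bullets above can be sketched with Python's built-in `sqlite3` module. SQLite is not an OLAP engine, so this only illustrates the shape of the workflow; the table, column names, and sales figures are invented for the example.

```python
import sqlite3

# Two hypothetical source systems, invented for illustration.
store_sales = [("2017-01", "north", 100.0), ("2017-02", "north", 150.0)]
web_sales = [("2017-01", "south", 80.0), ("2017-02", "south", 70.0)]

warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE sales (month TEXT, region TEXT, channel TEXT, amount REAL)"
)

# Consolidate both sources into one central fact table.
warehouse.executemany("INSERT INTO sales VALUES (?, ?, 'store', ?)", store_sales)
warehouse.executemany("INSERT INTO sales VALUES (?, ?, 'web', ?)", web_sales)


def revenue_by(dimension):
    """Roll the fact table up along one dimension (an OLAP-style aggregate)."""
    if dimension not in {"month", "region", "channel"}:
        raise ValueError(f"unknown dimension: {dimension}")
    query = (
        f"SELECT {dimension}, SUM(amount) FROM sales "
        f"GROUP BY {dimension} ORDER BY {dimension}"
    )
    return warehouse.execute(query).fetchall()


monthly = revenue_by("month")
```

The same fact table answers different analytical questions simply by changing the dimension, for example `revenue_by("channel")` or `revenue_by("region")`.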


Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
