SlideShare a Scribd company logo
3
Most read
9
Most read
11
Most read
Apache Sqoop Tutorial | Sqoop: Import & Export Data From MySQL To HDFS | Hadoop Training | Edureka
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Topics to be covered…
Problems with RDBMS
Need for Sqoop
Introduction to Sqoop
Features of Sqoop
Sqoop Architecture
Sqoop Commands & Demo using MySQL
1
2
3
4
5
6
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Problems with Relational Database
Data Importing
was Tedious
task
Difficult to
handle large
datasets
Cant store
Unstructured Data
Time Consuming
Task
RDBMS
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Need for Sqoop
Data Analysis
Command Line Interface
High Performance
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
What is Sqoop?
RDBMS HDFS
Import
Tool used to transfer bulk data between HDFS & Relational Database Servers
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
What is Sqoop?
RDBMS
HDFS
Export
Tool used to transfer bulk data between HDFS & Relational Database Servers
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Features of Sqoop
Full Load
Incremental
Load
Parallel
Import/Export
Compression
Kerberos Security
Integration
Data loading
directly to HIVE
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Sqoop Architecture
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
How Sqoop Import & Export Works?
Sqoop Commands
IMPORT
EXPORT
LIST DATABASE
LIST TABLES
CODEGEN
Sqoop Import Command
Import
Import each table of the RDBMS in Hadoop
$ sqoop import 
--connect jdbc:mysql://<ip address>/<database name>
--table <mysql_table name>
--username <username_for_mysql_user> --password <Password>
--target-dir <target directory where data needs to be imported>
IMPORT
EXPORT
LIST DATABASE
LIST TABLES
CODEGEN
Sqoop Export Command
Exports data from HDFS to RDBMS
Export
$ sqoop export 
--connect jdbc:mysql://<ip address>/<database name>
--table <mysql_table name>
--username <username_for_mysql_user> --password <Password>
--export-dir <directory name where data needs to be exported>
IMPORT
EXPORT
LIST DATABASE
LIST TABLES
CODEGEN
Sqoop List Database
It lists the databases present in the relational database
Sqoop
$ sqoop list databases
--connect jdbc:mysql://<ip address>/<database name>
--table <mysql_table name>
--username <username_for_mysql_user> --password <Password>
IMPORT
EXPORT
LIST DATABASE
LIST TABLES
CODEGEN
Sqoop List Tables
ID Name Age Address
1 Jino 25 Kottayam
2 Neha 22 Dharwad
3 Chaitra 23 Bangalore
It lists the available tables in the database
$ sqoop list tables
--connect jdbc:mysql://<ip address>/<database name>
--table <mysql_table name>
--username <username_for_mysql_user> --password <Password>
IMPORT
EXPORT
LIST DATABASE
LIST TABLES
CODEGEN
Sqoop Codegen
Generates DAO
Class automatically
Generates Java
Class file
Source code can be
recreated
$ sqoop codegen
--connect jdbc:mysql://<ip address>/<database name>
--table <mysql_table name>
--username <username_for_mysql_user> --password <Password>
Apache Sqoop Tutorial | Sqoop: Import & Export Data From MySQL To HDFS | Hadoop Training | Edureka
Ad

Recommended

Apache Spark Introduction
Apache Spark Introduction
sudhakara st
 
Sqoop
Sqoop
Prashant Gupta
 
SQOOP PPT
SQOOP PPT
Dushhyant Kumar
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
Simplilearn
 
Hadoop Overview & Architecture
Hadoop Overview & Architecture
EMC
 
Mongodb basics and architecture
Mongodb basics and architecture
Bishal Khanal
 
Hive(ppt)
Hive(ppt)
Abhinav Tyagi
 
Apache HBase™
Apache HBase™
Prashant Gupta
 
Introduction to Apache Spark
Introduction to Apache Spark
Rahul Jain
 
Introduction to sqoop
Introduction to sqoop
Uday Vakalapudi
 
Big Data: Big SQL and HBase
Big Data: Big SQL and HBase
Cynthia Saracco
 
Apache hive introduction
Apache hive introduction
Mahmood Reza Esmaili Zand
 
Map Reduce
Map Reduce
Prashant Gupta
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component
rebeccatho
 
Introduction to Pig
Introduction to Pig
Prashanth Babu
 
Big Data Analytics with Hadoop
Big Data Analytics with Hadoop
Philippe Julio
 
Apache Flume
Apache Flume
Arinto Murdopo
 
Introduction to HBase
Introduction to HBase
Avkash Chauhan
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ramakant Soni
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
Schemaless Databases
Schemaless Databases
Dan Gunter
 
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Simplilearn
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & Internals
Anton Kirillov
 
Introduction to HiveQL
Introduction to HiveQL
kristinferrier
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
Neo4j
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)
Prashant Gupta
 
Introduction to Big Data & Hadoop Architecture - Module 1
Introduction to Big Data & Hadoop Architecture - Module 1
Rohit Agrawal
 
Basics of MongoDB
Basics of MongoDB
HabileLabs
 
Introduction to scoop and its functions
Introduction to scoop and its functions
Rupak Roy
 
Sqoop Explanation with examples and syntax
Sqoop Explanation with examples and syntax
dspyanand
 

More Related Content

What's hot (20)

Introduction to Apache Spark
Introduction to Apache Spark
Rahul Jain
 
Introduction to sqoop
Introduction to sqoop
Uday Vakalapudi
 
Big Data: Big SQL and HBase
Big Data: Big SQL and HBase
Cynthia Saracco
 
Apache hive introduction
Apache hive introduction
Mahmood Reza Esmaili Zand
 
Map Reduce
Map Reduce
Prashant Gupta
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component
rebeccatho
 
Introduction to Pig
Introduction to Pig
Prashanth Babu
 
Big Data Analytics with Hadoop
Big Data Analytics with Hadoop
Philippe Julio
 
Apache Flume
Apache Flume
Arinto Murdopo
 
Introduction to HBase
Introduction to HBase
Avkash Chauhan
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ramakant Soni
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
Schemaless Databases
Schemaless Databases
Dan Gunter
 
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Simplilearn
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & Internals
Anton Kirillov
 
Introduction to HiveQL
Introduction to HiveQL
kristinferrier
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
Neo4j
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)
Prashant Gupta
 
Introduction to Big Data & Hadoop Architecture - Module 1
Introduction to Big Data & Hadoop Architecture - Module 1
Rohit Agrawal
 
Basics of MongoDB
Basics of MongoDB
HabileLabs
 
Introduction to Apache Spark
Introduction to Apache Spark
Rahul Jain
 
Big Data: Big SQL and HBase
Big Data: Big SQL and HBase
Cynthia Saracco
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component
rebeccatho
 
Big Data Analytics with Hadoop
Big Data Analytics with Hadoop
Philippe Julio
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
Ramakant Soni
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
Schemaless Databases
Schemaless Databases
Dan Gunter
 
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Simplilearn
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & Internals
Anton Kirillov
 
Introduction to HiveQL
Introduction to HiveQL
kristinferrier
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
Neo4j
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)
Prashant Gupta
 
Introduction to Big Data & Hadoop Architecture - Module 1
Introduction to Big Data & Hadoop Architecture - Module 1
Rohit Agrawal
 
Basics of MongoDB
Basics of MongoDB
HabileLabs
 

Similar to Apache Sqoop Tutorial | Sqoop: Import & Export Data From MySQL To HDFS | Hadoop Training | Edureka (20)

Introduction to scoop and its functions
Introduction to scoop and its functions
Rupak Roy
 
Sqoop Explanation with examples and syntax
Sqoop Explanation with examples and syntax
dspyanand
 
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Sankar H
 
Apache sqoop with an use case
Apache sqoop with an use case
Davin Abraham
 
Real-Time Data Loading from MySQL to Hadoop
Real-Time Data Loading from MySQL to Hadoop
Continuent
 
Hive
Hive
Vetri V
 
Introduction to Sqoop | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Sqoop | Big Data Hadoop Spark Tutorial | CloudxLab
CloudxLab
 
Hw09 Sqoop Database Import For Hadoop
Hw09 Sqoop Database Import For Hadoop
Cloudera, Inc.
 
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Skills Matter
 
PHP - Intriduction to MySQL And PHP
PHP - Intriduction to MySQL And PHP
Vibrant Technologies & Computers
 
Hive explanation with examples and syntax
Hive explanation with examples and syntax
dspyanand
 
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
DataWorks Summit
 
Apache scoop overview
Apache scoop overview
Nisanth Simon
 
Get started with Microsoft SQL Polybase
Get started with Microsoft SQL Polybase
Henk van der Valk
 
Advanced Sqoop
Advanced Sqoop
Yogesh Kulkarni
 
Windows Azure HDInsight Service
Windows Azure HDInsight Service
Neil Mackenzie
 
DevOps Meetup ansible
DevOps Meetup ansible
sriram_rajan
 
Import Database Data using RODBC in R Studio
Import Database Data using RODBC in R Studio
Rupak Roy
 
SF Big Analytics 20190612: Building highly efficient data lakes using Apache ...
SF Big Analytics 20190612: Building highly efficient data lakes using Apache ...
Chester Chen
 
From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other tools
Guy Harrison
 
Introduction to scoop and its functions
Introduction to scoop and its functions
Rupak Roy
 
Sqoop Explanation with examples and syntax
Sqoop Explanation with examples and syntax
dspyanand
 
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Sankar H
 
Apache sqoop with an use case
Apache sqoop with an use case
Davin Abraham
 
Real-Time Data Loading from MySQL to Hadoop
Real-Time Data Loading from MySQL to Hadoop
Continuent
 
Introduction to Sqoop | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Sqoop | Big Data Hadoop Spark Tutorial | CloudxLab
CloudxLab
 
Hw09 Sqoop Database Import For Hadoop
Hw09 Sqoop Database Import For Hadoop
Cloudera, Inc.
 
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Skills Matter
 
Hive explanation with examples and syntax
Hive explanation with examples and syntax
dspyanand
 
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
DataWorks Summit
 
Apache scoop overview
Apache scoop overview
Nisanth Simon
 
Get started with Microsoft SQL Polybase
Get started with Microsoft SQL Polybase
Henk van der Valk
 
Windows Azure HDInsight Service
Windows Azure HDInsight Service
Neil Mackenzie
 
DevOps Meetup ansible
DevOps Meetup ansible
sriram_rajan
 
Import Database Data using RODBC in R Studio
Import Database Data using RODBC in R Studio
Rupak Roy
 
SF Big Analytics 20190612: Building highly efficient data lakes using Apache ...
SF Big Analytics 20190612: Building highly efficient data lakes using Apache ...
Chester Chen
 
From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other tools
Guy Harrison
 
Ad

More from Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 
What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 
Ad

Recently uploaded (20)

AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
 
The Future of Product Management in AI ERA.pdf
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
Fwdays
 
10 Key Challenges for AI within the EU Data Protection Framework.pdf
10 Key Challenges for AI within the EU Data Protection Framework.pdf
Priyanka Aash
 
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
yosra Saidani
 
AI Agents and FME: A How-to Guide on Generating Synthetic Metadata
AI Agents and FME: A How-to Guide on Generating Synthetic Metadata
Safe Software
 
Mastering AI Workflows with FME by Mark Döring
Mastering AI Workflows with FME by Mark Döring
Safe Software
 
Techniques for Automatic Device Identification and Network Assignment.pdf
Techniques for Automatic Device Identification and Network Assignment.pdf
Priyanka Aash
 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
OpenPOWER Foundation & Open-Source Core Innovations
OpenPOWER Foundation & Open-Source Core Innovations
IBM
 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
cnc-processing-centers-centateq-p-110-en.pdf
cnc-processing-centers-centateq-p-110-en.pdf
AmirStern2
 
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Josef Weingand
 
Securing Account Lifecycles in the Age of Deepfakes.pptx
Securing Account Lifecycles in the Age of Deepfakes.pptx
FIDO Alliance
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
Lessons Learned from Developing Secure AI Workflows.pdf
Lessons Learned from Developing Secure AI Workflows.pdf
Priyanka Aash
 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
 
Raman Bhaumik - Passionate Tech Enthusiast
Raman Bhaumik - Passionate Tech Enthusiast
Raman Bhaumik
 
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
 
The Future of Product Management in AI ERA.pdf
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
" How to survive with 1 billion vectors and not sell a kidney: our low-cost c...
Fwdays
 
10 Key Challenges for AI within the EU Data Protection Framework.pdf
10 Key Challenges for AI within the EU Data Protection Framework.pdf
Priyanka Aash
 
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
yosra Saidani
 
AI Agents and FME: A How-to Guide on Generating Synthetic Metadata
AI Agents and FME: A How-to Guide on Generating Synthetic Metadata
Safe Software
 
Mastering AI Workflows with FME by Mark Döring
Mastering AI Workflows with FME by Mark Döring
Safe Software
 
Techniques for Automatic Device Identification and Network Assignment.pdf
Techniques for Automatic Device Identification and Network Assignment.pdf
Priyanka Aash
 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
OpenPOWER Foundation & Open-Source Core Innovations
OpenPOWER Foundation & Open-Source Core Innovations
IBM
 
9-1-1 Addressing: End-to-End Automation Using FME
9-1-1 Addressing: End-to-End Automation Using FME
Safe Software
 
Cyber Defense Matrix Workshop - RSA Conference
Cyber Defense Matrix Workshop - RSA Conference
Priyanka Aash
 
cnc-processing-centers-centateq-p-110-en.pdf
cnc-processing-centers-centateq-p-110-en.pdf
AmirStern2
 
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Wenn alles versagt - IBM Tape schützt, was zählt! Und besonders mit dem neust...
Josef Weingand
 
Securing Account Lifecycles in the Age of Deepfakes.pptx
Securing Account Lifecycles in the Age of Deepfakes.pptx
FIDO Alliance
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
Lessons Learned from Developing Secure AI Workflows.pdf
Lessons Learned from Developing Secure AI Workflows.pdf
Priyanka Aash
 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
 
Raman Bhaumik - Passionate Tech Enthusiast
Raman Bhaumik - Passionate Tech Enthusiast
Raman Bhaumik
 

Apache Sqoop Tutorial | Sqoop: Import & Export Data From MySQL To HDFS | Hadoop Training | Edureka

  • 2. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Topics to be covered… Problems with RDBMS Need for Sqoop Introduction to Sqoop Features of Sqoop Sqoop Architecture Sqoop Commands & Demo using MySQL 1 2 3 4 5 6
  • 3. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Problems with Relational Database Data Importing was Tedious task Difficult to handle large datasets Cant store Unstructured Data Time Consuming Task RDBMS
  • 4. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Need for Sqoop Data Analysis Command Line Interface High Performance
  • 5. Copyright © 2017, edureka and/or its affiliates. All rights reserved. What is Sqoop? RDBMS HDFS Import Tool used to transfer bulk data between HDFS & Relational Database Servers
  • 6. Copyright © 2017, edureka and/or its affiliates. All rights reserved. What is Sqoop? RDBMS HDFS Export Tool used to transfer bulk data between HDFS & Relational Database Servers
  • 7. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Features of Sqoop Full Load Incremental Load Parallel Import/Export Compression Kerberos Security Integration Data loading directly to HIVE
  • 8. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Sqoop Architecture
  • 9. Copyright © 2017, edureka and/or its affiliates. All rights reserved. How Sqoop Import & Export Works?
  • 11. IMPORT EXPORT LIST DATABASE LIST TABLES CODEGEN Sqoop Import Command Import Import each table of the RDBMS in Hadoop $ sqoop import --connect jdbc:mysql://<ip address>/<database name> --table <mysql_table name> --username <username_for_mysql_user> --password <Password> --target-dir <target directory where data needs to be imported>
  • 12. IMPORT EXPORT LIST DATABASE LIST TABLES CODEGEN Sqoop Export Command Exports data from HDFS to RDBMS Export $ sqoop export --connect jdbc:mysql://<ip address>/<database name> --table <mysql_table name> --username <username_for_mysql_user> --password <Password> --export-dir <directory name where data needs to be exported>
  • 13. IMPORT EXPORT LIST DATABASE LIST TABLES CODEGEN Sqoop List Database It lists the databases present in the relational database Sqoop $ sqoop list databases --connect jdbc:mysql://<ip address>/<database name> --table <mysql_table name> --username <username_for_mysql_user> --password <Password>
  • 14. IMPORT EXPORT LIST DATABASE LIST TABLES CODEGEN Sqoop List Tables ID Name Age Address 1 Jino 25 Kottayam 2 Neha 22 Dharwad 3 Chaitra 23 Bangalore It lists the available tables in the database $ sqoop list tables --connect jdbc:mysql://<ip address>/<database name> --table <mysql_table name> --username <username_for_mysql_user> --password <Password>
  • 15. IMPORT EXPORT LIST DATABASE LIST TABLES CODEGEN Sqoop Codegen Generates DAO Class automatically Generates Java Class file Source code can be recreated $ sqoop codegen --connect jdbc:mysql://<ip address>/<database name> --table <mysql_table name> --username <username_for_mysql_user> --password <Password>