Technology |
Modern |
Traditional |
Volume |
Usually in PetaBytes |
Usually in GigaBytes |
Operations |
Storage, processing, retrieval and pattern extraction from data |
Storage, processing, retrieval and pattern mining of data |
Fault Tolerance |
Hadoop is highly fault tolerant |
SQL has good fault tolerance |
Storage |
Stores data in the form of key-value pairs, tables, hash map etc in distributed systems. |
Stores structured data in tabular format with fixed schema in cloud |
Scaling |
Linear |
Non linear |
Providers |
Cloudera, Horton work, AWS etc. provides Hadoop systems. |
Well-known industry leaders in SQL systems are Microsoft, SAP, Oracle etc. |
Data Access |
Batch oriented data access |
Interactive and batch oriented data access |
Cost |
It is open source and systems can be cost effectively scaled |
It is licensed and costs a fortune to buy a SQL server, moreover if system runs out of storage additional charges also emerge |
Time |
Statements are executed very quickly |
SQL syntax is slow when executed in millions of rows |
Optimization |
It stores data in HDFS and process though Map Reduce with huge optimization techniques. |
It does not have any advanced optimization techniques |
Structure |
Dynamic schema, capable of storing and processing log data, real-time data, images, videos, sensor data etc.(both structured and unstructured) |
Static Schema, capable of storing data(fixed schema) in tabular format only(structured) |
Data Update |
Write data once, read data multiple times |
Read and Write data multiple times |
Integrity |
Low |
High |
Interaction |
Hadoop uses JDBC(Java Database Connectivity) to communicate with SQL systems to send and receive data |
SQL systems can read and write data to Hadoop systems |
Hardware |
Uses commodity hardware |
Uses propriety hardware |
Training |
Learning Hadoop for entry-level as well as seasoned profession is moderately hard |
Learning SQL is easy for even entry-level professionals |