SlideShare a Scribd company logo
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytics Using R | Edureka
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Topics For Today’s Session
Introduction To Data Analytics
Data Cleaning and Manipulation
Statistics
Data Visualization
Machine Learning
Roles, Responsibilities & Salary
Hands-On
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Introduction To Data Analytics
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Why Data Analytics?
Gather Hidden Insights01
Generate Reports 02
Perform Market Analysis03
Improve Business Requirement 04
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
What is Data Analytics?
Data Analytics refers to the techniques to analyse data to enhanced productivity and business gain.
Business
Administration
Exploratory Data
Analysis
Growth in Business
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Who is a Data Analyst?
Collect Data Analyse Data Create Reports
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Data Analyst Skills
Statistics Data Cleaning
EDA Data Visualization
Machine Learning
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Statistics
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Statistics
Statistics is a branch of mathematics dealing with data collection and organization, analysis, interpretation and presentation.
Analyse Data
Build a Model Infer Result
Descriptive
Inferential
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Categories in Statistics – Descriptive Statistics
Descriptive
Descriptive statistics uses the data to provide descriptions of the population, either through numerical calculations or graphs or
tables.
Characteristics of Data
Descriptive Statistics
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Categories in Statistics – Descriptive Statistics
Descriptive
There are mainly two measures you need to understand in Descriptive Statistics.
Measures of Centre01
Measures of Spread 02
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Descriptive Statistics – Measures of Centre
Descriptive
There are 3 terms, you have to understand in Measures of Centre.
Mean
Measure of average of all the values in a sample is
called Mean.
110 + 110 + 93 + 96 + 90 + 110 + 110 + 110
8
= 103.625
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Descriptive Statistics – Measures of Centre
Descriptive
There are 3 terms, you have to understand in Measures of Centre.
Measure of average of all the values in a sample is
called Mean.
110 + 110 + 93 + 96 + 90 + 110 + 110 + 110
8
= 103.625
Measure of the central value of the sample set is
called Median.
21,21,21.3,22.8,23,23,23,23
22.8+23
2
= 22.9
Measure of the central value of the sample set is
called Median.
21,21,21.3,22.8,23,23,23,23
22.8+23
2
= 22.9
Median
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Descriptive Statistics – Measures of Centre
Descriptive
There are 3 terms, you have to understand in Measures of Centre.
Measure of average of all the values in a sample is
called Mean.
110 + 110 + 93 + 96 + 90 + 110 + 110 + 110
8
= 103.625
Measure of the central value of the sample set is
called Median.
21,21,21.3,22.8,23,23,23,23
22.8+23
2
= 22.9
Measure of the central value of the sample set is
called Median.
21,21,21.3,22.8,23,23,23,23
22.8+23
2
= 22.9
Mode
The value most recurrent in the sample set is
known as Mode.
21,21,22,23,24,25,25,25,26 Mode - 25
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Descriptive Statistics – Measures of Spread
Descriptive
Range
Range is the given
measure of how spread
apart are the values in a
dataset.
Range = Max(𝑥𝑖) - Min(𝑥𝑖)
Inter Quartile Range
Inter Quartile
Range(IQR) is the
measure of variability,
based on dividing a
dataset into quartiles.
1 2 3 4 5 6 7 8
Q1 Q2 Q3
Variance
Variance describes how
much a random variable
differs from its expected
value.
It entails computing
squares of deviations.
Standard Deviation
Standard Deviation is
the measure of the
dispersion of a set of
data from its mean.
෍
𝑖=1
𝑁
=(𝑥𝑖−𝜇)²
1
𝑁
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Categories in Statistics – Inferential Statistics
Descriptive
Inferential
Inferential Statistics generalizes a large dataset and applies probability to draw a conclusion. It allows us to infer data parameters
based on a statistical model using a sample data.
Statistical Model
Inferential Statistics
Start
Process Step
Decision
Answer
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Inferential Statistics – Hypothesis Testing
Descriptive
Inferential
Statisticians use hypothesis testing to formally check whether the hypothesis is accepted or rejected.
State the Hypotheses – This stage
involves stating the null and alternative
hypotheses.
Formulate an Analysis Plan – This stage involves the
construction of an analysis plan.
Analyse Sample Data – This stage involves the calculation and
interpretation of the test statistic as described in the analysis plan.
Interpret Results – This stage involves the application of the decision rule described in
the analysis plan.
Hypothesis testing is conducted in the following manner:
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Descriptive vs Inferential Statistics
Descriptive Statistics Inferential Statistics
Concerned with Properties of
Population
Makes inferences from the sample
Presents data in a meaningful manner
Compares and predicts the future
outcomes
Outcomes are shown in form of
charts, tables and graphs
Outcomes are in the form of
probability scores
Describes the known data
Tries to make conclusions beyond the
data available
Measures of central tendency and
spread of data
Hypothesis Testing and Analysis of
variance.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Data Cleaning and Manipulation
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Data Cleaning and Manipulation
Data Cleaning
The process of detecting and correcting corrupt or
inaccurate records from a database is said to be Data
Cleaning.
Data Manipulation
The process of changing data to make it more
organized and easy to read is known as Data
Manipulation.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Data Visualization
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Data Visualization
Data Visualization is the representation of data inform of charts, diagram etc.
Bar Graph Scatter Plot Pie Chart
Box Plot Line Graph
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Bonus:
Machine Learning
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Machine Learning
Machine Learning is a concept which allows the machine to learn from examples and experience, and that too without being
explicitly programmed.
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Data Analyst: Roles and Responsibilities
Determining Organizational Goals Mine Data Data Cleaning
Analyzing Data Pinpointing Trends and Patterns Creating Reports with Visualizations
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Salary of Data Analyst
Average Salary (US)
Average Salary (IND)
$83,878
₹404,660
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Need of R
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Need of R
R is open-source and freely available.
R is cross-platform compatible.
R is a powerful scripting language.
R is highly flexible and evolved.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.
Hands-On
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Hands-On
To perform data analysis on the below data set and gather some insights.
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Data Analytics @edureka
Program
Starts
2nd
Week
7th Week 15th Week
11th
Week
01
02
03
04
Statistics Essentials
Probability Bayesian Interference
Regression Making Statistics
Data Analytics with R
Data Manipulation Exploratory Analysis Regression
Data Visualization Data Mining Sentiment Analysis
SAS Training
Advanced Statistical Techniques SAS Macros
PROC SQL SAS ODS Advanced SAS Procedures
Tableau Training
LOD Expressions Tableau Desktop Tableau Public
Data Visualization Integration with R
Graduated as Data Analyst
Self-Paced
Instructor - Led
Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification
Data Analytics @edureka
QlikView
Certification
Training
Advanced
MS Excel
2010
R
Programming
Certification
Training
Analytics for
Retail Banks
Decision Tree
Modelling
Using R
Certification
Training
Machine
Learning
with Mahout
Certification
Training
Advanced
Predictive
Modelling in
R
Certification
Training
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytics Using R | Edureka

More Related Content

What's hot (20)

PPTX
Introduction to data science
Sampath Kumar
 
PPTX
Introduction to data analytics
Umasree Raghunath
 
PPTX
kinds of analytics
Benila Paul
 
PDF
Data Analytics PowerPoint Presentation Slides
SlideTeam
 
PPTX
Introduction to Data Mining
DataminingTools Inc
 
ODP
Introduction To Analytics
Alex Meadows
 
PPTX
Data visualization
Jan Willem Tulp
 
PPTX
Data analytics
Bhanu Pratap
 
PPTX
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 
PPTX
Introduction to Business Data Analytics
VadivelM9
 
PDF
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Edureka!
 
PDF
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DATAVERSITY
 
PDF
Data visualization introduction
ManokamnaKochar1
 
PPTX
Introduction to Data Analytics
NR Computer Learning Center
 
PPTX
Data analytics vs. Data analysis
Dr. C.V. Suresh Babu
 
PDF
How to Become a Data Analyst? | Data Analyst Skills | Data Analyst Training |...
Edureka!
 
PPTX
Data visualization
Sushil kasar
 
PPTX
Data Analytics
Srinimf-Slides
 
PPTX
Introduction to Data Visualization
Stephen Tracy
 
PPTX
Predictive Analytics - An Overview
MachinePulse
 
Introduction to data science
Sampath Kumar
 
Introduction to data analytics
Umasree Raghunath
 
kinds of analytics
Benila Paul
 
Data Analytics PowerPoint Presentation Slides
SlideTeam
 
Introduction to Data Mining
DataminingTools Inc
 
Introduction To Analytics
Alex Meadows
 
Data visualization
Jan Willem Tulp
 
Data analytics
Bhanu Pratap
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 
Introduction to Business Data Analytics
VadivelM9
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Edureka!
 
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DATAVERSITY
 
Data visualization introduction
ManokamnaKochar1
 
Introduction to Data Analytics
NR Computer Learning Center
 
Data analytics vs. Data analysis
Dr. C.V. Suresh Babu
 
How to Become a Data Analyst? | Data Analyst Skills | Data Analyst Training |...
Edureka!
 
Data visualization
Sushil kasar
 
Data Analytics
Srinimf-Slides
 
Introduction to Data Visualization
Stephen Tracy
 
Predictive Analytics - An Overview
MachinePulse
 

Similar to Data Analytics For Beginners | Introduction To Data Analytics | Data Analytics Using R | Edureka (20)

PPTX
Data analytics presentation- Management career institute
PoojaPatidar11
 
PPTX
Big Data for Pearson Btec Higher level 3.ppt
abigailjudith8
 
PPTX
Data-Driven Decision Making: A Data Analyst's Journey
lakshmipriyaexcelr
 
PDF
Skillshikshapdf on data analytics course
prateekahuja2411
 
PPTX
Data-Analyst course-Demystified ppt.pptx
lakshmipriyaexcelr
 
PDF
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
Kajal Digital
 
PPTX
What is Data analytics? How is data analytics a better career option?
Aspire Techsoft Academy
 
PPTX
Data-Analyst-career-path
SumitAgarwal65690
 
PPTX
Introduction-to-Data-Analysis_Final Content.pptx
ItismeItisnotme
 
PDF
lec1.pdf
nimmakiran1
 
PPTX
Data Processing & Explain each term in details.pptx
PratikshaSurve4
 
PDF
Boost Your Career with Data Analyst.pdf
jashwanthmuthumula
 
PPTX
How to become Data Analyst?
Intellipaat
 
PDF
Data Analyst Job Description | Edureka
Edureka!
 
PPTX
Top Data Analytics Training institute In Pune
pradeepghosh97
 
PPTX
What are Entry Level Data Analyst Jobs?: A Guide Skills
optnation1
 
PDF
Statistics For Data Science | Statistics Using R Programming Language | Hypot...
Edureka!
 
PPTX
Unit2
DrChetanNagar
 
PDF
MASTERPIECE TO EXCEL IN DATA ANALYSIS WITH EXCEL.pdf
Michael784113
 
PDF
23.pdf
JeanJaggu
 
Data analytics presentation- Management career institute
PoojaPatidar11
 
Big Data for Pearson Btec Higher level 3.ppt
abigailjudith8
 
Data-Driven Decision Making: A Data Analyst's Journey
lakshmipriyaexcelr
 
Skillshikshapdf on data analytics course
prateekahuja2411
 
Data-Analyst course-Demystified ppt.pptx
lakshmipriyaexcelr
 
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
Kajal Digital
 
What is Data analytics? How is data analytics a better career option?
Aspire Techsoft Academy
 
Data-Analyst-career-path
SumitAgarwal65690
 
Introduction-to-Data-Analysis_Final Content.pptx
ItismeItisnotme
 
lec1.pdf
nimmakiran1
 
Data Processing & Explain each term in details.pptx
PratikshaSurve4
 
Boost Your Career with Data Analyst.pdf
jashwanthmuthumula
 
How to become Data Analyst?
Intellipaat
 
Data Analyst Job Description | Edureka
Edureka!
 
Top Data Analytics Training institute In Pune
pradeepghosh97
 
What are Entry Level Data Analyst Jobs?: A Guide Skills
optnation1
 
Statistics For Data Science | Statistics Using R Programming Language | Hypot...
Edureka!
 
MASTERPIECE TO EXCEL IN DATA ANALYSIS WITH EXCEL.pdf
Michael784113
 
23.pdf
JeanJaggu
 
Ad

More from Edureka! (20)

PDF
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
PDF
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
PDF
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
PDF
Tableau Tutorial for Data Science | Edureka
Edureka!
 
PDF
Python Programming Tutorial | Edureka
Edureka!
 
PDF
Top 5 PMP Certifications | Edureka
Edureka!
 
PDF
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
PDF
Linux Mint Tutorial | Edureka
Edureka!
 
PDF
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
PDF
Importance of Digital Marketing | Edureka
Edureka!
 
PDF
RPA in 2020 | Edureka
Edureka!
 
PDF
Email Notifications in Jenkins | Edureka
Edureka!
 
PDF
EA Algorithm in Machine Learning | Edureka
Edureka!
 
PDF
Cognitive AI Tutorial | Edureka
Edureka!
 
PDF
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
PDF
Blue Prism Top Interview Questions | Edureka
Edureka!
 
PDF
Big Data on AWS Tutorial | Edureka
Edureka!
 
PDF
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
PDF
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
PDF
Introduction to DevOps | Edureka
Edureka!
 
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Edureka!
 
Ad

Recently uploaded (20)

PDF
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
 
DOCX
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
 
PPTX
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Poster...
Michele Kryston
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PDF
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
 
PDF
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
PDF
Open Source Milvus Vector Database v 2.6
Zilliz
 
PDF
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
 
PPTX
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
PDF
Python Conference Singapore - 19 Jun 2025
ninefyi
 
PPTX
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Pitch ...
Michele Kryston
 
PDF
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
My Journey from CAD to BIM: A True Underdog Story
Safe Software
 
PDF
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
PDF
Automating the Geo-Referencing of Historic Aerial Photography in Flanders
Safe Software
 
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
PPTX
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
PDF
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
 
PPTX
Practical Applications of AI in Local Government
OnBoard
 
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
 
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
 
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Poster...
Michele Kryston
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
“Scaling i.MX Applications Processors’ Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
 
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
Open Source Milvus Vector Database v 2.6
Zilliz
 
Cracking the Code - Unveiling Synergies Between Open Source Security and AI.pdf
Priyanka Aash
 
UserCon Belgium: Honey, VMware increased my bill
stijn40
 
Python Conference Singapore - 19 Jun 2025
ninefyi
 
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Pitch ...
Michele Kryston
 
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
My Journey from CAD to BIM: A True Underdog Story
Safe Software
 
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
Automating the Geo-Referencing of Historic Aerial Photography in Flanders
Safe Software
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
 
Practical Applications of AI in Local Government
OnBoard
 

Data Analytics For Beginners | Introduction To Data Analytics | Data Analytics Using R | Edureka

  • 2. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Topics For Today’s Session Introduction To Data Analytics Data Cleaning and Manipulation Statistics Data Visualization Machine Learning Roles, Responsibilities & Salary Hands-On
  • 3. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Introduction To Data Analytics
  • 4. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Why Data Analytics? Gather Hidden Insights01 Generate Reports 02 Perform Market Analysis03 Improve Business Requirement 04
  • 5. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification What is Data Analytics? Data Analytics refers to the techniques to analyse data to enhanced productivity and business gain. Business Administration Exploratory Data Analysis Growth in Business
  • 6. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Who is a Data Analyst? Collect Data Analyse Data Create Reports
  • 7. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Data Analyst Skills Statistics Data Cleaning EDA Data Visualization Machine Learning
  • 8. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Statistics
  • 9. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Statistics Statistics is a branch of mathematics dealing with data collection and organization, analysis, interpretation and presentation. Analyse Data Build a Model Infer Result
  • 11. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Categories in Statistics – Descriptive Statistics Descriptive Descriptive statistics uses the data to provide descriptions of the population, either through numerical calculations or graphs or tables. Characteristics of Data Descriptive Statistics
  • 12. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Categories in Statistics – Descriptive Statistics Descriptive There are mainly two measures you need to understand in Descriptive Statistics. Measures of Centre01 Measures of Spread 02
  • 13. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Descriptive Statistics – Measures of Centre Descriptive There are 3 terms, you have to understand in Measures of Centre. Mean Measure of average of all the values in a sample is called Mean. 110 + 110 + 93 + 96 + 90 + 110 + 110 + 110 8 = 103.625
  • 14. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Descriptive Statistics – Measures of Centre Descriptive There are 3 terms, you have to understand in Measures of Centre. Measure of average of all the values in a sample is called Mean. 110 + 110 + 93 + 96 + 90 + 110 + 110 + 110 8 = 103.625 Measure of the central value of the sample set is called Median. 21,21,21.3,22.8,23,23,23,23 22.8+23 2 = 22.9 Measure of the central value of the sample set is called Median. 21,21,21.3,22.8,23,23,23,23 22.8+23 2 = 22.9 Median
  • 15. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Descriptive Statistics – Measures of Centre Descriptive There are 3 terms, you have to understand in Measures of Centre. Measure of average of all the values in a sample is called Mean. 110 + 110 + 93 + 96 + 90 + 110 + 110 + 110 8 = 103.625 Measure of the central value of the sample set is called Median. 21,21,21.3,22.8,23,23,23,23 22.8+23 2 = 22.9 Measure of the central value of the sample set is called Median. 21,21,21.3,22.8,23,23,23,23 22.8+23 2 = 22.9 Mode The value most recurrent in the sample set is known as Mode. 21,21,22,23,24,25,25,25,26 Mode - 25
  • 16. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Descriptive Statistics – Measures of Spread Descriptive Range Range is the given measure of how spread apart are the values in a dataset. Range = Max(𝑥𝑖) - Min(𝑥𝑖) Inter Quartile Range Inter Quartile Range(IQR) is the measure of variability, based on dividing a dataset into quartiles. 1 2 3 4 5 6 7 8 Q1 Q2 Q3 Variance Variance describes how much a random variable differs from its expected value. It entails computing squares of deviations. Standard Deviation Standard Deviation is the measure of the dispersion of a set of data from its mean. ෍ 𝑖=1 𝑁 =(𝑥𝑖−𝜇)² 1 𝑁
  • 17. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Categories in Statistics – Inferential Statistics Descriptive Inferential Inferential Statistics generalizes a large dataset and applies probability to draw a conclusion. It allows us to infer data parameters based on a statistical model using a sample data. Statistical Model Inferential Statistics Start Process Step Decision Answer
  • 18. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Inferential Statistics – Hypothesis Testing Descriptive Inferential Statisticians use hypothesis testing to formally check whether the hypothesis is accepted or rejected. State the Hypotheses – This stage involves stating the null and alternative hypotheses. Formulate an Analysis Plan – This stage involves the construction of an analysis plan. Analyse Sample Data – This stage involves the calculation and interpretation of the test statistic as described in the analysis plan. Interpret Results – This stage involves the application of the decision rule described in the analysis plan. Hypothesis testing is conducted in the following manner:
  • 19. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Descriptive vs Inferential Statistics Descriptive Statistics Inferential Statistics Concerned with Properties of Population Makes inferences from the sample Presents data in a meaningful manner Compares and predicts the future outcomes Outcomes are shown in form of charts, tables and graphs Outcomes are in the form of probability scores Describes the known data Tries to make conclusions beyond the data available Measures of central tendency and spread of data Hypothesis Testing and Analysis of variance.
  • 20. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Data Cleaning and Manipulation
  • 21. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Data Cleaning and Manipulation Data Cleaning The process of detecting and correcting corrupt or inaccurate records from a database is said to be Data Cleaning. Data Manipulation The process of changing data to make it more organized and easy to read is known as Data Manipulation.
  • 22. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Data Visualization
  • 23. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Data Visualization Data Visualization is the representation of data inform of charts, diagram etc. Bar Graph Scatter Plot Pie Chart Box Plot Line Graph
  • 24. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Bonus: Machine Learning
  • 25. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Machine Learning Machine Learning is a concept which allows the machine to learn from examples and experience, and that too without being explicitly programmed.
  • 26. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Data Analyst: Roles and Responsibilities Determining Organizational Goals Mine Data Data Cleaning Analyzing Data Pinpointing Trends and Patterns Creating Reports with Visualizations
  • 27. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Salary of Data Analyst Average Salary (US) Average Salary (IND) $83,878 ₹404,660
  • 28. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Need of R
  • 29. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Need of R R is open-source and freely available. R is cross-platform compatible. R is a powerful scripting language. R is highly flexible and evolved.
  • 30. Copyright © 2018, edureka and/or its affiliates. All rights reserved. Hands-On
  • 31. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Hands-On To perform data analysis on the below data set and gather some insights.
  • 32. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Data Analytics @edureka Program Starts 2nd Week 7th Week 15th Week 11th Week 01 02 03 04 Statistics Essentials Probability Bayesian Interference Regression Making Statistics Data Analytics with R Data Manipulation Exploratory Analysis Regression Data Visualization Data Mining Sentiment Analysis SAS Training Advanced Statistical Techniques SAS Macros PROC SQL SAS ODS Advanced SAS Procedures Tableau Training LOD Expressions Tableau Desktop Tableau Public Data Visualization Integration with R Graduated as Data Analyst Self-Paced Instructor - Led
  • 33. Data Analytics Master Program www.edureka.co/masters-program/data-analyst-certification Data Analytics @edureka QlikView Certification Training Advanced MS Excel 2010 R Programming Certification Training Analytics for Retail Banks Decision Tree Modelling Using R Certification Training Machine Learning with Mahout Certification Training Advanced Predictive Modelling in R Certification Training