SlideShare a Scribd company logo
7
Most read
8
Most read
11
Most read
Web Usage Pattern
SHAH RUSHABH R CE-111
SHREYANSH R KEJRIWAL CE-113
Outline
 Brief overview of Web mining
 Web usage mining
 Application areas of Web usage
mining
 Future research directions
Web Mining
 Web Mining is the application of
data mining techniques to discover
and retrieve useful information and
patterns from the World Wide Web
documents and services.
Web Mining Categories
 Web Content Mining- extracting
knowledge from the content of the
Web
 Web Structure Mining- discovering
the model underlying the link
structures of the Web
 Web Usage Mining- discovering
user’s navigation pattern and
predicting user’s behavior
Web Usage Mining Processes
 Preprocessing: conversion of the raw data
into the data abstraction (users, sessions,
episodes, clickstreams, and pageviews)
necessary for further applying the data
mining algorithm.
 Pattern Discovery: is the key component of
WUM, which converges the algorithms and
techniques from data mining, machine
learning, statistics and pattern recognition
etc. research categories.
 Pattern Analysis: Validation and
interpretation of the mined patterns
Web Usage Mining Processes
(Cont.)
Web Usage Mining- Preprocessing
 Data Cleaning: remove outliers and/or irrelative data
 User Identification: associate page references with
different users
 Session Identification: divide all pages accessed by a
user into sessions
 Path Completion: add important page access records
that are missing in the access log due to browser and
proxy server caching
 Formatting: format the sessions according to the type
of data mining to be accomplished.
Web Usage Mining -
Pattern Discovery Tasks
 Statistical Analysis: frequency analysis, mean,
median, etc.
◦ Improve system performance
◦ Provide support for marketing decisions
◦ Simplify site modification task
 Clustering:
◦ Clustering of users help to discover groups of users
with similar navigation patterns => provide
personalized Web content
 ◦ Clustering of pages help to discover groups of pages
having related content => search engine
Web Usage Mining -
Pattern Discovery Tasks (Cont.)
 Classification: the technique to map a data
item into one of several predefined classes
◦ Develop profile of users belonging to a
particular class or category
 Association Rules: discover correlations
among pages accessed together by a client
◦ Help the restructure of Web site
◦ Page prefetching
◦ Develop e-commerce marketing strategies
Web Usage Mining -
Pattern Discovery Tasks (Cont.)
 Sequential Patterns: extract frequently occurring
intersession patterns such that the presence of a set
of items followed by another item in time order
◦ Predict future user visit patterns=>placing ads or
recommendations
◦ Page prefeteching
 Dependency Modeling: determine if there are any
significant dependencies among the variables in the
Web domain
◦ Predict future Web resource consumption
◦ Develop business strategies to increase sales
◦ Improve navigational convenience of users
Web Usage Mining -
Pattern Analysis
 Pattern Analysis is the final stage of WUM,
which involves the validation and
interpretation of the mined pattern
 Validation: to eliminate the irrelative rules
or patterns and to extract the interesting
rules or patterns from the output of the
pattern discovery process
 Interpretation: the output of mining
algorithms is mainly in mathematic form
and not suitable for direct human
interpretations
Web Usage Mining -
Pattern Analysis Methodologies and Tools
 Visualization: help people to understand both real and
abstract concepts
◦ WebViz: Web is visualized as a direct graph
 Query mechanism: allow analysts to extract only
relevant and useful patterns by specifying constraints.
◦ WEBMINER
 On-Line Analytical Processing (OLAP): enable analysts
to perform ad hoc analysis of data in multiple
dimensions for decision-making
◦ WebLogMiner
Application Areas for
Web Usage Mining
 Personalized: discover the preference and
needs ofindividual Web users in order to
provide personalized Web site for certain
types of users
 Impersonalized: examine general user
navigation patterns in order to understand
how general users use the site
◦ System Improvement
◦ Site Modification
◦ Business Intelligence
◦ Web Characterization
Future Research Directions
 Usage Mining on Semantic Web
◦ Help to build semantic Web
◦ With semantic Web, WUM can be
improved
 Multimedia Web Data Mining
◦ Representation, problem solving and
learning from Multimedia data is
indeed a challenge
Future Research Directions
(Cont.)
 Analysis of Discovered Patterns
◦ Research on efficient, flexible and
powerful analysis tools
 More Applications
◦ Temporal evolutions of usage behavior
◦ Improving Web services
◦ Detect credit card fraud
◦ Privacy issues
Conclusion
 Web usage and data mining to find patterns is a
growing area with the growth of Web-based
applications
 Application of web usage data can be used to
better understand web usage, and apply this
specific knowledge to better serve users
 Web usage patterns and data mining can be the basis
for a great deal of future research
THANK YOU

More Related Content

PPTX
introduction to NOSQL Database
PPTX
Deductive databases
PPT
1.2 steps and functionalities
PPT
PPTX
Web mining
PPT
Data mining-primitives-languages-and-system-architectures2641
ODP
Web content mining
PPTX
Distributed Query Processing
introduction to NOSQL Database
Deductive databases
1.2 steps and functionalities
Web mining
Data mining-primitives-languages-and-system-architectures2641
Web content mining
Distributed Query Processing

What's hot (20)

PDF
Lecture6 introduction to data streams
PPTX
Classification in data mining
PPTX
DATA WRANGLING presentation.pptx
PPT
Class diagrams
PPTX
Data mining primitives
PDF
Web scraping in python
PPTX
Dynamic storage allocation techniques in Compiler design
PDF
Introduction to Machine Learning with SciKit-Learn
PPTX
Data preprocessing in Machine learning
PPTX
Query processing in Distributed Database System
PPTX
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
PPT
5.3 mining sequential patterns
PDF
Data preprocessing using Machine Learning
PPT
01 Data Mining: Concepts and Techniques, 2nd ed.
PPTX
Dbscan algorithom
PPTX
Query processing and optimization (updated)
PPTX
Data streaming fundamentals
PPTX
Data mining
PPT
Data preprocessing
PPTX
SQL, Embedded SQL, Dynamic SQL and SQLJ
Lecture6 introduction to data streams
Classification in data mining
DATA WRANGLING presentation.pptx
Class diagrams
Data mining primitives
Web scraping in python
Dynamic storage allocation techniques in Compiler design
Introduction to Machine Learning with SciKit-Learn
Data preprocessing in Machine learning
Query processing in Distributed Database System
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
5.3 mining sequential patterns
Data preprocessing using Machine Learning
01 Data Mining: Concepts and Techniques, 2nd ed.
Dbscan algorithom
Query processing and optimization (updated)
Data streaming fundamentals
Data mining
Data preprocessing
SQL, Embedded SQL, Dynamic SQL and SQLJ
Ad

Viewers also liked (20)

PPTX
Web content mining
PPTX
Web mining (structure mining)
PDF
Preprocessing of Web Log Data for Web Usage Mining
PDF
Web mining slides
ODP
Personal Web Usage Mining
PPTX
WEB MINING.
PPTX
Web mining
PPTX
Web Mining Presentation Final
PPT
Web Mining
PPT
A survey on web usage mining techniques
PPTX
Web mining
PPTX
Web mining tools
PDF
WWW2013: Web Usage Mining with Semantic Analysis
ODP
Introduction To OpenStreetMap - CrisisCamp Toronto
PPTX
Learning to Classify Users in Online Interaction Networks
PDF
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
PPTX
ODP
Web mining
PDF
PDF
Birch
Web content mining
Web mining (structure mining)
Preprocessing of Web Log Data for Web Usage Mining
Web mining slides
Personal Web Usage Mining
WEB MINING.
Web mining
Web Mining Presentation Final
Web Mining
A survey on web usage mining techniques
Web mining
Web mining tools
WWW2013: Web Usage Mining with Semantic Analysis
Introduction To OpenStreetMap - CrisisCamp Toronto
Learning to Classify Users in Online Interaction Networks
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web mining
Birch
Ad

Similar to Web Usage Pattern (20)

PPTX
Web mining and its types
PPT
Applying web mining application for user behavior understanding
PPT
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
PDF
Classification of User & Pattern discovery in WUM: A Survey
PPTX
Webmining ppt
PDF
A Review on Pattern Discovery Techniques of Web Usage Mining
PDF
A Survey of Issues and Techniques of Web Usage Mining
PDF
A Novel Framework on Web Usage Mining
PPTX
Web usage mining
PDF
IRJET-A Survey on Web Personalization of Web Usage Mining
PPTX
Web mining
PDF
Pxc3893553
PPT
Minning WWW
PPTX
PDF
Web mining and social media mining
PPTX
PDF
PDF
Cl32543545
PDF
Cl32543545
DOCX
Minning www
Web mining and its types
Applying web mining application for user behavior understanding
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
Classification of User & Pattern discovery in WUM: A Survey
Webmining ppt
A Review on Pattern Discovery Techniques of Web Usage Mining
A Survey of Issues and Techniques of Web Usage Mining
A Novel Framework on Web Usage Mining
Web usage mining
IRJET-A Survey on Web Personalization of Web Usage Mining
Web mining
Pxc3893553
Minning WWW
Web mining and social media mining
Cl32543545
Cl32543545
Minning www

More from Shreyansh Kejriwal (13)

PPTX
Apollo-Copper Merger and Acquisition
PPTX
Personalities Quiz
PDF
Marketing mackathon solution
PPTX
Crescendo !!!
PDF
Crescendo round1
PPTX
PPTX
A Health Bar
PPTX
Importance of a Stress Free Envioronment
PPT
Why satisfied customers defect ??
PPT
Operations and Services Costing
PPTX
Balance Sheet and Ratio Analysis of a Listed Company
DOCX
Comparative Analysis of Starbucks Vs Costa Coffee
Apollo-Copper Merger and Acquisition
Personalities Quiz
Marketing mackathon solution
Crescendo !!!
Crescendo round1
A Health Bar
Importance of a Stress Free Envioronment
Why satisfied customers defect ??
Operations and Services Costing
Balance Sheet and Ratio Analysis of a Listed Company
Comparative Analysis of Starbucks Vs Costa Coffee

Recently uploaded (20)

PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Foundation of Data Science unit number two notes
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Global journeys: estimating international migration
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
Understanding Prototyping in Design and Development
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Master Databricks SQL with AccentFuture – The Future of Data Warehousing
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPT
Quality review (1)_presentation of this 21
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PDF
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Foundation of Data Science unit number two notes
oil_refinery_comprehensive_20250804084928 (1).pptx
Global journeys: estimating international migration
Moving the Public Sector (Government) to a Digital Adoption
Understanding Prototyping in Design and Development
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Master Databricks SQL with AccentFuture – The Future of Data Warehousing
IB Computer Science - Internal Assessment.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Quality review (1)_presentation of this 21
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
climate analysis of Dhaka ,Banglades.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...

Web Usage Pattern

  • 1. Web Usage Pattern SHAH RUSHABH R CE-111 SHREYANSH R KEJRIWAL CE-113
  • 2. Outline  Brief overview of Web mining  Web usage mining  Application areas of Web usage mining  Future research directions
  • 3. Web Mining  Web Mining is the application of data mining techniques to discover and retrieve useful information and patterns from the World Wide Web documents and services.
  • 4. Web Mining Categories  Web Content Mining- extracting knowledge from the content of the Web  Web Structure Mining- discovering the model underlying the link structures of the Web  Web Usage Mining- discovering user’s navigation pattern and predicting user’s behavior
  • 5. Web Usage Mining Processes  Preprocessing: conversion of the raw data into the data abstraction (users, sessions, episodes, clickstreams, and pageviews) necessary for further applying the data mining algorithm.  Pattern Discovery: is the key component of WUM, which converges the algorithms and techniques from data mining, machine learning, statistics and pattern recognition etc. research categories.  Pattern Analysis: Validation and interpretation of the mined patterns
  • 6. Web Usage Mining Processes (Cont.)
  • 7. Web Usage Mining- Preprocessing  Data Cleaning: remove outliers and/or irrelative data  User Identification: associate page references with different users  Session Identification: divide all pages accessed by a user into sessions  Path Completion: add important page access records that are missing in the access log due to browser and proxy server caching  Formatting: format the sessions according to the type of data mining to be accomplished.
  • 8. Web Usage Mining - Pattern Discovery Tasks  Statistical Analysis: frequency analysis, mean, median, etc. ◦ Improve system performance ◦ Provide support for marketing decisions ◦ Simplify site modification task  Clustering: ◦ Clustering of users help to discover groups of users with similar navigation patterns => provide personalized Web content  ◦ Clustering of pages help to discover groups of pages having related content => search engine
  • 9. Web Usage Mining - Pattern Discovery Tasks (Cont.)  Classification: the technique to map a data item into one of several predefined classes ◦ Develop profile of users belonging to a particular class or category  Association Rules: discover correlations among pages accessed together by a client ◦ Help the restructure of Web site ◦ Page prefetching ◦ Develop e-commerce marketing strategies
  • 10. Web Usage Mining - Pattern Discovery Tasks (Cont.)  Sequential Patterns: extract frequently occurring intersession patterns such that the presence of a set of items followed by another item in time order ◦ Predict future user visit patterns=>placing ads or recommendations ◦ Page prefeteching  Dependency Modeling: determine if there are any significant dependencies among the variables in the Web domain ◦ Predict future Web resource consumption ◦ Develop business strategies to increase sales ◦ Improve navigational convenience of users
  • 11. Web Usage Mining - Pattern Analysis  Pattern Analysis is the final stage of WUM, which involves the validation and interpretation of the mined pattern  Validation: to eliminate the irrelative rules or patterns and to extract the interesting rules or patterns from the output of the pattern discovery process  Interpretation: the output of mining algorithms is mainly in mathematic form and not suitable for direct human interpretations
  • 12. Web Usage Mining - Pattern Analysis Methodologies and Tools  Visualization: help people to understand both real and abstract concepts ◦ WebViz: Web is visualized as a direct graph  Query mechanism: allow analysts to extract only relevant and useful patterns by specifying constraints. ◦ WEBMINER  On-Line Analytical Processing (OLAP): enable analysts to perform ad hoc analysis of data in multiple dimensions for decision-making ◦ WebLogMiner
  • 13. Application Areas for Web Usage Mining  Personalized: discover the preference and needs ofindividual Web users in order to provide personalized Web site for certain types of users  Impersonalized: examine general user navigation patterns in order to understand how general users use the site ◦ System Improvement ◦ Site Modification ◦ Business Intelligence ◦ Web Characterization
  • 14. Future Research Directions  Usage Mining on Semantic Web ◦ Help to build semantic Web ◦ With semantic Web, WUM can be improved  Multimedia Web Data Mining ◦ Representation, problem solving and learning from Multimedia data is indeed a challenge
  • 15. Future Research Directions (Cont.)  Analysis of Discovered Patterns ◦ Research on efficient, flexible and powerful analysis tools  More Applications ◦ Temporal evolutions of usage behavior ◦ Improving Web services ◦ Detect credit card fraud ◦ Privacy issues
  • 16. Conclusion  Web usage and data mining to find patterns is a growing area with the growth of Web-based applications  Application of web usage data can be used to better understand web usage, and apply this specific knowledge to better serve users  Web usage patterns and data mining can be the basis for a great deal of future research