SlideShare a Scribd company logo
Introduction to Elasticsearch for
Business Intelligence and
Application Insights
Presented by Sean Donnelly
at Data Works MD – May 29th 2019
@DonSeannelly
http://donseannelly.com
https://github.com/donseannelly
Sean Donnelly
Software Architect at Cyber4All
https://clark.centerCLARK
Cyber4All https://towson.edu/cyber4all
Outline
• What we will cover
• Introduction to Elasticsearch
• Fundamentals of Storage and Retrieval in Elasticsearch
• Why we are using it at CLARK
• The Elastic Stack
• Use Beyond Search
• What we will not cover
• Clusters, Nodes, Scaling, Sharding and other nitty-gritty details for getting to
know Elasticsearch
• Details on standing up the Elastic Stack
What is Elasticsearch?
RESTful search and analytics engine
What does it do?
• Uses Lucene indices under the hood
• Provides built-in tokenizers and analyzers
• Provides a score for the relevancy of each result to the query
• Provides binary matching via filters
• Supports pagination out of the box
PUT meetups/_doc/1
{
”name" : ”Data Works MD",
"date" : "2019-05-29T18:30:00"
}
Indexing a Document
Getting a Document
GET meetups/_doc/1
{
"_index" : ”meetups",
"_type" : "_doc",
"_id" : ”1",
"_version" : 1,
"_seq_no" : 10,
"_primary_term" : 1,
"found": true,
"_source" : {
”name" : ”Data Works MD",
"date" : "2019-05-29T18:30:00”
}
}
Searching for Documents
GET meetups/_search?q=name:Data Works MD
…
"hits": {
"total" : { "value": 1, "relation": "eq" },
"max_score": 1.3862944,
"hits" : [ {
"_index" : ”meetups",
"_type" : "_doc",
"_id" : ”1",
"_score" : 1.3862944,
"_source" : {
”name" : ”Data Works MD",
"date" : "2019-05-29T18:30:00”
}
} ]
}
Elasticsearch at CLARK
• Goals for search:
• Fast
• Full text
• Word stemming, stop words, fuzziness
• We already had documents formatted in
MongoDB
• Our searchable data is read-only in the
application
What it looks
like
Ingest
Store, Search, Analyze
Visualize
Ingest
Store, Search, Analyze
Visualize
Custom
Self Hosted Hosting
Key Use Cases
• Application Search
• Application Performance Monitoring
• have our response times slowed significantly?
• Logging
• explore the logs that we rarely (never) read
• Machine Learning
• Alert me if there is an unusual increase in abandoned shopping carts
• Alert me if there is an unusual change in HTTP status codes for a resource
Thanks!

More Related Content

What's hot (20)

PPTX
Google history nd architecture
Divyangee Jain
 
PPTX
Cited-by Linking
Crossref
 
PPTX
Data, data, everywhere? Not nearly enough!
UKSG: connecting the knowledge community
 
PDF
Collecting and Using Funding Data Crossref
Relawan Jurnal Indonesia
 
PDF
Barcelona 2014 CrossRef: What's Coming by Ed Pentz
Crossref
 
PPTX
Boosting your SEO with data markup
Manoj K G
 
PPTX
Reference linking and Cited-by
Crossref
 
PPTX
Collecting and using funding data in your publications
Crossref
 
PPTX
Data Modeling for Security, Privacy and Data Protection
Karen Lopez
 
PPTX
The Global reach of Crossref metadata
Crossref
 
PPTX
Introduction to SharePoint 2013 REST API
QUONTRASOLUTIONS
 
PPTX
How search engine work ppt
Shubham Chinchkar
 
PDF
A secure and dynamic multi
Shakas Technologies
 
PDF
MENGGUNAKAN METADATA PADA CROSSREF
Relawan Jurnal Indonesia
 
PDF
Empowering Analysts with Graph Visualization
Neo4j
 
PPTX
Multiple Resolution and handling content available in multiple places
Crossref
 
PPTX
Introduction to Linked Data 1/5
Juan Sequeda
 
PPT
SLA RSS Presentation
August Jackson
 
PDF
20170621_System requirements of data journal platform
Yasuyuki Minamiyama
 
PPTX
Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
Neo4j
 
Google history nd architecture
Divyangee Jain
 
Cited-by Linking
Crossref
 
Data, data, everywhere? Not nearly enough!
UKSG: connecting the knowledge community
 
Collecting and Using Funding Data Crossref
Relawan Jurnal Indonesia
 
Barcelona 2014 CrossRef: What's Coming by Ed Pentz
Crossref
 
Boosting your SEO with data markup
Manoj K G
 
Reference linking and Cited-by
Crossref
 
Collecting and using funding data in your publications
Crossref
 
Data Modeling for Security, Privacy and Data Protection
Karen Lopez
 
The Global reach of Crossref metadata
Crossref
 
Introduction to SharePoint 2013 REST API
QUONTRASOLUTIONS
 
How search engine work ppt
Shubham Chinchkar
 
A secure and dynamic multi
Shakas Technologies
 
MENGGUNAKAN METADATA PADA CROSSREF
Relawan Jurnal Indonesia
 
Empowering Analysts with Graph Visualization
Neo4j
 
Multiple Resolution and handling content available in multiple places
Crossref
 
Introduction to Linked Data 1/5
Juan Sequeda
 
SLA RSS Presentation
August Jackson
 
20170621_System requirements of data journal platform
Yasuyuki Minamiyama
 
Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
Neo4j
 

Similar to Introduction to Elasticsearch for Business Intelligence and Application Insights (20)

PDF
Enhancement of Searching and Analyzing the Document using Elastic Search
IRJET Journal
 
PDF
Elasticsearch Introduction at BigData meetup
Eric Rodriguez (Hiring in Lex)
 
PDF
Elasticsearch speed is key
Enterprise Search Warsaw Meetup
 
PDF
Explore Elasticsearch and Why It’s Worth Using
Inexture Solutions
 
PPTX
Elastic Search Capability Presentation.pptx
Knoldus Inc.
 
PPSX
Elasticsearch - basics and beyond
Ernesto Reig
 
PDF
Core_ElasticSearch_Outline
Sebastian Xu
 
PDF
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 
PDF
Elasticsearch : petit déjeuner du 13 mars 2014
ALTER WAY
 
PDF
Crafting Solutions with the Elastic Stack: pragmatic takes and lessons learned
Guido Lena Cota
 
PDF
Making your elastic cluster perform - Jettro Coenradie - Codemotion Amsterdam...
Codemotion
 
PDF
ElasticSearch - index server used as a document database
Robert Lujo
 
PPTX
Elasticsearch workshop presentation
Laura Steggles
 
PDF
Introduction to Elasticsearch
Ruslan Zavacky
 
PPTX
Elasticsearch - DevNexus 2015
Roy Russo
 
PPTX
ELK Stack Certification Training Hyderabad - Elasticsearch Training in Hydera...
eshwarvisualpath
 
PPTX
Transforming your application with Elasticsearch
Brian Ritchie
 
PDF
Elasticsearch for Data Analytics
Felipe
 
PDF
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Edureka!
 
PPTX
ACM BPM and elasticsearch AMIS25
Getting value from IoT, Integration and Data Analytics
 
Enhancement of Searching and Analyzing the Document using Elastic Search
IRJET Journal
 
Elasticsearch Introduction at BigData meetup
Eric Rodriguez (Hiring in Lex)
 
Elasticsearch speed is key
Enterprise Search Warsaw Meetup
 
Explore Elasticsearch and Why It’s Worth Using
Inexture Solutions
 
Elastic Search Capability Presentation.pptx
Knoldus Inc.
 
Elasticsearch - basics and beyond
Ernesto Reig
 
Core_ElasticSearch_Outline
Sebastian Xu
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 
Elasticsearch : petit déjeuner du 13 mars 2014
ALTER WAY
 
Crafting Solutions with the Elastic Stack: pragmatic takes and lessons learned
Guido Lena Cota
 
Making your elastic cluster perform - Jettro Coenradie - Codemotion Amsterdam...
Codemotion
 
ElasticSearch - index server used as a document database
Robert Lujo
 
Elasticsearch workshop presentation
Laura Steggles
 
Introduction to Elasticsearch
Ruslan Zavacky
 
Elasticsearch - DevNexus 2015
Roy Russo
 
ELK Stack Certification Training Hyderabad - Elasticsearch Training in Hydera...
eshwarvisualpath
 
Transforming your application with Elasticsearch
Brian Ritchie
 
Elasticsearch for Data Analytics
Felipe
 
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Edureka!
 
Ad

More from Data Works MD (18)

PPTX
Data Journalism at The Baltimore Banner
Data Works MD
 
PDF
Jolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
Data Works MD
 
PPTX
Introducing DataWave
Data Works MD
 
PDF
Malware Detection, Enabled by Machine Learning
Data Works MD
 
PPTX
Using AWS, Terraform, and Ansible to Automate Splunk at Scale
Data Works MD
 
PDF
A Day in the Life of a Data Journalist
Data Works MD
 
PPTX
Robotics and Machine Learning: Working with NVIDIA Jetson Kits
Data Works MD
 
PDF
Connect Data and Devices with Apache NiFi
Data Works MD
 
PDF
Introduction to Machine Learning
Data Works MD
 
PPTX
Data in the City: Analytics and Civic Data in Baltimore
Data Works MD
 
PPTX
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Data Works MD
 
PPTX
Automated Software Requirements Labeling
Data Works MD
 
PPTX
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
Data Works MD
 
PDF
RAPIDS – Open GPU-accelerated Data Science
Data Works MD
 
PPTX
Two Algorithms for Weakly Supervised Denoising of EEG Data
Data Works MD
 
PDF
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Data Works MD
 
PDF
Predictive Analytics and Neighborhood Health
Data Works MD
 
PPTX
Social Network Analysis Workshop
Data Works MD
 
Data Journalism at The Baltimore Banner
Data Works MD
 
Jolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
Data Works MD
 
Introducing DataWave
Data Works MD
 
Malware Detection, Enabled by Machine Learning
Data Works MD
 
Using AWS, Terraform, and Ansible to Automate Splunk at Scale
Data Works MD
 
A Day in the Life of a Data Journalist
Data Works MD
 
Robotics and Machine Learning: Working with NVIDIA Jetson Kits
Data Works MD
 
Connect Data and Devices with Apache NiFi
Data Works MD
 
Introduction to Machine Learning
Data Works MD
 
Data in the City: Analytics and Civic Data in Baltimore
Data Works MD
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Data Works MD
 
Automated Software Requirements Labeling
Data Works MD
 
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
Data Works MD
 
RAPIDS – Open GPU-accelerated Data Science
Data Works MD
 
Two Algorithms for Weakly Supervised Denoising of EEG Data
Data Works MD
 
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Data Works MD
 
Predictive Analytics and Neighborhood Health
Data Works MD
 
Social Network Analysis Workshop
Data Works MD
 
Ad

Recently uploaded (20)

PDF
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
 
PDF
Bridging CAD, IBM TRIRIGA & GIS with FME: The Portland Public Schools Case
Safe Software
 
PDF
Next level data operations using Power Automate magic
Andries den Haan
 
PDF
Automating the Geo-Referencing of Historic Aerial Photography in Flanders
Safe Software
 
PDF
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
PDF
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
PDF
''Taming Explosive Growth: Building Resilience in a Hyper-Scaled Financial Pl...
Fwdays
 
PDF
My Journey from CAD to BIM: A True Underdog Story
Safe Software
 
PPTX
Smart Factory Monitoring IIoT in Machine and Production Operations.pptx
Rejig Digital
 
PPTX
The birth and death of Stars - earth and life science
rizellemarieastrolo
 
PDF
LLM Search Readiness Audit - Dentsu x SEO Square - June 2025.pdf
Nick Samuel
 
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
 
PDF
Darley - FIRST Copenhagen Lightning Talk (2025-06-26) Epochalypse 2038 - Time...
treyka
 
PDF
Simplify Your FME Flow Setup: Fault-Tolerant Deployment Made Easy with Packer...
Safe Software
 
PDF
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
PPSX
Usergroup - OutSystems Architecture.ppsx
Kurt Vandevelde
 
PPTX
2025 HackRedCon Cyber Career Paths.pptx Scott Stanton
Scott Stanton
 
PDF
DoS Attack vs DDoS Attack_ The Silent Wars of the Internet.pdf
CyberPro Magazine
 
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
 
FME as an Orchestration Tool with Principles From Data Gravity
Safe Software
 
Bridging CAD, IBM TRIRIGA & GIS with FME: The Portland Public Schools Case
Safe Software
 
Next level data operations using Power Automate magic
Andries den Haan
 
Automating the Geo-Referencing of Historic Aerial Photography in Flanders
Safe Software
 
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
 
Redefining Work in the Age of AI - What to expect? How to prepare? Why it mat...
Malinda Kapuruge
 
''Taming Explosive Growth: Building Resilience in a Hyper-Scaled Financial Pl...
Fwdays
 
My Journey from CAD to BIM: A True Underdog Story
Safe Software
 
Smart Factory Monitoring IIoT in Machine and Production Operations.pptx
Rejig Digital
 
The birth and death of Stars - earth and life science
rizellemarieastrolo
 
LLM Search Readiness Audit - Dentsu x SEO Square - June 2025.pdf
Nick Samuel
 
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
 
Darley - FIRST Copenhagen Lightning Talk (2025-06-26) Epochalypse 2038 - Time...
treyka
 
Simplify Your FME Flow Setup: Fault-Tolerant Deployment Made Easy with Packer...
Safe Software
 
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
Usergroup - OutSystems Architecture.ppsx
Kurt Vandevelde
 
2025 HackRedCon Cyber Career Paths.pptx Scott Stanton
Scott Stanton
 
DoS Attack vs DDoS Attack_ The Silent Wars of the Internet.pdf
CyberPro Magazine
 
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
Why aren't you using FME Flow's CPU Time?
Safe Software
 

Introduction to Elasticsearch for Business Intelligence and Application Insights

  • 1. Introduction to Elasticsearch for Business Intelligence and Application Insights Presented by Sean Donnelly at Data Works MD – May 29th 2019
  • 2. @DonSeannelly http://donseannelly.com https://github.com/donseannelly Sean Donnelly Software Architect at Cyber4All https://clark.centerCLARK Cyber4All https://towson.edu/cyber4all
  • 3. Outline • What we will cover • Introduction to Elasticsearch • Fundamentals of Storage and Retrieval in Elasticsearch • Why we are using it at CLARK • The Elastic Stack • Use Beyond Search • What we will not cover • Clusters, Nodes, Scaling, Sharding and other nitty-gritty details for getting to know Elasticsearch • Details on standing up the Elastic Stack
  • 4. What is Elasticsearch? RESTful search and analytics engine
  • 5. What does it do? • Uses Lucene indices under the hood • Provides built-in tokenizers and analyzers • Provides a score for the relevancy of each result to the query • Provides binary matching via filters • Supports pagination out of the box
  • 6. PUT meetups/_doc/1 { ”name" : ”Data Works MD", "date" : "2019-05-29T18:30:00" } Indexing a Document
  • 7. Getting a Document GET meetups/_doc/1 { "_index" : ”meetups", "_type" : "_doc", "_id" : ”1", "_version" : 1, "_seq_no" : 10, "_primary_term" : 1, "found": true, "_source" : { ”name" : ”Data Works MD", "date" : "2019-05-29T18:30:00” } }
  • 8. Searching for Documents GET meetups/_search?q=name:Data Works MD … "hits": { "total" : { "value": 1, "relation": "eq" }, "max_score": 1.3862944, "hits" : [ { "_index" : ”meetups", "_type" : "_doc", "_id" : ”1", "_score" : 1.3862944, "_source" : { ”name" : ”Data Works MD", "date" : "2019-05-29T18:30:00” } } ] }
  • 9. Elasticsearch at CLARK • Goals for search: • Fast • Full text • Word stemming, stop words, fuzziness • We already had documents formatted in MongoDB • Our searchable data is read-only in the application
  • 13. Key Use Cases • Application Search • Application Performance Monitoring • have our response times slowed significantly? • Logging • explore the logs that we rarely (never) read • Machine Learning • Alert me if there is an unusual increase in abandoned shopping carts • Alert me if there is an unusual change in HTTP status codes for a resource

Editor's Notes

  • #3: I am an Elasticsearch enthusiast
  • #5: Allows you to index your data and make it searchable
  • #6: Fast – inverted indices, finite state transducers, BKD trees for numeric and geo data, column store for analytics Prototype to Production easily – interact with one node the way you would with 1000
  • #10: Any action taken on it is actually creating a new document
  • #13: This modular approach allows you to build fantastic, custom data pipelines for ingestion. Integrate your existing pipeline directly into Elasticsearch Ingest message queues into Logstash Ship filebeats to logstash or directly to elasticsearch