SlideShare a Scribd company logo
Microsoft Confidential
Understanding the latent value in all content
Text
(1) Validate enrichment pipeline
Tags
“throwing”, “ball”, “girl”, “grass”, “basketball”
Caption
“A girl throwing a ball”
Entities
Persons
“Anita Christiansen”,
“Conrad Nuber”,
Locations
“Bothell”, “Woodinville”
Organization
“Litware Insurance Corp.”
Knowledge Mining with Azure Search Technical Deck
Computer Vision
Face
Emotion
Content Moderator
Video Indexer
Custom Vision
Service
Custom Decision
Q-n-A Maker
Language
Understanding (LUIS)
Text Analytics
Bing Spell Check
Translator Text
Speaker
Recognition
Bing Speech
Custom Speech
Translator Speech
Unified Speech
Service
Bing Autosuggest
Bing Search
Bing Entity Search
Bing Statistics add-in
Bing Visual Search
Bing Custom Search
Management Free
Keyword search
Faceting
Geospatial support
Multi-Language Support
Suggestions/auto-complete
Customizable scoring models
Proximity Search
Synonyms
etc.
INGEST
Data in any
format, any
Azure store
ENRICH EXPLORE
Annotations
Cognitive skills
Search
Annotated
Documents
Customer
Data
Built-in Cognitive Skills
OCR,
Key Phrase Extraction,
People Names,
Company Names,
Sentiment Analyzer,
Computer Vision,
etc.
Search
Index
.pdf
.doc
.jpeg
…
Third Party Enrichers
Custom classification models,
Custom entity extraction,
etc.
Azure Machine
Learning
Annotated
Documents
Built-in Cognitive Skills
OCR,
Key Phrase Extraction,
People Names,
Company Names,
Sentiment Analyzer,
Computer Vision,
etc.
Search
Index
Third Party Enrichers
Custom classification models,
Custom entity extraction,
etc.
Azure Machine
Learning
Customer
Data
.pdf
.doc
.jpeg
…
Knowledge Mining with Azure Search Technical Deck
Annotated
Documents
Search
Index
Built-in Cognitive Skills
OCR,
Key Phrase Extraction,
People Names,
Company Names,
Sentiment Analyzer,
Computer Vision,
etc.
Third Party Enrichers
Custom classification models,
Custom entity extraction,
etc.
Customer
Data
.pdf
.doc
.jpeg
…
Key Phrase Extraction
Sentiment Analysis
Organization Entity Extraction
Location Entity Extraction
Persons Entity Extraction
Language Detection
Face Detection
Tag Extraction
Celebrity Recognition
Landmark Detection
Handwriting Recognition (Preview)
Printed Text Recognition
Knowledge Mining with Azure Search Technical Deck
…,
{
"@odata.type": "#Microsoft.Skills.Custom.WebApiSkill",
"uri" "https://myskill.azurewebsites.net/api/OrgId"
"httpHeaders": {"Api-Key": "mySecret" },
"context": "/document/organizations/*" ,
"inputs":
[
{ "name": “organizationName", "source": "/document/organizations/*" },
],
"outputs":
[
{ "name": "organizationId", "targetName": "organizationId" }
]
},
{
"values": [
{
"recordId": "7cad2",
"data":
{
"myOuput1": “animals"
}
},
{
"recordId": "7cad3",
"data":
{
"myOutput1": “colors"
}
},
…
]
}
{
"values": [
{
"recordId": "7cad2",
"data":
{
"myInput1": "fox",
"myInput2": "cat",
}
},
{
"recordId": "7cad3",
"data":
{
"myInput1": "blue",
"myInput2": "red",
}
},
…
]
}
Azure Machine
Learning
content
keyPhrases
organizations
docClass
content
normalized
images
language
tags
orgs
content
content
"skills": [
{
"@odata.type": "#Microsoft.Skills.Text.LanguageDetectionSkill",
"inputs":
[
{ "name": "text", "source": "/document/content" }
],
"outputs":
[
{ "name": "languageCode", "targetName": "myLanguageCode" },
{ "name": "languageName", "targetName": "myLanguageName" }
]
},
…,
{
"@odata.type": "#Microsoft.Skills.Text.NamedEntityRecognitionSkill",
"categories": [ "Organization" ],
"defaultLanguageCode": "en",
"inputs":
[
{ "name": "text", "source": "/document/content" },
"name" "languageCode" "source" "/document/myLanguageCode"
],
"outputs":
[
{ "name": "organizations", "targetName": "organizations" }
]
},
content
normalized
images
language
tags
orgs
content
content
…,
{
"@odata.type": "#Microsoft.Skills.Text.NamedEntityRecognitionSkill",
"categories": [ "Organization" ],
"defaultLanguageCode": "en",
"inputs":
[
{ "name": "text", "source": "/document/content" },
"name" "languageCode" "source" "/document/myLanguagecode"
],
"outputs":
[
{ "name": "organizations", "targetName": "organizations" }
]
},
…,
{
"@odata.type": "#Microsoft.Skills.Custom.WebApiSkill",
"uri" "https://myskill.azurewebsites.net/api/OrgId"
"context": "/document/organizations/*" ,
"httpHeaders": {"Api-Key": "mySecret" },
"inputs":
[
{ "name": “organizationName", "source": "/document/organizations/*" },
],
"outputs":
[
{ "name": "organizationId", "targetName": "organizationId" }
]
},
Search
Index
Built-in Cognitive Skills
OCR,
Key Phrase Extraction,
People Names,
Company Names,
Sentiment Analyzer,
Computer Vision,
etc.
Third Party Enrichers
Custom classification models,
Custom entity extraction,
etc.
Customer
Data
.pdf
.doc
.jpeg
…
Annotated
Documents
/document
/languageCode /keyPhrases /organizations /images
/1
/2
/…
/n
/1
/2
/…
/n
organizationId
organizationId
organizationId
organizationId
/1
/2
/…
/n
tags
tags
tags
tags
document.pdf
Annotated
Documents
Built-in Cognitive Skills
OCR,
Key Phrase Extraction,
People Names,
Company Names,
Sentiment Analyzer,
Computer Vision,
etc.
Third Party Enrichers
Custom classification models,
Custom entity extraction,
etc.
Customer
Data
.pdf
.doc
.jpeg
…
Search
Index
/document
/keyPhrases
/0
/1
/…
/n
/organizations
/0
/1
/…
/n
organizationId
organizationId
organizationId
organizationId
/images
/0
/1
/…
/n
tags
tags
tags
tags
New Indexer Property
{
…
"outputFieldMappings":
[
{
"sourceFieldName":
"/document/organizations/*/organizationId",
"targetFieldName":
"orgIds"
} ,
…
]
}
Annotated
Documents
Customer
Data
Built-in Cognitive Skills
OCR,
Key Phrase Extraction,
People Names,
Company Names,
Sentiment Analyzer,
Computer Vision,
etc.
Search
Index
.pdf
.doc
.jpeg
…
Third Party Enrichers
Custom classification models,
Custom entity extraction,
etc.
Azure Machine
Learning
Knowledge Mining with Azure Search Technical Deck
“Lorem ipsum dolor sit amet,
consectetur adipiscing elit, sed
do eiusmod tempor incididunt ut
labore et dolore magna aliqua. Ut
enim ad minim veniam, quis
nostrud exercitation ullamco
laboris nisi…”
Class A
Class B
Class C
“Lorem ipsum dolor sit amet,
consectetur adipiscing elit, sed
do eiusmod tempor incididunt ut
labore et dolore magna aliqua. Ut
enim ad minim veniam, quis
nostrud exercitation ullamco
laboris nisi…”
“Lorem ipsum dolor sit amet,
consectetur adipiscing elit, sed
do eiusmod tempor incididunt ut
labore et dolore magna aliqua. Ut
enim ad minim veniam, quis
nostrud exercitation ullamco
laboris nisi…”
Entity type A
Entity type B
Knowledge Mining with Azure Search Technical Deck
Knowledge Mining with Azure Search Technical Deck
Labeled
Data
Custom
Entity
Extraction
Template
Azure ML
Annotated
Documents
Customer
Data
Search
Index
Knowledge Mining with Azure Search Technical Deck
Cognitive Search
Documentation | Sign up for Azure Search
Azure Machine Learning Package for Text Analytics
Documentation | Create a Data Science Virtual Machine
Cognitive Services
Documentation | Sign up
Knowledge Mining with Azure Search Technical Deck

More Related Content

PPTX
Hospital Profile PPT.PPTX
PPTX
WAREHOUSE .pptx
PPTX
Management Information Systems in Apollo Hospitals
PDF
be.as Manufacturing for SAP Business One
PDF
Cognitive search oil and gas
PPTX
Cognitive Search: Announcing the smartest enterprise search engine, now with ...
ODP
The need for sophistication in modern search engine implementations
PPTX
Pepper worldparis 2017 ai4all
Hospital Profile PPT.PPTX
WAREHOUSE .pptx
Management Information Systems in Apollo Hospitals
be.as Manufacturing for SAP Business One
Cognitive search oil and gas
Cognitive Search: Announcing the smartest enterprise search engine, now with ...
The need for sophistication in modern search engine implementations
Pepper worldparis 2017 ai4all

Similar to Knowledge Mining with Azure Search Technical Deck (7)

PPTX
Notes from the field on customizing your AI using Cognitive Services
PDF
関西Azure勉強会 Cognitive Services アップデート_20180628
PDF
Microsoft Cognitive Service, Tap into the Power of Machine Learning with Easy...
PPTX
Understanding Pre-Built AI: AI for Every Developer
PDF
Use O365 and Azure Cognitive Services for intelligent search
PDF
Azure Cognitive Services
PDF
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Notes from the field on customizing your AI using Cognitive Services
関西Azure勉強会 Cognitive Services アップデート_20180628
Microsoft Cognitive Service, Tap into the Power of Machine Learning with Easy...
Understanding Pre-Built AI: AI for Every Developer
Use O365 and Azure Cognitive Services for intelligent search
Azure Cognitive Services
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Ad

More from Nicholas Vossburg (19)

PPTX
SAP on Azure Technical Pitch Deck
PPTX
NoSQL Migration Technical Pitch Deck
PPTX
NoSQL Migration to Azure Cosmos DB Pitch Deck
PPTX
Cosmos DB Tech Pitch
PPTX
Azure Cosmos DB Pricing 101 Infographic
PPTX
Azure Comsos DB Use Cases
PPTX
Linux on Azure Pitch Deck
PPTX
High Performance Computing Pitch Deck
PPTX
Machine Learning Pitch Deck
PPTX
Deep Learning Technical Pitch Deck
PPTX
Internet of Things Pitch Deck
PPTX
Cloud Scale Analytics Pitch Deck
PPTX
Azure Database Services for MySQL PostgreSQL and MariaDB
PPTX
Azure Cosmos DB L100 Pitch Deck
PPTX
Microsoft Cloud Adoption Framework for Azure: Thru Partner Governance Workshop
PPTX
Microsoft Cloud Adoption Framework for Azure: Governance Conversation
PPTX
Azure Migration Program Overview
PPTX
Azure Migration Program Pitch Deck
PPTX
Windows Server 2008 End of Support Pitch Deck
SAP on Azure Technical Pitch Deck
NoSQL Migration Technical Pitch Deck
NoSQL Migration to Azure Cosmos DB Pitch Deck
Cosmos DB Tech Pitch
Azure Cosmos DB Pricing 101 Infographic
Azure Comsos DB Use Cases
Linux on Azure Pitch Deck
High Performance Computing Pitch Deck
Machine Learning Pitch Deck
Deep Learning Technical Pitch Deck
Internet of Things Pitch Deck
Cloud Scale Analytics Pitch Deck
Azure Database Services for MySQL PostgreSQL and MariaDB
Azure Cosmos DB L100 Pitch Deck
Microsoft Cloud Adoption Framework for Azure: Thru Partner Governance Workshop
Microsoft Cloud Adoption Framework for Azure: Governance Conversation
Azure Migration Program Overview
Azure Migration Program Pitch Deck
Windows Server 2008 End of Support Pitch Deck
Ad

Recently uploaded (20)

PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
August Patch Tuesday
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Encapsulation theory and applications.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Getting Started with Data Integration: FME Form 101
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Univ-Connecticut-ChatGPT-Presentaion.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
August Patch Tuesday
Reach Out and Touch Someone: Haptics and Empathic Computing
A comparative analysis of optical character recognition models for extracting...
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Diabetes mellitus diagnosis method based random forest with bat algorithm
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Assigned Numbers - 2025 - Bluetooth® Document
Encapsulation theory and applications.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
NewMind AI Weekly Chronicles - August'25-Week II
Getting Started with Data Integration: FME Form 101
Encapsulation_ Review paper, used for researhc scholars
Mobile App Security Testing_ A Comprehensive Guide.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
A Presentation on Artificial Intelligence
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

Knowledge Mining with Azure Search Technical Deck

Editor's Notes

  • #3: Understanding latent value in all content
  • #8: I verified accuracy of this slide with Giampaolo. Notes : Voice Font is part of Unified Speech Service. Custom Decision is not out for //build.
  • #10: INGEST (Understanding documents in a variety of format) AUGMENT (Extract “information”, Create structure out of the unstructured.) EXPLORE (Search)
  • #13: MongoDB?
  • #18: TODO: Change properties (foo  bar)
  • #20: We use the term skillset to refer to all the skills that should be run as part of the enrichment process. In a basic example…
  • #21: Sometimes you need to do something more complex. For instance, you may want to use the language you detected to improve the accuracy of the key-phrase extractor. Or you may want to get metadata of metadata.
  • #24: Sometimes you need to do something more complex. For instance, you may want to use the language you detected to improve the accuracy of the key-phrase extractor. Or you may want to get metadata of metadata.
  • #28: At each step of enrichment more structure is added to the document. Before-a-skill and after-a-skill diagram. (SHOW RESTFUL CALL)
  • #30: At each step of enrichment more structure is added to the document. Before-a-skill and after-a-skill diagram. (SHOW RESTFUL CALL)
  • #35: http://medicalentitydetector.azurewebsites.net/
  • #39: TODO: Add link to Cognitive Services.