SlideShare a Scribd company logo
Web data is mostly incomplete
• Wikidata misses the fact that M. Sottile is an actor of Reservoir Dogs
• YAGO covers only 24,770 hasChild facts
• DBpedia contains 1 out of 29 spouses of Glynn Wolfe1
1 Guinness record holder for world’s most married man
Yet many topics are complete
cast member
publication date
Matt Damon
Casey Affleck
2002
. . . . . . . . . . . . . . . . . . . . . . . . . . .
Yet many topics are complete
. . . . . . . . . . . . . . . . . . . . . . . . . . .
MANAGING AND CONSUMING
COMPLETENESS INFORMATION
FOR RDF DATA SOURCES
Fariz Darari
Supervisors: Werner Nutt and Sebastian Rudolph
PhD Thesis Defense in Bolzano on July 14, 2017
Research questions
• Formalize?
How can we formalize statements about RDF completeness?
• Use?
How can completeness statements be leveraged for checking
the completeness and soundness of SPARQL queries?
• Optimize?
How do we efficiently perform completeness reasoning?
• Create?
How can we collect completeness statements?
Formalize Use Optimize Create
Related Work
• Completeness framework for relational databases
(Motro, 1989),(Levy, 1996),(Razniewski and Nutt, 2011)
• Closed predicates for Description Logics
(Lutz et al, 2013),(Ngo et al, 2016)
• Completeness types on the Semantic Web:
Schema, property, population, interlinking
(Fürber and Hepp, 2011),(Zaveri et al, 2016)
(gerry,actor,matt)
(gerry,actor,casey)
G
Give me Gerry’s actors
SELECT * WHERE
{ gerry actor ?a }
?a/matt
?a/casey
variable
(S,P,O)
Background:
RDF graphs and SPARQL queries
BGP Fragment of SPARQL
(gerry,actor,leo)
(titanic,actor,leo)
G’
Completeness statements
Complete for all of Gerry’s actors
Compl({(gerry,actor,?a)})
(gerry,actor,matt)
(gerry,actor,casey)
X
G
The extension pair (G,G’) satisfies the statement
In general:
>=1 triple pattern
Query completeness: Titanic
(gerry,actor,matt)
(gerry,actor,casey)
Give me Titanic’s actors
SELECT * WHERE
{ titanic actor ?a }
(titanic,actor,leo)
?a/leo- Incomplete!
G G’
Query completeness: Gerry
(gerry,actor,matt)
(gerry,actor,casey)
Give me Gerry’s actors
SELECT * WHERE
{ gerry actor ?a }
(titanic,actor,leo)
?a/matt
?a/casey
?a/matt
?a/casey Complete!
G G’
Recap: Research questions
Formalize Use Optimize Create
We have formalized completeness statements
and query completeness by means of extension pairs
Checking query completeness:
Data-agnostic
Query Q
Completeness statements C
Boolean isComplete
For every extension pair (G,G’) satisfying C,
it is the case [[Q]]G’ ⊆ [[Q]]G
Completeness
checking
Data-agnostic completeness checking:
Example
Give me Gerry’s actors
SELECT * WHERE
{ gerry actor ?a }
Data Source
Compl({(gerry,actor,?a)})
...
Query completeness guarantee: Yes!
Give me Gerry’s actors along with their children and spouses
SELECT * WHERE
{ gerry actor ?a . ?a child ?c . ?a spouse ?s }
?a/a
?c/c
?s/s
G
G’
Assume some answer over G’
(gerry,actor,a)
(a,child,c)
(a,spouse,s)
Compl({(gerry,actor,?a),(?a,child,?c)})
Compl({(?a,spouse,?s)})
.....
?a/a
?c/c
?s/s
Complete!
Data-agnostic completeness checking:
More example
Data Source
Compl({(gerry,actor,?a)})
Compl({(matt,child,?c)})
Compl({(casey,child,?c)})
Query completeness guarantee: No!
Give me Gerry’s actors
and their children
SELECT * WHERE
{ gerry actor ?a .
?a child ?c }
Data-agnostic completeness checking:
Limitation
Checking query completeness:
Data-aware
Query Q
Completeness statements C
Boolean isComplete
Completeness
checking
Graph G
For every extension pair (G,G’) satisfying C,
it is the case [[Q]]G’ ⊆ [[Q]]G
Data-aware completeness checking:
Example
Compl({(gerry,actor,?a)})
Compl({(matt,child,?c)})
Compl({(casey,child,?c)})
...(gerry,actor,matt)
(gerry,actor,casey)
Give me Gerry’s actors
and their children
SELECT * WHERE
{ gerry actor ?a .
?a child ?c }
Compl({(gerry,actor,?a)})
Compl({(matt,child,?c)})
Compl({(casey,child,?c)})
...(gerry,actor,matt)
(gerry,actor,casey)
Give me Gerry’s actors
and their children
SELECT * WHERE
{ gerry actor ?a .
?a child ?c }
Give me Matt’s children
SELECT * WHERE
{ matt child ?c } +
Give me Casey’s children
SELECT * WHERE
{ casey child ?c }
Query completeness guarantee: Yes!
Query completeness guarantee: Yes!
Query completeness guarantee: Yes!
Data-aware completeness checking:
Example
Query Q
Completeness statements C
Boolean isComplete
Completeness
checking
Graph G
Divide-and-conquer Algorithm
Take the body P of the query Q
isComplete(P,C,G):
if P = { } // base case
return true
P’ := FindMatch(P,C,G)
if P’ = { }
return false
else
{ u1, u2, …, un } := [[P’]]G
return isComplete(u1P’’,C,G) ∧ isComplete(u2P’’,C,G) ∧ … ∧ isComplete(unP’’,C,G)
P’’ := P  P’
Data-aware completeness checking:
Algorithm
Query Q
Completeness statements C
Boolean isComplete
Completeness
checking
Graph G
* P is the body of Q
** X denotes “fixed”
Data-aware completeness checking:
Complexity
Recap: Research questions
Formalize Use Optimize Create
We have characterized the problem
of checking query completeness
for data-agnostic setting and data-aware setting
Give me the actors who were not acting in Gerry
SELECT * WHERE
{ ?a type Actor .
FILTER NOT EXISTS { gerry actor ?a }}
Queries with negation
- --
Answers
Incomplete data may lead to unsound answers!
In general:
Multiple negation parts
Query Q
Completeness statements C
Boolean isSound
Soundness
checkingGraph G
Checking answer soundness
Answer u
For every extension pair (G,G’) satisfying C,
if u ∈ [[Q]]G then u ∈ [[Q]]G’
Compl({(?m,actor,leo)})
(leo,type,Actor)
(matt,type,Actor)
(titanic,actor,leo)
Give me the actors who were not acting in Gerry
SELECT * WHERE
{ ?a type Actor .
FILTER NOT EXISTS { gerry actor ?a }}
?a/leo : Answer soundness guarantee?
?a/matt: Answer soundness guarantee?
Checking answer soundness:
Example
Compl({(?m,actor,leo)})
(leo,type,Actor)
(matt,type,Actor)
(titanic,actor,leo)
Give me the actors who were not acting in Gerry
SELECT * WHERE
{ ?a type Actor .
FILTER NOT EXISTS { gerry actor ?a }}
?a/leo : Answer soundness guarantee?
Was Leo acting in Gerry?
SELECT * WHERE
{ gerry actor leo }
Query completeness guarantee: Yes!?a/leo : Answer soundness guarantee? Yes!
Checking “Leo” soundness
Compl({(?m,actor,leo)})
(leo,type,Actor)
(matt,type,Actor)
(titanic,actor,leo)
Give me the actors who were not acting in Gerry
SELECT * WHERE
{ ?a type Actor .
FILTER NOT EXISTS { gerry actor ?a }}
Checking “Matt” soundness
?a/matt: Answer soundness guarantee?
Was Matt acting in Gerry?
SELECT * WHERE
{ gerry actor matt }
Query completeness guarantee: No!?a/matt: Answer soundness guarantee? No!
Query Q
Completeness statements C
Boolean isSound
Soundness
checkingGraph G
Answer u
The answer u is sound wrt C, Q, and G
iff
each positive query generated from
applying u over each negative part of the query Q
is complete wrt C and G
Reduction Theorem
Checking answer soundness
by checking data-aware completeness
Query Q
Completeness statements C
Boolean isSound
Soundness
checking
Checking pattern soundness
For every extension pair (G,G’) satisfying C,
it is the case [[Q]]G ⊆ [[Q]]G’
Data Source
Compl({(gerry,actor,?a)})
Give me the actors who were not acting in Gerry
SELECT * WHERE
{ ?a type Actor .
FILTER NOT EXISTS { gerry actor ?a }}
Pattern soundness guarantee?
Checking pattern soundness:
Example
Data Source
Compl({(gerry,actor,?a)})
Give me the actors who were not acting in Gerry
SELECT * WHERE
{ ?a type Actor .
FILTER NOT EXISTS { gerry actor ?a }}
Pattern soundness guarantee?
Give me actors acting in Gerry
SELECT * WHERE
{ gerry actor ?a }
Query completeness guarantee: Yes!
Pattern soundness guarantee? Yes!
Checking pattern soundness:
Example
The query Q is sound wrt C
iff
each negative part of the query Q is complete wrt C
Reduction Theorem
Query Q
Completeness statements C
Boolean isSound
Soundness
checking
Checking pattern soundness
by checking data-agnostic completeness
Recap: Research questions
Formalize Use Optimize Create
We have characterized the problem of checking
query completeness and query soundness
Linked Open Quality Data Cloud
Are we ready to reason with big quality data?
Data-agnostic optimization:
Ignoring irrelevant statements
Data Source
Compl({(gerry,actor,?a)})
Compl({(?m,actor,?a)})
...
Compl({(titanic,actor,?a)})
Compl({(?p,spouse,?s)})
...
Give Gerry’s actors
SELECT * WHERE
{ gerry actor ?a } Constants
gerry,actor
Constants
gerry,actor
actor
. . .
titanic,actor
spouse
. . .
A statement contributes to query completeness only if
its constants are among the query’s
Data-agnostic optimization:
Evaluation
Setup
Queries: BGP queries from LSQ query logs dataset
Completeness statements: For every query as above, take the body P,
and construct Compl(P)
Reasoning implementation: Java with Apache Jena library
Results
Source NQ NC T-QE T-CR Overhead
DBpedia 334,000 331,000 18 ms 0.08 ms 0.44%
SWC 108,000 44,000 36 ms 0.12 ms 0.33%
LGD 22,000 21,000 8 ms 0.05 ms 0.60%
NQ: Number of queries
NC: Number of non-redundant statements
T-QE: Average runtime of query evaluation
T-CR: Average runtime of completeness reasoning
Data-aware reasoning
cannot reuse data-agnostic optimization
Compl({(gerry,actor,?a),(?a,child,?c)})
Compl({(titanic,actor,?a),(?a,child,?c)})
Compl({(frozen,actor,?a),(?a,child,?c)})
...
Give Matt’s children
SELECT * WHERE
{ matt child ?c }
(gerry,actor,matt)
(gerry,actor,casey)
Query completeness guarantee: Yes!
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
We capture similar completeness statements by
completeness templates
Complete for all actors of movies:
Pulp Fiction, Gerry, . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Data-aware optimization:
Completeness templates
Complete for all street names of communities:
Abingdon, Shippon, . . . .
Complete for X:
Instance 1, Instance 2, …
OpenStreetMap
Compl({(gerry,actor,?a),(?a,child,?c)})
Compl({(titanic,actor,?a),(?a,child,?c)})
Compl({(frozen,actor,?a),(?a,child,?c)})
...
Give Matt’s children
SELECT * WHERE
{ matt child ?c }
(gerry,actor,matt)
(gerry,actor,casey)
Query completeness guarantee: Yes!
Completeness templates enable
simultaneous processing of statements
Compl({(?m,actor,?a),(?a,child,?c)})
s.t. {?m/gerry,?m/titanic,?m/frozen,...}
1 ?a/matt
2 ?m/gerry
StatementsTemplate
Data-aware optimization:
Evaluation
Setup
Graph: Direct-statement fragment of Wikidata (110 mio triples)
Queries and statements: Based on 66 BGP queries from Wikidata query examples page
Each BGP query acts as a base to generate test queries and statements
Reasoning implementation: Java with Apache Jena library
Triple store: Apache Jena TDB
Results
Average reasoning time with 1,100 queries and
• 445,000 statements : 15 s
vs.
• 66 templates : 140 ms
Recap: Research questions
Formalize Use Optimize Create
We have seen
how to efficiently check query completeness
by indexing completeness statements
Linked Open Quality Data Cloud
How can we create big quality data?
Creating completeness statements
via cardinality extraction
“Wikidata is complete
for all children of Trump”
Around 19% numbers on Wikipedia
express relation cardinalities!
Count: 5
Machine learning task:
Given text, find the correct cardinalities
of certain relations
|(trump,child,*)| = 5
• Trump has five children
• Trump involved in more than 3500 legal actions
• Manafort is one of his campaign managers.
Texts with numbers on Trump’s Wikipedia page:
Label positive when:
1. Number corresponds to the exact triple count
2. Number modifies a noun
Training data generation for relation “child”
Distantly-supervised training data
Training data is labeled automatically based on some rules
Count: 5
Sequence labeling classifier
learn which context words often occur around the positively labeled data
and apply it to new data
Matt has two children
Child cardinality?
Experimental evaluation gives mixed precision results:
• hasPart for creative work : 58%
• hasPart for musical ensemble : 80%
• contains administrative division : 85%
• child : 38%
• spouse : 84%
Recap: Research questions
Formalize Use Optimize Create
We have seen how one can extract cardinality information
from text on the Web that can be leveraged
to generate completeness statements
Other research results
• Data can grow and therefore
completeness statements can become outdated…
Solution: Time-aware completeness reasoning
• How can we create completeness statements
in a collaborative way?
Demonstrator systems: CORNER and COOL-WD
Conclusions
 Formalize
 Use
 Optimize
 Create
Future work
? Richer operators for statements
“Complete for all unibz PhD students who don’t speak German
and who are (above 40) or (below 20) of age”
? OWL incorporation
The statement Compl({(?m,type,Centaur)}) can ensure
the completeness of the query “Give me all men who are also horses”
if we know that: Centaur Man ⨅ Horse
? Completeness case studies with RDF data publishers
What do communities of Wikidata, YAGO, and
DBpedia think of completeness (and its use cases)?
≡
Publications
C1. Paramita Mirza, Simon Razniewski, Fariz Darari, Gerhard Weikum:
Cardinal Virtues: Extracting Relation Cardinalities from Text. ACL (Short Papers) 2017.
C2. Fariz Darari, Simon Razniewski, Radityo Eko Prasojo, Werner Nutt:
Enabling Fine-grained RDF Data Completeness Assessment. ICWE 2016.
W1. Radityo Eko Prasojo, Fariz Darari, Simon Razniewski, Werner Nutt: Managing and
Consuming Completeness Information for Wikidata Using COOL-WD. COLD 2016.
P1. Fariz Darari, Radityo Eko Prasojo, Werner Nutt: Expressing No-Value Information in RDF.
ISWC (Posters & Demos) 2015. (Nominated for Best Poster)
P2. Fariz Darari, Simon Razniewski, Werner Nutt: Bridging the Semantic Gap between RDF and
SPARQL Using Completeness Statements. ISWC (Posters & Demos) 2014.
D1. Fariz Darari, Radityo Eko Prasojo, Werner Nutt: CORNER: A Completeness Reasoner
for SPARQL Queries Over RDF Data Sources. ESWC (Posters & Demos) 2014.
Potential Publications
J1. Fariz Darari, Werner Nutt, Simon Razniewski, Sebastian Rudolph:
Ensuring the Completeness and Soundness of SPARQL Queries
Using Completeness Statements about RDF Data Sources.
To be submitted to a journal.
J2. Fariz Darari, Werner Nutt, Giuseppe Pirro,Simon Razniewski:
Completeness Management for RDF Data Sources. Under revision for a journal.
C3. Paramita Mirza, Simon Razniewski, Fariz Darari, Gerhard Weikum:
Extracting Quantifications of Knowledge Base Facts from Text.
To be submitted to a conference.
D2. Erdal Domi, Simon Razniewski, Fariz Darari, Werner Nutt: ProWD: Profiling WikiData.
To be submitted to ISWC 2017 (Posters & Demos).
D3. Fariz Darari, Radityo Eko Prasojo, Simon Razniewski, Werner Nutt:
COOL-WD: A Completeness Tool for Wikidata. Submitted to ISWC 2017 (Posters & Demos).
Thanks!
Technical Appendix
Algorithm
for data-aware completeness checking
Query Q
Completeness statements C
Boolean isComplete
Completeness
checking
Graph G
Divide-and-conquer Algorithm
Take the body P of the query Q
isComplete(P,C,G):
if P = { } // base case
return true
P’ := FindMatch(P,C,G)
if P’ = { }
return false
else if
{ u1, u2, …, un } := [[P’]]G
return isComplete(u1P’’,C,G) ∧ isComplete(u2P’’,C,G) ∧ … ∧ isComplete(unP’’,C,G)
P’’ := P  P’
[[P’]]G = { }
return true
else
Time-aware completeness reasoning
Movies of Matt
2002
Actors of Gerry
?
2007
Actors of Gerry Actors of Gerry
. . . . .
Movies of Matt Movies of Matt
Time-aware completeness reasoning
Give me Gerry’s actors that have won an Oscar
SELECT * WHERE
{ gerry actor ?a . ?a award Oscar }
Query completeness guarantee: Yes, up to 2016!
Time-aware completeness reasoning
Data Source
Compl({(gerry,actor,?a)}, )
Compl({(?a,award,Oscar)},2016)
Creating completeness statements:
CORNER
Creating completeness statements:
COOL_-WD
Creating completeness statements:
COOL_-WD
. . . . . . . . .
. . . . . . . . .
Cardinality extraction:
ACL 2017 results
Cardinality extraction:
Detailed results
Cardinality extraction:
Detailed results
CRF: An illustration
Incorporate the context around the observed token,
that is, that comes before and after the token.
Ingredients:
• Feature functions, with the inputs of:
Sentence, position of word in the sentence,
the label of the current word,
the label of the previos word.
The output: real number
• Sum over weighted feature functions
The first sum runs over each feature function j,
the inner sum runs over each position of the sentence.
• The sum is transformed into probabilities
by exponentiating and normalizing.
Basically the sequential version of logistic regression.
Query soundness experiment cases
Query soundness experiment
Dissertation Defense - Managing and Consuming Completeness Information for RDF Data Sources
Linked Data principles
OWL 2 RL
Dissertation Defense - Managing and Consuming Completeness Information for RDF Data Sources
RDFS-aware completeness
reasoning

More Related Content

PPT
Introduction to Python
PDF
Python quick guide
PDF
Boolean Algebra
PPTX
Bfs & dfs application
PDF
基礎からわかる確率過程
PDF
Group theory
PDF
Python Decision Making And Loops.pdf
PDF
Python programming : Inheritance and polymorphism
Introduction to Python
Python quick guide
Boolean Algebra
Bfs & dfs application
基礎からわかる確率過程
Group theory
Python Decision Making And Loops.pdf
Python programming : Inheritance and polymorphism

What's hot (20)

PDF
Machine Learning Model Evaluation Methods
PDF
SETS, RELATIONS AND FUNCTIONS - JEE Main 2014
PPT
Fisher, neyman pearson and beyond
PDF
Properties of Caputo Operator and Its Applications to Linear Fractional Diffe...
PDF
Red Black Trees
PPTX
Building blocks of deep learning
PPTX
getting started with numpy and pandas.pptx
PDF
Calculo II (1-2016)(1er parcial).pdf
PPTX
dm_13_RelationsAndTheirProperties (1).pptx
PDF
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
PDF
Graph Theory: Connectivity & Isomorphism
PDF
Geodesic Method in Computer Vision and Graphics
PPTX
Decision Tree - ID3
PPTX
DDPG algortihm for angry birds
PPTX
Exploratory Data Analysis using Python
PDF
Introduction to Generative Adversarial Networks
PDF
미등록단어 문제 해결을 위한 비지도학습 기반 한국어자연어처리 방법론 및 응용
PDF
The Sandbox Container Directory
PDF
MATHS SYMBOLS - EXPONENTIALS + LOGARITHMS and THEIR PROPERTIES
PPT
03 truncation errors
Machine Learning Model Evaluation Methods
SETS, RELATIONS AND FUNCTIONS - JEE Main 2014
Fisher, neyman pearson and beyond
Properties of Caputo Operator and Its Applications to Linear Fractional Diffe...
Red Black Trees
Building blocks of deep learning
getting started with numpy and pandas.pptx
Calculo II (1-2016)(1er parcial).pdf
dm_13_RelationsAndTheirProperties (1).pptx
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Graph Theory: Connectivity & Isomorphism
Geodesic Method in Computer Vision and Graphics
Decision Tree - ID3
DDPG algortihm for angry birds
Exploratory Data Analysis using Python
Introduction to Generative Adversarial Networks
미등록단어 문제 해결을 위한 비지도학습 기반 한국어자연어처리 방법론 및 응용
The Sandbox Container Directory
MATHS SYMBOLS - EXPONENTIALS + LOGARITHMS and THEIR PROPERTIES
03 truncation errors
Ad

More from Fariz Darari (20)

PDF
Data X Museum - Hari Museum Internasional 2022 - WMID
PDF
[PUBLIC] quiz-01-midterm-solutions.pdf
PPTX
Free AI Kit - Game Theory
PPTX
Neural Networks and Deep Learning: An Intro
PPTX
NLP guest lecture: How to get text to confess what knowledge it has
PPTX
Supply and Demand - AI Talents
PPTX
Basic Python Programming: Part 01 and Part 02
PPTX
AI in education done properly
PPTX
Artificial Neural Networks: Pointers
PPTX
Open Tridharma at ICACSIS 2019
PDF
Defense Slides of Avicenna Wisesa - PROWD
PPTX
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
PPTX
Foundations of Programming - Java OOP
PPTX
Recursion in Python
PDF
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
PPTX
Testing in Python: doctest and unittest (Updated)
PPTX
Testing in Python: doctest and unittest
PPTX
Research Writing - 2018.07.18
PPTX
KOI - Knowledge Of Incidents - SemEval 2018
PPTX
Comparing Index Structures for Completeness Reasoning
Data X Museum - Hari Museum Internasional 2022 - WMID
[PUBLIC] quiz-01-midterm-solutions.pdf
Free AI Kit - Game Theory
Neural Networks and Deep Learning: An Intro
NLP guest lecture: How to get text to confess what knowledge it has
Supply and Demand - AI Talents
Basic Python Programming: Part 01 and Part 02
AI in education done properly
Artificial Neural Networks: Pointers
Open Tridharma at ICACSIS 2019
Defense Slides of Avicenna Wisesa - PROWD
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Foundations of Programming - Java OOP
Recursion in Python
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest
Research Writing - 2018.07.18
KOI - Knowledge Of Incidents - SemEval 2018
Comparing Index Structures for Completeness Reasoning
Ad

Recently uploaded (20)

DOCX
Unit-3 cyber security network security of internet system
PDF
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
PDF
Tenda Login Guide: Access Your Router in 5 Easy Steps
PPT
Design_with_Watersergyerge45hrbgre4top (1).ppt
PPTX
Power Point - Lesson 3_2.pptx grad school presentation
PPTX
Introduction to Information and Communication Technology
PDF
Unit-1 introduction to cyber security discuss about how to secure a system
PPT
isotopes_sddsadsaadasdasdasdasdsa1213.ppt
PPTX
Database Information System - Management Information System
PPTX
artificialintelligenceai1-copy-210604123353.pptx
PDF
Slides PDF The World Game (s) Eco Economic Epochs.pdf
PPTX
Digital Literacy And Online Safety on internet
PPTX
INTERNET------BASICS-------UPDATED PPT PRESENTATION
PPTX
presentation_pfe-universite-molay-seltan.pptx
PPTX
innovation process that make everything different.pptx
PPTX
Mathew Digital SEO Checklist Guidlines 2025
PDF
💰 𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓 💰
PPTX
Job_Card_System_Styled_lorem_ipsum_.pptx
PDF
An introduction to the IFRS (ISSB) Stndards.pdf
PDF
Paper PDF World Game (s) Great Redesign.pdf
Unit-3 cyber security network security of internet system
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
Tenda Login Guide: Access Your Router in 5 Easy Steps
Design_with_Watersergyerge45hrbgre4top (1).ppt
Power Point - Lesson 3_2.pptx grad school presentation
Introduction to Information and Communication Technology
Unit-1 introduction to cyber security discuss about how to secure a system
isotopes_sddsadsaadasdasdasdasdsa1213.ppt
Database Information System - Management Information System
artificialintelligenceai1-copy-210604123353.pptx
Slides PDF The World Game (s) Eco Economic Epochs.pdf
Digital Literacy And Online Safety on internet
INTERNET------BASICS-------UPDATED PPT PRESENTATION
presentation_pfe-universite-molay-seltan.pptx
innovation process that make everything different.pptx
Mathew Digital SEO Checklist Guidlines 2025
💰 𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓 💰
Job_Card_System_Styled_lorem_ipsum_.pptx
An introduction to the IFRS (ISSB) Stndards.pdf
Paper PDF World Game (s) Great Redesign.pdf

Dissertation Defense - Managing and Consuming Completeness Information for RDF Data Sources

  • 1. Web data is mostly incomplete • Wikidata misses the fact that M. Sottile is an actor of Reservoir Dogs • YAGO covers only 24,770 hasChild facts • DBpedia contains 1 out of 29 spouses of Glynn Wolfe1 1 Guinness record holder for world’s most married man
  • 2. Yet many topics are complete cast member publication date Matt Damon Casey Affleck 2002 . . . . . . . . . . . . . . . . . . . . . . . . . . .
  • 3. Yet many topics are complete . . . . . . . . . . . . . . . . . . . . . . . . . . .
  • 4. MANAGING AND CONSUMING COMPLETENESS INFORMATION FOR RDF DATA SOURCES Fariz Darari Supervisors: Werner Nutt and Sebastian Rudolph PhD Thesis Defense in Bolzano on July 14, 2017
  • 5. Research questions • Formalize? How can we formalize statements about RDF completeness? • Use? How can completeness statements be leveraged for checking the completeness and soundness of SPARQL queries? • Optimize? How do we efficiently perform completeness reasoning? • Create? How can we collect completeness statements? Formalize Use Optimize Create
  • 6. Related Work • Completeness framework for relational databases (Motro, 1989),(Levy, 1996),(Razniewski and Nutt, 2011) • Closed predicates for Description Logics (Lutz et al, 2013),(Ngo et al, 2016) • Completeness types on the Semantic Web: Schema, property, population, interlinking (Fürber and Hepp, 2011),(Zaveri et al, 2016)
  • 7. (gerry,actor,matt) (gerry,actor,casey) G Give me Gerry’s actors SELECT * WHERE { gerry actor ?a } ?a/matt ?a/casey variable (S,P,O) Background: RDF graphs and SPARQL queries BGP Fragment of SPARQL
  • 8. (gerry,actor,leo) (titanic,actor,leo) G’ Completeness statements Complete for all of Gerry’s actors Compl({(gerry,actor,?a)}) (gerry,actor,matt) (gerry,actor,casey) X G The extension pair (G,G’) satisfies the statement In general: >=1 triple pattern
  • 9. Query completeness: Titanic (gerry,actor,matt) (gerry,actor,casey) Give me Titanic’s actors SELECT * WHERE { titanic actor ?a } (titanic,actor,leo) ?a/leo- Incomplete! G G’
  • 10. Query completeness: Gerry (gerry,actor,matt) (gerry,actor,casey) Give me Gerry’s actors SELECT * WHERE { gerry actor ?a } (titanic,actor,leo) ?a/matt ?a/casey ?a/matt ?a/casey Complete! G G’
  • 11. Recap: Research questions Formalize Use Optimize Create We have formalized completeness statements and query completeness by means of extension pairs
  • 12. Checking query completeness: Data-agnostic Query Q Completeness statements C Boolean isComplete For every extension pair (G,G’) satisfying C, it is the case [[Q]]G’ ⊆ [[Q]]G Completeness checking
  • 13. Data-agnostic completeness checking: Example Give me Gerry’s actors SELECT * WHERE { gerry actor ?a } Data Source Compl({(gerry,actor,?a)}) ... Query completeness guarantee: Yes!
  • 14. Give me Gerry’s actors along with their children and spouses SELECT * WHERE { gerry actor ?a . ?a child ?c . ?a spouse ?s } ?a/a ?c/c ?s/s G G’ Assume some answer over G’ (gerry,actor,a) (a,child,c) (a,spouse,s) Compl({(gerry,actor,?a),(?a,child,?c)}) Compl({(?a,spouse,?s)}) ..... ?a/a ?c/c ?s/s Complete! Data-agnostic completeness checking: More example
  • 15. Data Source Compl({(gerry,actor,?a)}) Compl({(matt,child,?c)}) Compl({(casey,child,?c)}) Query completeness guarantee: No! Give me Gerry’s actors and their children SELECT * WHERE { gerry actor ?a . ?a child ?c } Data-agnostic completeness checking: Limitation
  • 16. Checking query completeness: Data-aware Query Q Completeness statements C Boolean isComplete Completeness checking Graph G For every extension pair (G,G’) satisfying C, it is the case [[Q]]G’ ⊆ [[Q]]G
  • 18. Compl({(gerry,actor,?a)}) Compl({(matt,child,?c)}) Compl({(casey,child,?c)}) ...(gerry,actor,matt) (gerry,actor,casey) Give me Gerry’s actors and their children SELECT * WHERE { gerry actor ?a . ?a child ?c } Give me Matt’s children SELECT * WHERE { matt child ?c } + Give me Casey’s children SELECT * WHERE { casey child ?c } Query completeness guarantee: Yes! Query completeness guarantee: Yes! Query completeness guarantee: Yes! Data-aware completeness checking: Example
  • 19. Query Q Completeness statements C Boolean isComplete Completeness checking Graph G Divide-and-conquer Algorithm Take the body P of the query Q isComplete(P,C,G): if P = { } // base case return true P’ := FindMatch(P,C,G) if P’ = { } return false else { u1, u2, …, un } := [[P’]]G return isComplete(u1P’’,C,G) ∧ isComplete(u2P’’,C,G) ∧ … ∧ isComplete(unP’’,C,G) P’’ := P P’ Data-aware completeness checking: Algorithm
  • 20. Query Q Completeness statements C Boolean isComplete Completeness checking Graph G * P is the body of Q ** X denotes “fixed” Data-aware completeness checking: Complexity
  • 21. Recap: Research questions Formalize Use Optimize Create We have characterized the problem of checking query completeness for data-agnostic setting and data-aware setting
  • 22. Give me the actors who were not acting in Gerry SELECT * WHERE { ?a type Actor . FILTER NOT EXISTS { gerry actor ?a }} Queries with negation - -- Answers Incomplete data may lead to unsound answers! In general: Multiple negation parts
  • 23. Query Q Completeness statements C Boolean isSound Soundness checkingGraph G Checking answer soundness Answer u For every extension pair (G,G’) satisfying C, if u ∈ [[Q]]G then u ∈ [[Q]]G’
  • 24. Compl({(?m,actor,leo)}) (leo,type,Actor) (matt,type,Actor) (titanic,actor,leo) Give me the actors who were not acting in Gerry SELECT * WHERE { ?a type Actor . FILTER NOT EXISTS { gerry actor ?a }} ?a/leo : Answer soundness guarantee? ?a/matt: Answer soundness guarantee? Checking answer soundness: Example
  • 25. Compl({(?m,actor,leo)}) (leo,type,Actor) (matt,type,Actor) (titanic,actor,leo) Give me the actors who were not acting in Gerry SELECT * WHERE { ?a type Actor . FILTER NOT EXISTS { gerry actor ?a }} ?a/leo : Answer soundness guarantee? Was Leo acting in Gerry? SELECT * WHERE { gerry actor leo } Query completeness guarantee: Yes!?a/leo : Answer soundness guarantee? Yes! Checking “Leo” soundness
  • 26. Compl({(?m,actor,leo)}) (leo,type,Actor) (matt,type,Actor) (titanic,actor,leo) Give me the actors who were not acting in Gerry SELECT * WHERE { ?a type Actor . FILTER NOT EXISTS { gerry actor ?a }} Checking “Matt” soundness ?a/matt: Answer soundness guarantee? Was Matt acting in Gerry? SELECT * WHERE { gerry actor matt } Query completeness guarantee: No!?a/matt: Answer soundness guarantee? No!
  • 27. Query Q Completeness statements C Boolean isSound Soundness checkingGraph G Answer u The answer u is sound wrt C, Q, and G iff each positive query generated from applying u over each negative part of the query Q is complete wrt C and G Reduction Theorem Checking answer soundness by checking data-aware completeness
  • 28. Query Q Completeness statements C Boolean isSound Soundness checking Checking pattern soundness For every extension pair (G,G’) satisfying C, it is the case [[Q]]G ⊆ [[Q]]G’
  • 29. Data Source Compl({(gerry,actor,?a)}) Give me the actors who were not acting in Gerry SELECT * WHERE { ?a type Actor . FILTER NOT EXISTS { gerry actor ?a }} Pattern soundness guarantee? Checking pattern soundness: Example
  • 30. Data Source Compl({(gerry,actor,?a)}) Give me the actors who were not acting in Gerry SELECT * WHERE { ?a type Actor . FILTER NOT EXISTS { gerry actor ?a }} Pattern soundness guarantee? Give me actors acting in Gerry SELECT * WHERE { gerry actor ?a } Query completeness guarantee: Yes! Pattern soundness guarantee? Yes! Checking pattern soundness: Example
  • 31. The query Q is sound wrt C iff each negative part of the query Q is complete wrt C Reduction Theorem Query Q Completeness statements C Boolean isSound Soundness checking Checking pattern soundness by checking data-agnostic completeness
  • 32. Recap: Research questions Formalize Use Optimize Create We have characterized the problem of checking query completeness and query soundness
  • 33. Linked Open Quality Data Cloud Are we ready to reason with big quality data?
  • 34. Data-agnostic optimization: Ignoring irrelevant statements Data Source Compl({(gerry,actor,?a)}) Compl({(?m,actor,?a)}) ... Compl({(titanic,actor,?a)}) Compl({(?p,spouse,?s)}) ... Give Gerry’s actors SELECT * WHERE { gerry actor ?a } Constants gerry,actor Constants gerry,actor actor . . . titanic,actor spouse . . . A statement contributes to query completeness only if its constants are among the query’s
  • 35. Data-agnostic optimization: Evaluation Setup Queries: BGP queries from LSQ query logs dataset Completeness statements: For every query as above, take the body P, and construct Compl(P) Reasoning implementation: Java with Apache Jena library Results Source NQ NC T-QE T-CR Overhead DBpedia 334,000 331,000 18 ms 0.08 ms 0.44% SWC 108,000 44,000 36 ms 0.12 ms 0.33% LGD 22,000 21,000 8 ms 0.05 ms 0.60% NQ: Number of queries NC: Number of non-redundant statements T-QE: Average runtime of query evaluation T-CR: Average runtime of completeness reasoning
  • 36. Data-aware reasoning cannot reuse data-agnostic optimization Compl({(gerry,actor,?a),(?a,child,?c)}) Compl({(titanic,actor,?a),(?a,child,?c)}) Compl({(frozen,actor,?a),(?a,child,?c)}) ... Give Matt’s children SELECT * WHERE { matt child ?c } (gerry,actor,matt) (gerry,actor,casey) Query completeness guarantee: Yes!
  • 37. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . We capture similar completeness statements by completeness templates Complete for all actors of movies: Pulp Fiction, Gerry, . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Data-aware optimization: Completeness templates Complete for all street names of communities: Abingdon, Shippon, . . . . Complete for X: Instance 1, Instance 2, … OpenStreetMap
  • 38. Compl({(gerry,actor,?a),(?a,child,?c)}) Compl({(titanic,actor,?a),(?a,child,?c)}) Compl({(frozen,actor,?a),(?a,child,?c)}) ... Give Matt’s children SELECT * WHERE { matt child ?c } (gerry,actor,matt) (gerry,actor,casey) Query completeness guarantee: Yes! Completeness templates enable simultaneous processing of statements Compl({(?m,actor,?a),(?a,child,?c)}) s.t. {?m/gerry,?m/titanic,?m/frozen,...} 1 ?a/matt 2 ?m/gerry StatementsTemplate
  • 39. Data-aware optimization: Evaluation Setup Graph: Direct-statement fragment of Wikidata (110 mio triples) Queries and statements: Based on 66 BGP queries from Wikidata query examples page Each BGP query acts as a base to generate test queries and statements Reasoning implementation: Java with Apache Jena library Triple store: Apache Jena TDB Results Average reasoning time with 1,100 queries and • 445,000 statements : 15 s vs. • 66 templates : 140 ms
  • 40. Recap: Research questions Formalize Use Optimize Create We have seen how to efficiently check query completeness by indexing completeness statements
  • 41. Linked Open Quality Data Cloud How can we create big quality data?
  • 42. Creating completeness statements via cardinality extraction “Wikidata is complete for all children of Trump” Around 19% numbers on Wikipedia express relation cardinalities! Count: 5 Machine learning task: Given text, find the correct cardinalities of certain relations |(trump,child,*)| = 5
  • 43. • Trump has five children • Trump involved in more than 3500 legal actions • Manafort is one of his campaign managers. Texts with numbers on Trump’s Wikipedia page: Label positive when: 1. Number corresponds to the exact triple count 2. Number modifies a noun Training data generation for relation “child” Distantly-supervised training data Training data is labeled automatically based on some rules Count: 5
  • 44. Sequence labeling classifier learn which context words often occur around the positively labeled data and apply it to new data Matt has two children Child cardinality? Experimental evaluation gives mixed precision results: • hasPart for creative work : 58% • hasPart for musical ensemble : 80% • contains administrative division : 85% • child : 38% • spouse : 84%
  • 45. Recap: Research questions Formalize Use Optimize Create We have seen how one can extract cardinality information from text on the Web that can be leveraged to generate completeness statements
  • 46. Other research results • Data can grow and therefore completeness statements can become outdated… Solution: Time-aware completeness reasoning • How can we create completeness statements in a collaborative way? Demonstrator systems: CORNER and COOL-WD
  • 48. Future work ? Richer operators for statements “Complete for all unibz PhD students who don’t speak German and who are (above 40) or (below 20) of age” ? OWL incorporation The statement Compl({(?m,type,Centaur)}) can ensure the completeness of the query “Give me all men who are also horses” if we know that: Centaur Man ⨅ Horse ? Completeness case studies with RDF data publishers What do communities of Wikidata, YAGO, and DBpedia think of completeness (and its use cases)? ≡
  • 49. Publications C1. Paramita Mirza, Simon Razniewski, Fariz Darari, Gerhard Weikum: Cardinal Virtues: Extracting Relation Cardinalities from Text. ACL (Short Papers) 2017. C2. Fariz Darari, Simon Razniewski, Radityo Eko Prasojo, Werner Nutt: Enabling Fine-grained RDF Data Completeness Assessment. ICWE 2016. W1. Radityo Eko Prasojo, Fariz Darari, Simon Razniewski, Werner Nutt: Managing and Consuming Completeness Information for Wikidata Using COOL-WD. COLD 2016. P1. Fariz Darari, Radityo Eko Prasojo, Werner Nutt: Expressing No-Value Information in RDF. ISWC (Posters & Demos) 2015. (Nominated for Best Poster) P2. Fariz Darari, Simon Razniewski, Werner Nutt: Bridging the Semantic Gap between RDF and SPARQL Using Completeness Statements. ISWC (Posters & Demos) 2014. D1. Fariz Darari, Radityo Eko Prasojo, Werner Nutt: CORNER: A Completeness Reasoner for SPARQL Queries Over RDF Data Sources. ESWC (Posters & Demos) 2014.
  • 50. Potential Publications J1. Fariz Darari, Werner Nutt, Simon Razniewski, Sebastian Rudolph: Ensuring the Completeness and Soundness of SPARQL Queries Using Completeness Statements about RDF Data Sources. To be submitted to a journal. J2. Fariz Darari, Werner Nutt, Giuseppe Pirro,Simon Razniewski: Completeness Management for RDF Data Sources. Under revision for a journal. C3. Paramita Mirza, Simon Razniewski, Fariz Darari, Gerhard Weikum: Extracting Quantifications of Knowledge Base Facts from Text. To be submitted to a conference. D2. Erdal Domi, Simon Razniewski, Fariz Darari, Werner Nutt: ProWD: Profiling WikiData. To be submitted to ISWC 2017 (Posters & Demos). D3. Fariz Darari, Radityo Eko Prasojo, Simon Razniewski, Werner Nutt: COOL-WD: A Completeness Tool for Wikidata. Submitted to ISWC 2017 (Posters & Demos).
  • 53. Algorithm for data-aware completeness checking Query Q Completeness statements C Boolean isComplete Completeness checking Graph G Divide-and-conquer Algorithm Take the body P of the query Q isComplete(P,C,G): if P = { } // base case return true P’ := FindMatch(P,C,G) if P’ = { } return false else if { u1, u2, …, un } := [[P’]]G return isComplete(u1P’’,C,G) ∧ isComplete(u2P’’,C,G) ∧ … ∧ isComplete(unP’’,C,G) P’’ := P P’ [[P’]]G = { } return true else
  • 54. Time-aware completeness reasoning Movies of Matt 2002 Actors of Gerry ? 2007 Actors of Gerry Actors of Gerry . . . . . Movies of Matt Movies of Matt
  • 56. Give me Gerry’s actors that have won an Oscar SELECT * WHERE { gerry actor ?a . ?a award Oscar } Query completeness guarantee: Yes, up to 2016! Time-aware completeness reasoning Data Source Compl({(gerry,actor,?a)}, ) Compl({(?a,award,Oscar)},2016)
  • 59. Creating completeness statements: COOL_-WD . . . . . . . . . . . . . . . . . .
  • 63. CRF: An illustration Incorporate the context around the observed token, that is, that comes before and after the token. Ingredients: • Feature functions, with the inputs of: Sentence, position of word in the sentence, the label of the current word, the label of the previos word. The output: real number • Sum over weighted feature functions The first sum runs over each feature function j, the inner sum runs over each position of the sentence. • The sum is transformed into probabilities by exponentiating and normalizing. Basically the sequential version of logistic regression.

Editor's Notes

  • #2: Welcome to the open world of the Web, where data is generally considered incomplete. These three of the largest Web data sources, for example, have their incompleteness problems. Wikidata does not know if Sottile is an actor of Reservoir Dogs. (as of Jun 2017) There must be more hasChild information in reality than in YAGO. (Statistics from YAGO website: http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/research/yago-naga/yago/statistics/) DBpedia underestimates Glynn Wolfe, the most capable man in the world! (as of Jun 2017 in comparison to http://articles.chicagotribune.com/1997-06-20/news/9706200268_1_world-s-most-married-man-linda-essex-wolfe-mr-wolfe) But, is this the end of story?
  • #3: Thankfully not. For many topics, the data is indeed complete! On this page about Gerry on Wikidata, without completeness information, we tend to consider the cast members as incomplete. Yet, the fact is that it is COMPLETE. We make this implicit information explicit by this completeness stamp.
  • #4: Also for data about Trump on Dbpedia, non-US people might not know if it is complete for all the his spouses, so we put the completeness stamp!
  • #5: Such an idea of having explicit completeness information motivates the research questions: how to manage such completeness information and how to consume it in the context of RDF data sources, in particular… (show next page)
  • #7: Relational databases => completeness for relational tables, and relation with SQL query completeness Closed predicates => draw more conclusions than open case Completeness types => a fraction of data compared to another data source treated as the gold standard
  • #9: Generally, any possible extension to the available graph will do. Completenesss statements restrict such extensions.
  • #10: Incomplete because answers over G’ are not contained in over G
  • #13: The ideal graphs are always abstracted. For this completeness problem, also available graphs are abstracted. Image credits: - https://www.iconfinder.com/icons/969265/cogwheels_configuration_gear_seo_setting_settings_icon#size=512
  • #14: Match completeness statements with the triple patterns of the query
  • #15: How does this work? Let me give a more elaborated example. Completeness statements are used like CONSTRUCT queries, to transfer the prototypical graph in G’ to G.
  • #16: We cannot find any matches for (?a child ?c)
  • #17: The available graph is in the input. Image credits: - https://www.iconfinder.com/icons/969265/cogwheels_configuration_gear_seo_setting_settings_icon#size=512
  • #18: Instead of giving up, with the data-aware approach, one can: instantiate complete parts of the query
  • #19: Instantiation results will generate two subqueries that are intuitively equivalent with the original query. If complete for all the subqueries, complete for the original query.
  • #20: Take the body because the completeness of Q is equivalent to the completeness of its body. Recursive algorithm, where P’ is the matched parts wrt. completeness statements that are to be instantiated. If no match can be found, then incomplete. Otherwise, evaluate the matched parts P’, get the mappings, and apply them to the remaining parts in P. Then the original query is complete iff all the subqueries are complete.
  • #21: Graph does not play a role, since our hardness proof uses a fixed-size graph. Fixing the P, would give polynomial size instantiations, the hard part is only on the FindMatch Fixing the C, would give a tractable check for FindMatch, so to refute the entailment, need to guess a counterexample mapping wrt. P.
  • #23: Non-monotonicity may draw a previously given answer!
  • #24: Image credits: - https://www.iconfinder.com/icons/969265/cogwheels_configuration_gear_seo_setting_settings_icon#size=512
  • #29: Image credits: - https://www.iconfinder.com/icons/969265/cogwheels_configuration_gear_seo_setting_settings_icon#size=512
  • #32: Contrast this with answer soundness checking
  • #34: LOQD Cloud = Datasets about LOD quality, which themselves are also published in Linked Data format
  • #35: Subset-querying techniques: Tries: (Hoffmann & Koehler, 1999) Inverted indexing: (Helmer & Moerkotte, 2003) Tries: (Savnik, 2013)
  • #37: Not only does data-aware reasoning require instantiation-step, but also the matching is done also over the available graph.
  • #38: IMDb has over 17000 editor-verified natural language completeness statements OpenStreetMap has over 2300 pages with completeness status
  • #39: Instead of evaluating each statement individually, we evaluate a generalized version of the statements, and check if there is any compatible instance.
  • #42: LOQD Cloud = Datasets about LOD quality, which themselves are also published in Linked Data format
  • #43: Automated method to create completeness statements. Idea is simple: match the cardinality information with the KB relation count.
  • #44: One of the benefits of distant-supervision training data generation is that: cheap training data with a relatively good quality
  • #45: creative work’s precision is low because of: small training data child’s precision is low because of: training data mixes children and siblings, political terms served; text assertions also include children information of someone else like her parent/sibling; and multiple marriages so information about children are found not in one but multiple sentences. Higher than ACL result because: incompleteness-resilient distant supervision, compositionality, linguistic variance
  • #48: For conclusions, we have formalized, used, optimized, and created completeness statements 
  • #55: We leverage completeness templates for data-aware optimization. Based on our observation, completeness statements in the real world are usually about similar topics. So we group completeness statements based on their topics, and then, we perform completeness reasoning over groups of completeness statements. This enables simultaneous processing of completeness statements.
  • #56: We leverage completeness templates for data-aware optimization. Based on our observation, completeness statements in the real world are usually about similar topics. So we group completeness statements based on their topics, and then, we perform completeness reasoning over groups of completeness statements. This enables simultaneous processing of completeness statements.
  • #57: https://www.iconfinder.com/icons/285673/clock_icon#size=128
  • #58: COOL-WD is more focused: it is short for Completeness Tool for Wikidata. We have developed a gadget that enables us to add CSs, and also see complete parts directly from Wikidata. Isn’t it cool?
  • #59: COOL-WD is more focused: it is short for Completeness Tool for Wikidata. We have developed a gadget that enables us to add CSs, and also see complete parts directly from Wikidata. Isn’t it cool?
  • #60: COOL-WD is more focused: it is short for Completeness Tool for Wikidata. We have developed a gadget that enables us to add CSs, and also see complete parts directly from Wikidata. Isn’t it cool?
  • #61: COOL-WD is more focused: it is short for Completeness Tool for Wikidata. We have developed a gadget that enables us to add CSs, and also see complete parts directly from Wikidata. Isn’t it cool?
  • #62: COOL-WD is more focused: it is short for Completeness Tool for Wikidata. We have developed a gadget that enables us to add CSs, and also see complete parts directly from Wikidata. Isn’t it cool?
  • #63: COOL-WD is more focused: it is short for Completeness Tool for Wikidata. We have developed a gadget that enables us to add CSs, and also see complete parts directly from Wikidata. Isn’t it cool?
  • #64: Marginal probability = unconditional probability (http://sites.nicholas.duke.edu/statsreview/probability/jmc/) CRF++ uses a combination of forward Viterbi and backward A* search. This combination yields the exact list of n-best results. -> better than the naïve exponential labeling Weights can be learned using gradient ascent.
  • #67: https://www.w3.org/DesignIssues/LinkedData.html
  • #68: https://www.w3.org/DesignIssues/LinkedData.html