SlideShare a Scribd company logo
‹#› Het begint met een idee
GRLC MAKES GITHUB TASTE
LIKE LINKED DATA APIS
Chefs
Albert Meroño-Peñuela
Rinke Hoekstra
Services and Applications over
Linked APIs and Data (SALAD)
ESWC
29-05-2016
Vrije Universiteit Amsterdam
 VU University Amsterdam – Computer
Science (Knowledge Representation &
Reasoning group)
 International Institute of Social
History (IISG), Amsterdam
 CLARIAH – National Infrastructure for
Digital Humanities
> DataLegend : Structured Data Hub
 Previously incubated by CEDAR –
Dutch historical censuses as 5-star
LOD
2
INSTITUTIONAL SLIDE
‹#› Het begint met een idee
DISCLAIMER
3
Frustration-
driven
research
‹#› Het begint met een idee
1. LD-CONSUMING
APPLICATIONS
4
‹#› Het begint met een idee
5 Het begint met een idee
 Publishing Dutch historical
censuses as 5-star LD
> Intensive use of RDF Data Cube
> Harmonization rules
> Provenance
 1st historical census data as Linked
Data (1795-1971)
 8 million observations (sex, marital
status, occupation position, housing type,
residence status)
 External links
> Geographical: 2.7M
> Occupations: 350K
> Belief: 250K
 High value for social historians
5 Faculty / department / title presentation
THE CEDAR STORY
Vrije Universiteit Amsterdam
 Historians can’t really write SPARQL
 Variety of access interfaces needed
6
CENSUS DATA QUERYING INTERFACES
Vrije Universiteit Amsterdam
 CLARIAH-WP4: Structured
data hub for social historians
 IPUMS, NAPP, CEDAR, etc
> Macro-, micro-, meso-data
> Civil registries, occupation, religion,
country-level economic indicators
> National (Netherlands) and
international
 Mostly CSV tables turned
into RDF Data Cube and
CSVW
 More than 1B triples already
 Higher variety of humanities
scholars  higher variety of
data access requirements)
7
SCALING VARIETY
Exi sts
Frequency Table
Variable does not yet existVariables
Mappings
Publish
Augment
Includes both external LinkedDataand
standard vocabularies, e.g. World Bank
External (Meta)Data
Existing Variables
& Codes
Provenance tracking of a
External Datasets
StructuredDataHub
‹#› Het begint met een idee8
‹#› Het begint met een idee
FRUSTRATION 1
9
This is
SPARQL
mess!!!1one
‹#› Het begint met een idee
‹#› Het begint met een idee
11 Het begint met een idee
 One .rq file for SPARQL query
 Good support of query curation
processes
> Versioning
> Branching
> Clone-pull-push
 Web-friendly features!
> One URI per query
> Uniquely identifiable
> De-referenceable
(raw.githubusercontent.com)
11 Faculty / department / title presentation
GITHUB AS A HUB OF
SPARQL QUERIES
‹#› Het begint met een idee
LESSON 1
12
Query
centralization
helps
maintaining
distributed
applications
‹#› Het begint met een idee
2. THE NEED FOR APIS
13
Vrije Universiteit Amsterdam
 Linked Data APIs emerge
 RESTful entry point to Linked Data hubs for Web applications
 OpenPHACTS
 …but the Linked Data API (e.g. Swagger spec, code itself) still
needs to be coded and maintained
14
MEANWHILE IN THE SEMANTIC WEB…
Vrije Universiteit Amsterdam
 Love story – thanks KMi!
 Automatically builds Swagger
specs and API code
 Takes SPARQL queries as input
(1 API operation = 1 SPARQL
query)
> API call functionality limited to SPARQL
expressivity
 Makes SPARQL queries uniquely
referenceable by using their
equivalent LDA operation
> Stores SPARQL internally
> But we already have uniquely
referenceable SPARQL…
15
BASIL
‹#› Het begint met een idee
FRUSTRATION 2
16
Copy-pasting 200
queries!!!
&
Organization
problem
‹#› Het begint met een idee
17 Het begint met een idee
 Cousin of BASIL in a SALAD 
 Same basic principle: 1 SPARQL
query = 1 API operation
 Automatically builds Swagger spec
and UI from SPARQL
But:
 External query management
 Organization of SPARQL queries in
the GitHub repo matches
organization of the API
 Thin layer – nothing stored server-
side
 Maps
> GitHub API
> Swagger spec
17 Faculty / department / title presentation
Vrije Universiteit Amsterdam
18
MAPPING GITHUB AND SWAGGER
Vrije Universiteit Amsterdam
19
SPARQL DECORATOR SYNTAX
Vrije Universiteit Amsterdam
20
THE GRLC SERVICE
 Assuming your repo is at https://github.com/:owner/:repo
and your grlc instance at :host,
> http://:host/:owner/:repo/spec returns the JSON swagger spec
> http://:host/:owner/:repo/api-docs returns the swagger UI
> http://:host/:owner/:repo/:operation?p_1=v_1...p_n=v_n calls
operation with specifiec parameter values
> Uses BASIL’s SPARQL variable name convention for query parameters
 Sends requests to
> https://api.github.com/repos/:owner/:repo to look for SPARQL queries and their
decorators
> https://raw.githubusercontent.com/:owner/:repo/master/file.rq to dereference
queries, get the SPARQL, and parse it
Vrije Universiteit Amsterdam
21
SPICED-UP SWAGGER UI
Vrije Universiteit Amsterdam
22
EVALUATION – USE CASES
 CEDAR: Access to census data for
historians
> Hides SPARQL
> Allows them to fill query parameters
through forms
> Co-existence of SPARQL and non-SPARQL
clients
 CLARIAH - Born Under a Bad Sign:
Do prenatal and early-life
conditions have an impact on
socioeconomic and health
outcomes later in life? (uses 1891
Canada and Sweden Linked Census Data)
> Reduction of coupling between SPARQL
libs and R
> Shorter R code – input stream as CSV
Vrije Universiteit Amsterdam
The spectrum of Linked Data clients: SPARQL intensive applications
vs RESTful API applications
grlc uses decoupling of SPARQL from all client applications
(including LDA) as a powerful practice
 Separates query curation workflows from everything else
 Allows at the same time
> Web-friendly SPARQL queries
> Web-friendly RESTful APIs
 Helps you to easily organise your LDA – just organise your SPARQL
repository and you’re set
 Try it out!
> http://grlc.clariah-sdh.eculture.labs.vu.nl
> https://github.com/CLARIAH/grlc
23
CONCLUSIONS
‹#› Het begint met een idee
THANK YOU!
@ALBERTMERONYO
DATALEGEND.NET
CLARIAH.NL
24

More Related Content

PPTX
Making social science more reproducible by encapsulating access to linked data
PPTX
Repeatable Semantic Queries for the Linked Data Agnostic
PDF
20170501 Distributed Network of Digital Heritage Information
PDF
Wehc - Linked Data for Economic-Social historians
PPTX
DSpace-CRIS: new features and contribution to the DSpace mainstream
PDF
Cenitpede: Analyzing Webcrawl
PDF
The Nature.com ontologies portal - Linked Science 2015
PDF
nanopub-java: A Java Library for Nanopublications
Making social science more reproducible by encapsulating access to linked data
Repeatable Semantic Queries for the Linked Data Agnostic
20170501 Distributed Network of Digital Heritage Information
Wehc - Linked Data for Economic-Social historians
DSpace-CRIS: new features and contribution to the DSpace mainstream
Cenitpede: Analyzing Webcrawl
The Nature.com ontologies portal - Linked Science 2015
nanopub-java: A Java Library for Nanopublications

What's hot (20)

PDF
Nanopublications and Decentralized Publishing
PPTX
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
PDF
4Science presentes: ORCiD API Tutorial
PDF
Clariah WP4 dataLegend data stories
PDF
balloon: LOD forecasting - cloudy with a chance of services
PPTX
Integrating an electronic lab notebook with a university it environment rdmf ...
PPTX
DataverseNL as structured data hub
 
PPTX
An Approach for RDF-based Semantic Access to NoSQL Repositories
PPTX
Linked Open Data and DANS
 
PDF
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
PDF
Illuminating DSpace's Linked Data Support
PDF
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
PPT
Talis Platform: A Linked Data Engine
PPTX
Data munging and analysis
PPTX
ORCID at Crossref LIVE Indonesia
PPTX
ResourceSync Tutorial
PPTX
Esshc presentation ashkan
PDF
Documents, services, and data on the web
PPTX
Elastic search
Nanopublications and Decentralized Publishing
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
4Science presentes: ORCiD API Tutorial
Clariah WP4 dataLegend data stories
balloon: LOD forecasting - cloudy with a chance of services
Integrating an electronic lab notebook with a university it environment rdmf ...
DataverseNL as structured data hub
 
An Approach for RDF-based Semantic Access to NoSQL Repositories
Linked Open Data and DANS
 
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
Illuminating DSpace's Linked Data Support
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
Talis Platform: A Linked Data Engine
Data munging and analysis
ORCID at Crossref LIVE Indonesia
ResourceSync Tutorial
Esshc presentation ashkan
Documents, services, and data on the web
Elastic search
Ad

Viewers also liked (20)

PPT
Introduccion a la generación de informes con R y LaTex
PDF
PPTX
Demo deck liveexp
PDF
Hydrogeological report of Barazan Plateau, North Goa District
PPTX
Instituto universitario de tecnologia
PPT
Best corporate-r-programming-training-in-mumbai
PDF
Como es la cirugía de catarata
PDF
Intern Project - Erika Goto
PDF
Kerala livsetock trend state planning board 1966 to 2007
PPTX
The cosmopolitan corporation
PDF
Iran oil and gas infrastructure
PPTX
Joint Indonesia-UK Conference on Computational Chemistry 2015
PDF
Teaching Close Reading
PDF
Gender and migration cwds key findings
PDF
Data Visualization With R: Learn To Combine Multiple Graphs
PDF
Metabolic acidosis
PPTX
Perencanaan Pembangunan Prasarana Air untuk Lahan Perkebunan
PPTX
Law of demand
PPT
Facial nerve injury
PPT
Taking off socks - self care skill
Introduccion a la generación de informes con R y LaTex
Demo deck liveexp
Hydrogeological report of Barazan Plateau, North Goa District
Instituto universitario de tecnologia
Best corporate-r-programming-training-in-mumbai
Como es la cirugía de catarata
Intern Project - Erika Goto
Kerala livsetock trend state planning board 1966 to 2007
The cosmopolitan corporation
Iran oil and gas infrastructure
Joint Indonesia-UK Conference on Computational Chemistry 2015
Teaching Close Reading
Gender and migration cwds key findings
Data Visualization With R: Learn To Combine Multiple Graphs
Metabolic acidosis
Perencanaan Pembangunan Prasarana Air untuk Lahan Perkebunan
Law of demand
Facial nerve injury
Taking off socks - self care skill
Ad

Similar to grlc Makes GitHub Taste Like Linked Data APIs (20)

PPTX
Automatic Query-Centric API for Routine Access to Linked Data
PPTX
grlc: Bridging the Gap Between RESTful APIs and Linked Data
PPTX
The Statistics of Stairway to Heaven: A Semantic Story About Digital Humanities
PDF
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
PPTX
One Score To Rule Them All: Semantics in Music Notation
PPTX
Why do they call it Linked Data when they want to say...?
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
PDF
Querying Linked Data with SPARQL
PDF
Querying Linked Data with SPARQL (2010)
KEY
Creating web applications with LODSPeaKr
PPTX
A Real-World Implementation of Linked Data
PPTX
Consuming Linked Data 4/5 Semtech2011
ODP
Web of data
KEY
Linked services
PDF
The state of the art in Linked Data
PDF
Web of Data Usage Mining
PDF
The Characteristics of a RESTful Semantic Web and Why They Are Important
PDF
PDF
DEVOXX UK 2018 - GraphQL as an alternative approach to REST
PPT
Re-using Media on the Web: Media fragment re-mixing and playout
Automatic Query-Centric API for Routine Access to Linked Data
grlc: Bridging the Gap Between RESTful APIs and Linked Data
The Statistics of Stairway to Heaven: A Semantic Story About Digital Humanities
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
One Score To Rule Them All: Semantics in Music Notation
Why do they call it Linked Data when they want to say...?
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Querying Linked Data with SPARQL
Querying Linked Data with SPARQL (2010)
Creating web applications with LODSPeaKr
A Real-World Implementation of Linked Data
Consuming Linked Data 4/5 Semtech2011
Web of data
Linked services
The state of the art in Linked Data
Web of Data Usage Mining
The Characteristics of a RESTful Semantic Web and Why They Are Important
DEVOXX UK 2018 - GraphQL as an alternative approach to REST
Re-using Media on the Web: Media fragment re-mixing and playout

More from Albert Meroño-Peñuela (14)

PPTX
List.MID: A MIDI-Based Benchmark for RDF Lists
PPTX
Modelling and Querying Lists in RDF. A Pragmatic Study
PPTX
What can I expect from an academic career? Valuable skills
PPTX
The MIDI Linked Data Cloud
PPTX
Historical Reasoning on the Web
PPTX
How does a knowledge graph sound like? (or: music is a graph)
PPTX
What Is Linked Historical Data?
PPTX
CBS CEDAR Presentation
PPTX
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data Cube
PDF
Non-Temporal Orderings for Extensional Concept Drift
PDF
Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
PDF
Semantic Web for the Humanities
PPT
Linked Census Data
PPTX
Linked Humanities data
List.MID: A MIDI-Based Benchmark for RDF Lists
Modelling and Querying Lists in RDF. A Pragmatic Study
What can I expect from an academic career? Valuable skills
The MIDI Linked Data Cloud
Historical Reasoning on the Web
How does a knowledge graph sound like? (or: music is a graph)
What Is Linked Historical Data?
CBS CEDAR Presentation
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data Cube
Non-Temporal Orderings for Extensional Concept Drift
Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
Semantic Web for the Humanities
Linked Census Data
Linked Humanities data

Recently uploaded (20)

PDF
Nekopoi APK 2025 free lastest update
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
assetexplorer- product-overview - presentation
PDF
Digital Strategies for Manufacturing Companies
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
System and Network Administration Chapter 2
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
Digital Systems & Binary Numbers (comprehensive )
PDF
top salesforce developer skills in 2025.pdf
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PPTX
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
L1 - Introduction to python Backend.pptx
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
medical staffing services at VALiNTRY
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
Nekopoi APK 2025 free lastest update
VVF-Customer-Presentation2025-Ver1.9.pptx
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
assetexplorer- product-overview - presentation
Digital Strategies for Manufacturing Companies
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
System and Network Administration Chapter 2
Operating system designcfffgfgggggggvggggggggg
Digital Systems & Binary Numbers (comprehensive )
top salesforce developer skills in 2025.pdf
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Embracing Complexity in Serverless! GOTO Serverless Bengaluru
CHAPTER 2 - PM Management and IT Context
L1 - Introduction to python Backend.pptx
Wondershare Filmora 15 Crack With Activation Key [2025
medical staffing services at VALiNTRY
How to Choose the Right IT Partner for Your Business in Malaysia

grlc Makes GitHub Taste Like Linked Data APIs

  • 1. ‹#› Het begint met een idee GRLC MAKES GITHUB TASTE LIKE LINKED DATA APIS Chefs Albert Meroño-Peñuela Rinke Hoekstra Services and Applications over Linked APIs and Data (SALAD) ESWC 29-05-2016
  • 2. Vrije Universiteit Amsterdam  VU University Amsterdam – Computer Science (Knowledge Representation & Reasoning group)  International Institute of Social History (IISG), Amsterdam  CLARIAH – National Infrastructure for Digital Humanities > DataLegend : Structured Data Hub  Previously incubated by CEDAR – Dutch historical censuses as 5-star LOD 2 INSTITUTIONAL SLIDE
  • 3. ‹#› Het begint met een idee DISCLAIMER 3 Frustration- driven research
  • 4. ‹#› Het begint met een idee 1. LD-CONSUMING APPLICATIONS 4
  • 5. ‹#› Het begint met een idee 5 Het begint met een idee  Publishing Dutch historical censuses as 5-star LD > Intensive use of RDF Data Cube > Harmonization rules > Provenance  1st historical census data as Linked Data (1795-1971)  8 million observations (sex, marital status, occupation position, housing type, residence status)  External links > Geographical: 2.7M > Occupations: 350K > Belief: 250K  High value for social historians 5 Faculty / department / title presentation THE CEDAR STORY
  • 6. Vrije Universiteit Amsterdam  Historians can’t really write SPARQL  Variety of access interfaces needed 6 CENSUS DATA QUERYING INTERFACES
  • 7. Vrije Universiteit Amsterdam  CLARIAH-WP4: Structured data hub for social historians  IPUMS, NAPP, CEDAR, etc > Macro-, micro-, meso-data > Civil registries, occupation, religion, country-level economic indicators > National (Netherlands) and international  Mostly CSV tables turned into RDF Data Cube and CSVW  More than 1B triples already  Higher variety of humanities scholars  higher variety of data access requirements) 7 SCALING VARIETY Exi sts Frequency Table Variable does not yet existVariables Mappings Publish Augment Includes both external LinkedDataand standard vocabularies, e.g. World Bank External (Meta)Data Existing Variables & Codes Provenance tracking of a External Datasets StructuredDataHub
  • 8. ‹#› Het begint met een idee8
  • 9. ‹#› Het begint met een idee FRUSTRATION 1 9 This is SPARQL mess!!!1one
  • 10. ‹#› Het begint met een idee
  • 11. ‹#› Het begint met een idee 11 Het begint met een idee  One .rq file for SPARQL query  Good support of query curation processes > Versioning > Branching > Clone-pull-push  Web-friendly features! > One URI per query > Uniquely identifiable > De-referenceable (raw.githubusercontent.com) 11 Faculty / department / title presentation GITHUB AS A HUB OF SPARQL QUERIES
  • 12. ‹#› Het begint met een idee LESSON 1 12 Query centralization helps maintaining distributed applications
  • 13. ‹#› Het begint met een idee 2. THE NEED FOR APIS 13
  • 14. Vrije Universiteit Amsterdam  Linked Data APIs emerge  RESTful entry point to Linked Data hubs for Web applications  OpenPHACTS  …but the Linked Data API (e.g. Swagger spec, code itself) still needs to be coded and maintained 14 MEANWHILE IN THE SEMANTIC WEB…
  • 15. Vrije Universiteit Amsterdam  Love story – thanks KMi!  Automatically builds Swagger specs and API code  Takes SPARQL queries as input (1 API operation = 1 SPARQL query) > API call functionality limited to SPARQL expressivity  Makes SPARQL queries uniquely referenceable by using their equivalent LDA operation > Stores SPARQL internally > But we already have uniquely referenceable SPARQL… 15 BASIL
  • 16. ‹#› Het begint met een idee FRUSTRATION 2 16 Copy-pasting 200 queries!!! & Organization problem
  • 17. ‹#› Het begint met een idee 17 Het begint met een idee  Cousin of BASIL in a SALAD   Same basic principle: 1 SPARQL query = 1 API operation  Automatically builds Swagger spec and UI from SPARQL But:  External query management  Organization of SPARQL queries in the GitHub repo matches organization of the API  Thin layer – nothing stored server- side  Maps > GitHub API > Swagger spec 17 Faculty / department / title presentation
  • 20. Vrije Universiteit Amsterdam 20 THE GRLC SERVICE  Assuming your repo is at https://github.com/:owner/:repo and your grlc instance at :host, > http://:host/:owner/:repo/spec returns the JSON swagger spec > http://:host/:owner/:repo/api-docs returns the swagger UI > http://:host/:owner/:repo/:operation?p_1=v_1...p_n=v_n calls operation with specifiec parameter values > Uses BASIL’s SPARQL variable name convention for query parameters  Sends requests to > https://api.github.com/repos/:owner/:repo to look for SPARQL queries and their decorators > https://raw.githubusercontent.com/:owner/:repo/master/file.rq to dereference queries, get the SPARQL, and parse it
  • 22. Vrije Universiteit Amsterdam 22 EVALUATION – USE CASES  CEDAR: Access to census data for historians > Hides SPARQL > Allows them to fill query parameters through forms > Co-existence of SPARQL and non-SPARQL clients  CLARIAH - Born Under a Bad Sign: Do prenatal and early-life conditions have an impact on socioeconomic and health outcomes later in life? (uses 1891 Canada and Sweden Linked Census Data) > Reduction of coupling between SPARQL libs and R > Shorter R code – input stream as CSV
  • 23. Vrije Universiteit Amsterdam The spectrum of Linked Data clients: SPARQL intensive applications vs RESTful API applications grlc uses decoupling of SPARQL from all client applications (including LDA) as a powerful practice  Separates query curation workflows from everything else  Allows at the same time > Web-friendly SPARQL queries > Web-friendly RESTful APIs  Helps you to easily organise your LDA – just organise your SPARQL repository and you’re set  Try it out! > http://grlc.clariah-sdh.eculture.labs.vu.nl > https://github.com/CLARIAH/grlc 23 CONCLUSIONS
  • 24. ‹#› Het begint met een idee THANK YOU! @ALBERTMERONYO DATALEGEND.NET CLARIAH.NL 24

Editor's Notes

  • #17: Organization of the GitHub query repo matches the organization we want for its equivalent API Inception of the idea that the repo actually matches the API…