SlideShare a Scribd company logo
Jing Chen He • jinghe@us.ibm.com • Apache HBase PMC • JanusGraph TSC
Jason Plurad • pluradj@us.ibm.com • Apache TinkerPop PMC • JanusGraph TSC
HBaseCon West 2017 • June 12, 2017
Community-Driven Graphs with
JanusGraph
Agenda
Property Graphs
Graph Community
Introduction to JanusGraph
JanusGraph with HBase
2 #HBaseCon
Graph
 Born for relationship!
 Intuitive modeling
 Expressive querying
 Native analysis
3 #HBaseCon
https://tinkerpop.apache.org/docs/3.2.4/reference/#intro
Graph Data Use Cases
 Social network analysis
 Configuration management database
 Master data management
 Recommendation engines
 Knowledge graphs
 Internet of things
 Cybersecurity attack analysis
4 #HBaseCon
Apache TinkerPop
 Open source, vendor-agnostic,
graph computing framework
 Gremlin graph traversal language
5
Apache TinkerPop™
Maintainer Apache
Software
Foundation
License Apache
Latest Release 3.2.4
February 2017
https://tinkerpop.apache.org
#HBaseCon
Gremlin Graph Traversal Language
6 #HBaseCon
https://tinkerpop.apache.org/gremlin.html
TinkerPop Stack
7 #HBaseCon
https://tinkerpop.apache.org/docs/3.2.4/reference/#_graph_system_integration
Graph Landscape
8 #HBaseCon
https://tinkerpop.apache.org/gremlin.html#oltp-and-olap-traversals
 Scalable graph database distributed on
multi-machine clusters with pluggable storage
and indexing
 Fully-compliant with Apache TinkerPop graph
computing framework
 Vendor-neutral, open community with
open governance
– Founding members: Expero, Google, GRAKN.AI,
Hortonworks, IBM
– Latest members: Amazon, Netflix, Orchestral
Developments, Uber
9
JanusGraph™
Maintainer Linux
Foundation
License Apache
Latest
Release
0.1.0
April 2017
https://janusgraph.org
#HBaseCon
10 #HBaseCon
Architecture
Google Cloud Bigtable
http://docs.janusgraph.org/latest/arch-overview.html
11 #HBaseCon
Storage Model
http://docs.janusgraph.org/latest/data-model.html#_janusgraph_data_layout
12 #HBaseCon
Storage Model
http://docs.janusgraph.org/latest/data-model.html#_individual_edge_layout
13 #HBaseCon
with HBase
 HBase – Perfect Storage Backend for JanusGraph
Big enough for your biggest graph!
The storage model
Read and write speed
Scalability and partitioning
Strong consistency
Tight integration with Hadoop Ecosystem
Great open community!
http://docs.janusgraph.org/latest/hbase.html
14 #HBaseCon
with HBase
 HBase – Perfect Storage Backend for JanusGraph
Simple configuration!
 conf/janusgraph-hbase-solr.properties
 storage.backend=hbase
 storage.hostname=zookeeper-host1,zookeeper-host2,zookeeper-host3
 storage.hbase.table=janusgraph
 storage.hbase.ext.zookeeper.znode.parent=/hbase
 storage.hbase.ext.hbase.zookeeper.property.clientPort=2181
 Just open your graph!
 graph=JanusGraphFactory.open('conf/janusgraph-hbase-solr.properties')
Optional
Optional
15 #HBaseCon
with HBase
 HBase – Perfect Storage Backend for JanusGraph
Throw in an Index Backend for better performance
 conf/janusgraph-hbase-solr.properties
 index.search.backend=solr
 index.search.solr.mode=cloud
 index.search.solr.zookeeper-url=zookeeper-host1:2181/solr,zookeeper-
host2:2181/solr,zookeeper-host3:2181/solr
 index.search.solr.configset=janusgraph
16 #HBaseCon
with HBase
 HBase – Perfect Storage Backend for JanusGraph
Look into more details
 Stores to Column Families
 Edge store  e
 Index store  g
 ID store  i
 Transaction log store  l
 System property store  s
 CF attributes can be set. E.g. compression, TTL.
17 #HBaseCon
with HBase
 HBase – Perfect Storage Backend for JanusGraph
Look into more details
g.V().has("name", "Alice").out("knows").out("knows").values("name")
Query Plan to
Backend Store and
Index
Edge Store
Index Store
Index
provider
18 #HBaseCon
with HBase
 HBase – Perfect Storage Backend for JanusGraph
Look into more details
 A store (column family) is always specified.
 Get or Multi Get
 Batch to mutate
 Key range scan
 ColumnRangeFilter
 ColumnPaginationFilter
 HBase tuning
Edge Store
Index Store
19 #HBaseCon
with Google Cloud Bigtable
 Bigtable implements the HBase 1.0 client API
Need the latest version of the bigtable-hbase-1.0 artifact.
 storage.backend=hbase
 storage.hbase.ext.hbase.client.connection.impl=
com.google.cloud.bigtable.hbase1_0.BigtableConnection
 storage.hbase.ext.google.bigtable.project.id=
<Google Cloud Platform project id>
 storage.hbase.ext.google.bigtable.instance.id=<Bigtable instance id>
Thank you!

More Related Content

PDF
IBM Open by Design: Graph Technology
PDF
Enabling Multimodel Graphs with Apache TinkerPop
PDF
Graph Processing with Apache TinkerPop and Gremlin
PDF
JanusGraph: Looking Backward, Reaching Forward
PDF
Community-Driven Graphs with JanusGraph
PDF
JanusGraph, Jupyter Meetup NYC
PDF
On-boarding with JanusGraph Performance
PDF
Graph Processing with Titan and Scylla
IBM Open by Design: Graph Technology
Enabling Multimodel Graphs with Apache TinkerPop
Graph Processing with Apache TinkerPop and Gremlin
JanusGraph: Looking Backward, Reaching Forward
Community-Driven Graphs with JanusGraph
JanusGraph, Jupyter Meetup NYC
On-boarding with JanusGraph Performance
Graph Processing with Titan and Scylla

What's hot (20)

PDF
Start Flying with Python & Apache TinkerPop
PDF
Graph Computing with JanusGraph
PPTX
Large Scale Graph Analytics with JanusGraph
PPTX
Powers of Ten Redux
PDF
Graph Processing with Apache TinkerPop
PDF
Graph Computing with JanusGraph
PDF
HBaseCon2017 Community-Driven Graphs with JanusGraph
PDF
Graph Computing with Apache TinkerPop
PPTX
Janus graph lookingbackwardreachingforward
PDF
Exploring Graph Use Cases with JanusGraph
PDF
Airline Reservations and Routing: A Graph Use Case
PDF
Introduction to Data Engineer and Data Pipeline at Credit OK
PPTX
Graph Databases at Netflix
PDF
Presto Summit 2018 - 10 - Qubole
PDF
Presto talk @ Global AI conference 2018 Boston
PDF
Netflix running Presto in the AWS Cloud
PDF
Presto@Uber
PDF
Presto Summit 2018 - 07 - Lyft
PPTX
HBaseCon 2017: Community-Driven Graph with JanusGraph (updated)
PPTX
Presto@Netflix Presto Meetup 03-19-15
Start Flying with Python & Apache TinkerPop
Graph Computing with JanusGraph
Large Scale Graph Analytics with JanusGraph
Powers of Ten Redux
Graph Processing with Apache TinkerPop
Graph Computing with JanusGraph
HBaseCon2017 Community-Driven Graphs with JanusGraph
Graph Computing with Apache TinkerPop
Janus graph lookingbackwardreachingforward
Exploring Graph Use Cases with JanusGraph
Airline Reservations and Routing: A Graph Use Case
Introduction to Data Engineer and Data Pipeline at Credit OK
Graph Databases at Netflix
Presto Summit 2018 - 10 - Qubole
Presto talk @ Global AI conference 2018 Boston
Netflix running Presto in the AWS Cloud
Presto@Uber
Presto Summit 2018 - 07 - Lyft
HBaseCon 2017: Community-Driven Graph with JanusGraph (updated)
Presto@Netflix Presto Meetup 03-19-15
Ad

Similar to Community-Driven Graphs with JanusGraph (20)

PPTX
HBaseConAsia2018: Track2-5: JanusGraph-Distributed graph database with HBase
PPTX
Graph databases: Tinkerpop and Titan DB
PPTX
Large Scale Graph Analytics with JanusGraph
PDF
Michael stack -the state of apache h base
PPTX
Graph Data: a New Data Management Frontier
PPTX
Hadoop summit 2017 enterprise graph analytics
PPTX
Hadoop Summit 2017 Enterprise Graph Analytics
PPTX
A Survey of HBase Application Archetypes
PPTX
Enterprise large scale graph analytics and computing base on distribute graph...
PPTX
Unit II Hadoop Ecosystem_Updated.pptx
PDF
HBase ArcheTypes
PPTX
A Graph Service for Global Web Entities Traversal and Reputation Evaluation B...
PPTX
A Graph Service for Global Web Entities Traversal and Reputation Evaluation B...
PPTX
JanusGraph DataBase Concepts
PDF
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
PPTX
Attack on graph
PDF
IBM Graph – Graph Database-as-a-Service: Managing Data and Its Relationships ...
PDF
Intro to HBase - Lars George
PDF
GraphTech Ecosystem - part 1: Graph Databases
PDF
HBase and Impala Notes - Munich HUG - 20131017
HBaseConAsia2018: Track2-5: JanusGraph-Distributed graph database with HBase
Graph databases: Tinkerpop and Titan DB
Large Scale Graph Analytics with JanusGraph
Michael stack -the state of apache h base
Graph Data: a New Data Management Frontier
Hadoop summit 2017 enterprise graph analytics
Hadoop Summit 2017 Enterprise Graph Analytics
A Survey of HBase Application Archetypes
Enterprise large scale graph analytics and computing base on distribute graph...
Unit II Hadoop Ecosystem_Updated.pptx
HBase ArcheTypes
A Graph Service for Global Web Entities Traversal and Reputation Evaluation B...
A Graph Service for Global Web Entities Traversal and Reputation Evaluation B...
JanusGraph DataBase Concepts
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Attack on graph
IBM Graph – Graph Database-as-a-Service: Managing Data and Its Relationships ...
Intro to HBase - Lars George
GraphTech Ecosystem - part 1: Graph Databases
HBase and Impala Notes - Munich HUG - 20131017
Ad

Recently uploaded (20)

PPTX
1_Introduction to advance data techniques.pptx
PPTX
Introduction to machine learning and Linear Models
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Fluorescence-microscope_Botany_detailed content
PDF
Mega Projects Data Mega Projects Data
PPT
Quality review (1)_presentation of this 21
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Business Analytics and business intelligence.pdf
1_Introduction to advance data techniques.pptx
Introduction to machine learning and Linear Models
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
SAP 2 completion done . PRESENTATION.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Qualitative Qantitative and Mixed Methods.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Fluorescence-microscope_Botany_detailed content
Mega Projects Data Mega Projects Data
Quality review (1)_presentation of this 21
STERILIZATION AND DISINFECTION-1.ppthhhbx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Business Analytics and business intelligence.pdf

Community-Driven Graphs with JanusGraph

  • 1. Jing Chen He [email protected] • Apache HBase PMC • JanusGraph TSC Jason Plurad • [email protected] • Apache TinkerPop PMC • JanusGraph TSC HBaseCon West 2017 • June 12, 2017 Community-Driven Graphs with JanusGraph
  • 2. Agenda Property Graphs Graph Community Introduction to JanusGraph JanusGraph with HBase 2 #HBaseCon
  • 3. Graph  Born for relationship!  Intuitive modeling  Expressive querying  Native analysis 3 #HBaseCon https://tinkerpop.apache.org/docs/3.2.4/reference/#intro
  • 4. Graph Data Use Cases  Social network analysis  Configuration management database  Master data management  Recommendation engines  Knowledge graphs  Internet of things  Cybersecurity attack analysis 4 #HBaseCon
  • 5. Apache TinkerPop  Open source, vendor-agnostic, graph computing framework  Gremlin graph traversal language 5 Apache TinkerPop™ Maintainer Apache Software Foundation License Apache Latest Release 3.2.4 February 2017 https://tinkerpop.apache.org #HBaseCon
  • 6. Gremlin Graph Traversal Language 6 #HBaseCon https://tinkerpop.apache.org/gremlin.html
  • 9.  Scalable graph database distributed on multi-machine clusters with pluggable storage and indexing  Fully-compliant with Apache TinkerPop graph computing framework  Vendor-neutral, open community with open governance – Founding members: Expero, Google, GRAKN.AI, Hortonworks, IBM – Latest members: Amazon, Netflix, Orchestral Developments, Uber 9 JanusGraph™ Maintainer Linux Foundation License Apache Latest Release 0.1.0 April 2017 https://janusgraph.org #HBaseCon
  • 10. 10 #HBaseCon Architecture Google Cloud Bigtable http://docs.janusgraph.org/latest/arch-overview.html
  • 13. 13 #HBaseCon with HBase  HBase – Perfect Storage Backend for JanusGraph Big enough for your biggest graph! The storage model Read and write speed Scalability and partitioning Strong consistency Tight integration with Hadoop Ecosystem Great open community! http://docs.janusgraph.org/latest/hbase.html
  • 14. 14 #HBaseCon with HBase  HBase – Perfect Storage Backend for JanusGraph Simple configuration!  conf/janusgraph-hbase-solr.properties  storage.backend=hbase  storage.hostname=zookeeper-host1,zookeeper-host2,zookeeper-host3  storage.hbase.table=janusgraph  storage.hbase.ext.zookeeper.znode.parent=/hbase  storage.hbase.ext.hbase.zookeeper.property.clientPort=2181  Just open your graph!  graph=JanusGraphFactory.open('conf/janusgraph-hbase-solr.properties') Optional Optional
  • 15. 15 #HBaseCon with HBase  HBase – Perfect Storage Backend for JanusGraph Throw in an Index Backend for better performance  conf/janusgraph-hbase-solr.properties  index.search.backend=solr  index.search.solr.mode=cloud  index.search.solr.zookeeper-url=zookeeper-host1:2181/solr,zookeeper- host2:2181/solr,zookeeper-host3:2181/solr  index.search.solr.configset=janusgraph
  • 16. 16 #HBaseCon with HBase  HBase – Perfect Storage Backend for JanusGraph Look into more details  Stores to Column Families  Edge store  e  Index store  g  ID store  i  Transaction log store  l  System property store  s  CF attributes can be set. E.g. compression, TTL.
  • 17. 17 #HBaseCon with HBase  HBase – Perfect Storage Backend for JanusGraph Look into more details g.V().has("name", "Alice").out("knows").out("knows").values("name") Query Plan to Backend Store and Index Edge Store Index Store Index provider
  • 18. 18 #HBaseCon with HBase  HBase – Perfect Storage Backend for JanusGraph Look into more details  A store (column family) is always specified.  Get or Multi Get  Batch to mutate  Key range scan  ColumnRangeFilter  ColumnPaginationFilter  HBase tuning Edge Store Index Store
  • 19. 19 #HBaseCon with Google Cloud Bigtable  Bigtable implements the HBase 1.0 client API Need the latest version of the bigtable-hbase-1.0 artifact.  storage.backend=hbase  storage.hbase.ext.hbase.client.connection.impl= com.google.cloud.bigtable.hbase1_0.BigtableConnection  storage.hbase.ext.google.bigtable.project.id= <Google Cloud Platform project id>  storage.hbase.ext.google.bigtable.instance.id=<Bigtable instance id>

Editor's Notes

  • #2: Abstract: Graphs are well-suited for many use cases to express and process complex relationships among entities in enterprise and social contexts. Fueled by the growing interest in graphs, there are various graph databases and processing systems that dot the graph landscape. JanusGraph is a community-driven project that continues the legacy of Titan, a pioneer of open source graph databases. JanusGraph is a scalable graph database optimized for large scale transactional and analytical graph processing. In the session, we will introduce JanusGraph, which features full integration with the Apache TinkerPop graph stack. We will discuss JanusGraph's optimized storage model that relies on HBase for fast graph transversal and processing.
  • #4: Brief history with TinkerPop. Long history as an open source project.
  • #5: Brief history with TinkerPop. Long history as an open source project.
  • #6: Brief history with TinkerPop. Long history as an open source project.
  • #7: Lots of interesting parts to graph system integration. Lots of ways to extend and contribute.
  • #8: Lots of interesting parts to graph system integration. Lots of ways to extend and contribute.
  • #9: Lots of interesting parts to graph system integration. Lots of ways to extend and contribute.