FROM DATAFRAMES TO GRAPH
Data Science with pyTigerGraph
Parker Erickson
Graph+AI World 2020
ABOUT ME
• Senior at University of Minnesota pursuing
a B.S. and M.S. in Computer Science
• Creator of pyTigerGraph
• Part-time software engineer at Optum
• Inventor of a patent-pending fraud
detection algorithm using graph ML
• Connect with me:
https://www.linkedin.com/in/parker-erickson/
PARKER ERICKSON ©2020
OBJECTIVES
• Learn about the pyTigerGraph Python package
• What it is used for
• Learn why Data Scientists prefer to use Python
• Intro to Graph ML Algorithms
• Learn about when to use Python or GSQL
• Tradeoffs between them
• Learn how to get started with pyTigerGraph
• Future Direction
WHY GRAPH?
• Graphs not only store data,
they store relationships
between data in things like:
• Social Networks
• Fraud Rings
• Recommendation Engines
• Graph Machine Learning
algorithms are new (~2016)
• Graphs can enable
Explainable AI
https://tech.ebayinc.com/research/explainable-reasoning-over-knowledge-graphs-for-recommendation/
ENABLING GRAPH DATA SCIENCE
• Many organizations have data
scientists that are Python experts
• Tools should be what they are
comfortable with
• Lower need for simple GSQL queries
• Loading data, simple analysis
• Focus on algorithms and analysis,
not learning new languages
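A minimal sketch of the "loading data" step, assuming a hypothetical transaction table with `sender`, `receiver`, and `amount` columns. It uses plain Python dictionaries in place of a DataFrame and does not call pyTigerGraph; it only shows how tabular rows split into vertices and edges before they are loaded into a graph.

```python
from collections import namedtuple

# Hypothetical tabular data: each row is one transaction between two accounts.
rows = [
    {"sender": "alice", "receiver": "bob", "amount": 120.0},
    {"sender": "bob", "receiver": "carol", "amount": 75.5},
    {"sender": "alice", "receiver": "carol", "amount": 20.0},
]

Edge = namedtuple("Edge", ["source", "target", "attributes"])


def rows_to_graph(rows, source_col, target_col):
    """Split tabular rows into a set of vertex ids and a list of edges."""
    vertices = set()
    edges = []
    for row in rows:
        source, target = row[source_col], row[target_col]
        vertices.update((source, target))
        # Every remaining column becomes an edge attribute.
        attributes = {
            key: value
            for key, value in row.items()
            if key not in (source_col, target_col)
        }
        edges.append(Edge(source, target, attributes))
    return vertices, edges


vertices, edges = rows_to_graph(rows, "sender", "receiver")
print(sorted(vertices))
print(edges[0])
```

The vertex set and edge list are the two shapes a graph loader typically expects, so the same split applies whichever client library performs the actual upload.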
PYTIGERGRAPH
• pyTigerGraph enables data scientists
to easily create, load, and analyze
graph relationships
• Opens the door to various graph
machine learning algorithms, such as:
• Node2Vec
• Graph Convolutional Neural Networks
• Graph Attention Networks
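Node2Vec builds its embeddings from random walks over the graph. The sketch below generates uniform random walks only, which corresponds to the special case where Node2Vec's return and in-out parameters are both 1; the biased walk and the Word2Vec-style training step are left out. The graph and the function name `random_walk` are illustrative assumptions, not part of pyTigerGraph.

```python
import random

# Hypothetical undirected graph stored as an adjacency list.
graph = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a", "b", "d"],
    "d": ["c"],
}


def random_walk(graph, start, length, rng):
    """Return a walk of up to `length` vertices, choosing neighbors uniformly."""
    walk = [start]
    while len(walk) < length:
        neighbors = graph[walk[-1]]
        if not neighbors:  # dead end: stop the walk early
            break
        walk.append(rng.choice(neighbors))
    return walk


rng = random.Random(0)  # fixed seed so runs are repeatable
# Two walks of five vertices starting from every vertex.
walks = [random_walk(graph, v, 5, rng) for v in graph for _ in range(2)]
for walk in walks:
    print(" -> ".join(walk))
```

Each walk plays the role of a sentence, and each vertex the role of a word, when the walks are later fed to a Word2Vec-style model.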
TRADITIONAL ML + GRAPH
• Graph can accelerate traditional ML
workflows
• Avoiding costly JOINs across tables
accelerates development
• Enrich knowledge graphs using outputs
from ML algorithms
• Sentiment Analysis
• Entity Extraction
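One way a graph can feed a traditional ML workflow is by supplying per-vertex features that would otherwise need joins across tables. The sketch below, using a hypothetical edge list of account transfers, derives in-degree and out-degree counts and lays them out as feature rows for a tabular model. The column names are assumptions chosen for illustration.

```python
from collections import Counter

# Hypothetical edge list: (sender, receiver) pairs.
edges = [
    ("alice", "bob"),
    ("bob", "carol"),
    ("alice", "carol"),
    ("dave", "alice"),
]

# Count how many edges leave and enter each vertex.
out_degree = Counter(source for source, _ in edges)
in_degree = Counter(target for _, target in edges)

vertices = sorted({vertex for edge in edges for vertex in edge})

# One feature row per vertex, ready to feed a tabular model.
feature_rows = [
    {"id": v, "out_degree": out_degree[v], "in_degree": in_degree[v]}
    for v in vertices
]
for row in feature_rows:
    print(row)
```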
GRAPH MACHINE LEARNING
• Node2Vec
• Creates embeddings based on random
walks
• Based on Word2Vec
• Graph Neural Networks
• Use “message passing” to generate
representations of vertices in the graph
• Used for classification and regression
tasks
• Graph Convolutional Neural Networks
(Kipf & Welling, 2017)
• Graph Attention Networks (Veličković et
al., 2018)
• https://parkererickson.github.io/graph-
https://app.wandb.ai/yashkotadia/gatedgcn-pattern/reports/Part-1-Introduction-to-Graph-Neural-Networks-with-GatedGCN--VmlldzoyMDg4MjA
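A rough sketch of one message-passing step, assuming a tiny hypothetical graph with two-dimensional vertex features. Each vertex averages its own features with those of its neighbors. This is a simplification: a Graph Convolutional Network also applies a learned weight matrix, degree-based normalization, and a nonlinearity, and a Graph Attention Network replaces the plain average with learned attention weights.

```python
# Hypothetical graph (adjacency list) and 2-dimensional vertex features.
graph = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}
features = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [0.0, 3.0]}


def message_passing_step(graph, features):
    """Replace each vertex's features with the mean over itself and its neighbors."""
    updated = {}
    for vertex, neighbors in graph.items():
        # Messages: the vertex's own features plus one message per neighbor.
        messages = [features[vertex]] + [features[n] for n in neighbors]
        dims = len(features[vertex])
        updated[vertex] = [
            sum(message[i] for message in messages) / len(messages)
            for i in range(dims)
        ]
    return updated


hidden = message_passing_step(graph, features)
for vertex, vector in hidden.items():
    print(vertex, vector)
```

Stacking several such steps lets information travel further: after k steps, a vertex's representation depends on vertices up to k hops away.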
DEMO
CONTRIBUTE
• Special thanks to:
• Jon Herke – TigerGraph
• Szilard Barany – TigerGraph
• Yaniv Ben-Ami – Carleton College
• Submit an Issue or Pull Request:
• https://github.com/pyTigerGraph/pyTigerGraph
QUESTIONS
• Contact me at:
• parker.erickson30@gmail.com
• https://www.linkedin.com/in/parker-erickson/
• Notebooks are here:
• https://github.com/parkererickson/graphAIWorldDataframeToGraph
• Connect with the Community on Discord:
• https://discord.gg/XM7Cn9w
