Collaborative Data Exploration using
Conversational AI
Anand Ranganathan
Co-Founder & Chief AI Officer
anand@unscrambl.com
How do people consume data & analytics today?
– through charts & dashboards
What’s wrong with dashboards?
● Limited or no drill-downs
● Users don’t know how the data for the dashboards is produced
● Can’t ask a slightly different question
● Representative of an opinion; easy to cherry-pick stats
● Can be misleading
Analytics & BI tools have some way to go in making organizations truly data-driven
“Where can I find my data? I don’t know which database, table, query or tool to use”
“The presentation is tomorrow and the BI team is busy. What should I do?”
“Why are there so many dashboards?”
26.5% of organizations report being data-driven in 2022, down from 37% in 2017.
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
What is Conversational Analytics?
Allow any user to ask text or voice questions of their data and receive back a natural language + visual analysis of statistically relevant and actionable insights for that user.
*Note that in this talk, we focus on structured data stored in relational format (e.g. SQL databases, Excel sheets, etc.)
Qbo: Natural language conversations with data within collaboration platforms
“Hey QBO, how many policies are expiring in Oct 2022?”
“Hey QBO, why were new acquisitions in Feb lower?”
“Why have sales dropped in the North region this month?”
AI-Powered Data Analyst
Qbo sits between users and disparate, siloed datasets
BUSINESS USERS: Sales, Management, Marketing, Operations, HR
Natural Language Queries
Direct Connection to data
Supports 20+ data connectors, with access via a web interface or Microsoft Teams
Unscrambl Qbo Demo: on Covid Data
A (very simplified) overview of the NLU pipeline
Example question: number of trips in winter 2017 by age and gender
● Entity Recognition & Construction
● Identification of query type and mapping to known concepts in DB
  Type: Aggregation query on Trips table with a group-by and a filter; age -> derived from birth year attribute; gender -> gender attribute; in winter 2017 -> 2017-12-23 and 2018-03-19 (filter)
● Generate DB-specific SQL query
  SELECT anon_1."age group", anon_1.gender, count(*) AS "Count"
  FROM (SELECT "TRIP_ANALYSIS".end_station_id AS "end station id",
               "TRIP_ANALYSIS".program_id AS "program id",
               "TRIP_ANALYSIS".start_station_id AS "start station id",
               "TRIP_ANALYSIS".bikeid AS bikeid,
               CASE WHEN (:birth_year_1 - "TRIP_ANALYSIS".birth_year < :param_1) THEN :param_2
               ELSE CASE WHEN (:birth_year_2 - "TRIP_ANALYSIS".birth_year < :param_3) THEN :param_4
               ELSE …,
               "TRIP_ANALYSIS".gender AS gender
        FROM "TRIP_ANALYSIS"
        WHERE ….
● Get results, decide on visualization and narratives, and present back to user
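The pipeline above can be illustrated with a small sketch. This is not Qbo's implementation: the `ParsedQuery` structure, the `trips` table, its column names, the decade-based age bucket, and the sample rows are all assumptions made up for illustration. Only the example question and the winter 2017 date range (2017-12-23 to 2018-03-19) come from the slide. The sketch takes an already-parsed question, maps its concepts to SQL expressions, and generates a parameterized aggregation query with a group-by and a filter.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class ParsedQuery:
    """Hypothetical structured form of a question after entity recognition."""
    table: str
    group_by: list      # user-facing concepts, e.g. ["age", "gender"]
    date_column: str
    date_from: str      # ISO dates, inclusive
    date_to: str


# Assumed mapping from user concepts to SQL expressions (illustrative only).
# "age" is derived from birth year and bucketed by decade.
CONCEPT_TO_SQL = {
    "age": "(2017 - birth_year) / 10 * 10",
    "gender": "gender",
}


def to_sql(q):
    """Build a parameterized aggregation query with a group-by and a date filter."""
    exprs = [CONCEPT_TO_SQL[c] for c in q.group_by]
    select = ", ".join(f'{e} AS "{c}"' for e, c in zip(exprs, q.group_by))
    keys = ", ".join(exprs)
    sql = (
        f'SELECT {select}, count(*) AS "Count" FROM {q.table} '
        f"WHERE {q.date_column} BETWEEN ? AND ? "
        f"GROUP BY {keys} ORDER BY {keys}"
    )
    return sql, (q.date_from, q.date_to)


def run_example():
    """Run 'number of trips in winter 2017 by age and gender' on made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE trips (birth_year INTEGER, gender TEXT, start_date TEXT)"
    )
    conn.executemany(
        "INSERT INTO trips VALUES (?, ?, ?)",
        [
            (1985, "F", "2018-01-05"),
            (1988, "F", "2017-12-30"),
            (1990, "M", "2018-02-10"),
            (1992, "M", "2018-03-01"),
            (1980, "M", "2017-07-04"),  # outside the winter filter
        ],
    )
    q = ParsedQuery("trips", ["age", "gender"], "start_date",
                    "2017-12-23", "2018-03-19")
    sql, params = to_sql(q)
    rows = conn.execute(sql, params).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(run_example())  # [(20, 'F', 1), (20, 'M', 2), (30, 'F', 1)]
```

Table and column names are interpolated from a fixed mapping rather than from user text, while the date values are passed as bound parameters, in the same spirit as the `:param_1`-style placeholders in the slide's query.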
Key Challenge: Bridging the gap between users and data
● Users don’t know what to ask
● Users don’t know how to ask
● Users may pose questions in an ambiguous manner
● Users may use terms not in the dataset
● Data may be modeled in a variety of ways
● Hidden semantics and assumptions behind different tables and columns
● Data may be incomplete or unclean
● Data may be spread across silos
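One small piece of this gap, users using terms that are not in the dataset, can be sketched as a term-resolution step. The following is a minimal illustration, not how Qbo works: the column list borrows names from the trip example, while the synonym table, the `resolve_term` function, and the 0.6 cutoff are assumptions. It tries an exact column match, then a curated synonym, then a fuzzy match using the standard-library `difflib`, and reports which route succeeded so the system can ask for clarification when nothing matches.

```python
import difflib

# Column names taken from the trip example; the synonym table is hypothetical.
COLUMNS = ["birth_year", "gender", "start_station_id", "end_station_id", "bikeid"]
SYNONYMS = {"age": "birth_year", "sex": "gender", "bike": "bikeid"}


def resolve_term(term, columns=COLUMNS, synonyms=SYNONYMS, cutoff=0.6):
    """Map a user term to a column name; return (column, how_it_matched)."""
    key = term.strip().lower().replace(" ", "_")
    if key in columns:
        return key, "exact"
    if key in synonyms:
        return synonyms[key], "synonym"
    # Fall back to approximate string matching for typos and near misses.
    close = difflib.get_close_matches(key, columns, n=1, cutoff=cutoff)
    if close:
        return close[0], "fuzzy"
    return None, "unknown"


if __name__ == "__main__":
    for term in ["Gender", "age", "gendr", "profit"]:
        print(term, "->", resolve_term(term))
```

Returning the match type alongside the column lets a caller treat a fuzzy or unknown result differently, for example by confirming the interpretation with the user instead of silently running a query.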
Common scenario we encounter in our deployments: Qbo doesn’t understand the user
Now, let’s assume Lila is part of a group channel
Collaborative data exploration to the rescue
● Many organizations and teams contain people with different levels of data skills
● In the past, the data-have-nots were dependent on the data-haves
● Conversational AI can help them collaborate: allow the data-have-nots to serve themselves sometimes, and the data-haves to jump in when they can’t
Collaborate with internal and external users to explore & share data, insights and reports
With the right access control restrictions in place
● Internal Business Users
● 3rd Party Partners
● Natural Language Queries
● Direct Connection to data
● Collaborate with Partners in the context of “Teams” or “Channels”
Interesting challenges around collaborative data access
● How should access control work?
○ Based on the user asking the question?
○ Based on the user in the group channel with the “least” privilege?
● How do we keep track of context around multiple parallel conversations in a group chat?
● How do we keep track of versions and group edits in a collaborative dashboard?
○ Especially when the charts are interactive or can be refreshed
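The two access control options raised above can be compared with a short sketch. This is only an illustration of the question, not a description of Qbo: the `Policy` enum, the `visible_tables` function, the user names other than Lila, and the grants are all hypothetical. Under the asker-based policy an answer may draw on whatever the asking user can see; under the least-privilege policy it may draw only on what every member of the channel can see.

```python
from enum import Enum


class Policy(Enum):
    ASKER = "asker"                      # use only the asking user's grants
    LEAST_PRIVILEGE = "least_privilege"  # intersect grants of all channel members


def visible_tables(asker, channel_members, grants, policy):
    """Return the set of tables an answer posted in the channel may draw on."""
    if policy is Policy.ASKER:
        return set(grants.get(asker, set()))
    member_grants = [set(grants.get(m, set())) for m in channel_members]
    if not member_grants:
        return set()
    return set.intersection(*member_grants)


# Hypothetical grants: which tables each user may query.
GRANTS = {
    "lila": {"sales", "policies"},
    "raj": {"sales", "policies", "hr"},
    "partner": {"sales"},
}
CHANNEL = ["lila", "raj", "partner"]

if __name__ == "__main__":
    print(visible_tables("raj", CHANNEL, GRANTS, Policy.ASKER))
    print(visible_tables("raj", CHANNEL, GRANTS, Policy.LEAST_PRIVILEGE))
```

The trade-off is visible in the example: the asker-based policy could expose the `hr` table to a third-party partner reading the channel, while the least-privilege policy prevents that at the cost of limiting what more privileged users can ask in shared channels.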
Imagine…
#futureofwork
#futureofdata
