SlideShare a Scribd company logo
Self-Service Analytics – For Enterprise
Audience
• Sreejith Madhavan
– msreejith@yahoo.com
– https://www.linkedin.com/in/msreejith
Enterprise Analytics Portfolio – Lay of
The Land
Data Analytics – Basic Concepts
• Business Intelligence
o Using the available data to make factual business decisions
o “WHAT” is happening to your business right now?
• Business Analytics
o Steps that lead up to business decision
o Data Mining - process of looking for trends, patterns, or other useful
information within dataset
o Diagnostic analytics - “WHY” something is happening right now
o Predictive analytics - “WHAT Will” happen in future
o Prescriptive analytics - “WHAT Should be Done next”
Enterprise Analytics Landscape
• Enterprises typically have Users categorized broadly as -
o Business users – most interested in current metrics, fiscal trends, dashboards
o Engineering users – most interested in diagnostics (find needle-in-haystack),
deep-analytics
o An enterprise analytics solution stack should cover self-service needs to above
broad user-base
• Existing Data-stores Have Varying Use-cases
o Representing specialized data (application specific)
o Organizational units having independent solutions (IT, Engineering, Support etc..)
o Data architecture demands (BI tool backend, Datamarts, OLTP/OLAP etc)
• Enter Hadoop Datalake…
o Answering “Why” you need Hadoop Datalake in your Analytics landscape is critical
o What short, long term goals need to be met
o Not meant to be a one-stop-shop solution to replace existing Databases and
workflows
o Enterprise has several types of Users (by broad skill level) – A self service solution
stack should cater to broad User base by having mix-of several tools
Understanding Existing Data-Stores
Structured
data of Pre-
Computed
measures
Analytical
Cubes
Currently
SQL Server
Business
Analytics
system
Structured
data as Star
schema
with Dims
and Facts
Datamart
Currently
Oracle
Decision
Support
system/
Datamart
Structured,
Semi-
structured
data per
Event
granularity
Hive, M/R,
Datameer
Big Data
system
(Datalake)
Original
data
persisted in
its incoming
form
HDFS(M/R),
NFS
(Scripts),
REST
Raw Data
Highly granular and
complete dataset
Lower granularity and
subset of source data
Good for standard
Biz Metrics of
current and fiscal
trend
Good for interactive
Adhoc reporting
Good for diagnostic
mining and general
Adhoc reports at
scale
Useful to do ELT to
feed into other data
sources
Access
Interface/Tool
Data
Characteristic
Advanced Users (Data
Engineers/Scientists)
Enhance and persist
data-model, Develop
Deep insights
workflows
Frameworks, APIs
Map-reduce, Hive, Pig,
Spark, R, Programmatic
(JDBC..)
Technical Analysts
Generate Adhoc and
canned reports
SQL and
Transformation-
workflow based Tools
Oracle, SQL-Server,
Hive, R, Vertica,
Teradata, Datameer,
Tableau, PDI
Exec-users (Non-
Technical)
Consume predefined
metrics, Dashboards,
drag-n-drop what-if
analysis
Visual, Natural
language based tools
Tableau, OBIEE, PBA,
Excel, Microstrategy,
Search UI
End User Categories and Expectations
Usage
Characteristics
Interface
Characteristics
Sample Tools In each
Vertical
User and Use-case Requirement Considerations
• Demarcate target Users – Provision right Tool to right Users/Use-cases
– Not all users can should be given a Hadoop Datalake interface in self-service model
– Not one tool can fit all Use-cases
• Get to a Consolidated view of existing Data Sources to cover most
common domain objects to target “BI” based self-service model
• Data architecture - Data-layout and Data-model for the above
“Consolidated view”
– Star-schema vs Analytic Cube vs Flat OLTP schema
– MPP Analytic Database vs OLAP Cube vs DSS
– Traversing and Finding Metadata - Search interface to find entities, attributes and data
– Documentation covering data-model and data-dictionary
• Performance considerations
– High Performance and Concurrency support backend for interfacing BI Tools
– Scalable environment for batch, mining use-cases
– Interactive programmatic platform for data engineering
• Miscellaneous Operational Considerations (slide7)
Holistic View For Building E2E Analytics
Platform
Objectives For Holistic Analytics Platform
• Establish a self-service Analytics platform to cover BI and
Analytics use-cases for Internal users
• Support 3Vs of User types and Access patterns
o Volume of data
o Variety of Users (Programmatic and Non-technical)
o Variety of Queries (Adhoc, Not pre-defined)
o Velocity (Interactive query response, Dashboarding)
• Design Principles
o Embrace ideology of “one-tool doesn’t fit all use-cases and user preferences”
o Ease of Use (Front-end interface and Backend Data-model)
o Improved Performance to query response times
Datalake Analytics Platform – Conceptual View
MPP/Analytic
Database
PUAT Datamart Hive HDFS
BI Tool Front-End
Spark
Hue UI
(Hive, Search)
DataStore
Layer
Processing
Engine
Layer
Viz.and
Data
Access
Layer
• Focus on Data Processing & Integration frameworks
• Adhoc Data mining, complex data transformations, Machine learning
• 25-50 Concurrent users
• Focus on Visualization & Metrics (not Data Processing)
• Support Adhoc and Canned Self-service Reports
• 100+ Concurrent users
Extended
Datamodel
Cloudera Search
Spark CLI,
Hive Jdbc
(Programmatic
Access)
Datameer
(Non-
Programmatic)
Engineering focused Self-serve Reporting (Analysts &
Data engineers, Data scientists)
Business focused Self-serve Reporting (Analysts, Execs,
non-technical Audience)
Search
Front-End
Datalake Analytics Platform – Technology View
HDFS
(Orig Source)
Spark Data Prep
FW
M/R Daily HDFS
Transforms
HDFS
(Transformed)
Hive/Impala
Time based
SeqFile
Layout
System based
PARQUET
Layout
Adhoc Query
Hue UI/ Edge
Node CLI
Vertica MPP
Analytic DB
(12 month window)
On-demand
Parsed content
Datam
art
Structured
Config Feed
Cloudera Search
Indexing Prep FW
SSAS
Latest System
Snapshot raw
Latest Week Raw
& Structured
Data-
Prep/Transform
(SnapLogic/Data
meer)
Cloudera
Search Hue
UI
Tableau/Penta
ho BA
Spark
CLI/MLLib
Data-Prep/Filter
& Import
(SnapLogic)
DistributedR
Flattened
Star-schema
ZoomData
Raw
Data
Export
Published
Extended schema
Text search & Search AnalyticsSelf-serve BI
Reporting
Statistical Analytics Adhoc SQL Queries On-demand Data Transformations
Other
Sources…
Existing Components
Processing Workflows
New ComponentsOther
Legend
Evolving Other Operational Requirements
Agility and Productivity for End users
Monitoring and Governance
- Monitor & recover user, system jobs/service failures
- Analytics on Analytics – user and system behaviour
- Data quality, security etc
Ease of access to Data
- Abstracting data complexities, Provisioning prep’ed data to cover standard use-cases
- Query response times, Data mobility(transfer) issues
Understanding the Dataset
- Documentation, Catalog, Data Dictionary, Data Exploration
External References
• https://www.vertica.com/2014/04/18/facebook-and-vertica-a-case-for-mpp-databases/
• https://practicalanalytics.wordpress.com/2015/06/11/databianalytics-evolution-netflix/
• http://www.thebigdatainsightgroup.com/site/sites/default/files/Teradata's%20-
%20Big%20Data%20Architecture%20-%20Putting%20all%20your%20eggs%20in%20one%20basket.pdf
• http://www.slideshare.net/Dataconomy/hp-vertica-dataconomy
• http://www.bryanbrandow.com/2014/05/microstrategy-vs-tableau.html
• http://www.experfy.com/blog/pentaho-vs-tableau-comparison-visualization-dashboards/

More Related Content

PDF
Who Should Own Data Governance – IT or Business?
PDF
Data at the Speed of Business with Data Mastering and Governance
PDF
You Need a Data Catalog. Do You Know Why?
PPTX
Data Governance
PDF
RWDG Slides: What is a Data Steward to do?
PDF
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PPTX
SPSNYC2019 - What is Common Data Model and how to use it?
Who Should Own Data Governance – IT or Business?
Data at the Speed of Business with Data Mastering and Governance
You Need a Data Catalog. Do You Know Why?
Data Governance
RWDG Slides: What is a Data Steward to do?
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
Building a Data Strategy – Practical Steps for Aligning with Business Goals
SPSNYC2019 - What is Common Data Model and how to use it?

What's hot (20)

PDF
Data Modeling Techniques
PPTX
Building a Data Analytics Center of Excellence - Digital Transformation
PDF
Creating a Data-Driven Organization, Crunchconf, October 2015
PDF
Data Architecture Strategies: Data Architecture for Digital Transformation
PPTX
Data Vault Vs Data Lake
PPTX
Zero to Snowflake Presentation
PDF
Modern Data architecture Design
PDF
Data Lake: A simple introduction
PDF
Data Analytics Strategy
PPTX
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
PDF
Straight Talk to Demystify Data Lineage
PDF
Data Architecture Strategies: The Rise of the Graph Database
PDF
Data Catalog as the Platform for Data Intelligence
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
Data Catalog for Better Data Discovery and Governance
PDF
Data Modelling is NOT just for RDBMS's
PDF
8 Steps to Creating a Data Strategy
PPTX
Snowflake Datawarehouse Architecturing
PPTX
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
PDF
Time to Talk about Data Mesh
Data Modeling Techniques
Building a Data Analytics Center of Excellence - Digital Transformation
Creating a Data-Driven Organization, Crunchconf, October 2015
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Vault Vs Data Lake
Zero to Snowflake Presentation
Modern Data architecture Design
Data Lake: A simple introduction
Data Analytics Strategy
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
Straight Talk to Demystify Data Lineage
Data Architecture Strategies: The Rise of the Graph Database
Data Catalog as the Platform for Data Intelligence
Enabling a Data Mesh Architecture with Data Virtualization
Data Catalog for Better Data Discovery and Governance
Data Modelling is NOT just for RDBMS's
8 Steps to Creating a Data Strategy
Snowflake Datawarehouse Architecturing
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Time to Talk about Data Mesh
Ad

Viewers also liked (13)

PDF
The Power of Self Service Reporting
DOC
Obiee metadata dictionary
PDF
Extending the Self-Service Capabilities of SAP BI with SAP BusinessObjects Ex...
PPTX
Agile collaborative practices
PPTX
Trivial works.com introduction
PPT
Agile Development For Rte Systems
PDF
Collaborative and agile development of mobile applications
PPTX
The Business Benefits of a Data-Driven, Self-Service BI Organization
PDF
Realtime Reporting using Spark Streaming
PDF
The Complete Guide to Embedded Analytics
PPT
Agile presentation
PPTX
Tableau Server Basics
PPTX
Overview of Agile Methodology
The Power of Self Service Reporting
Obiee metadata dictionary
Extending the Self-Service Capabilities of SAP BI with SAP BusinessObjects Ex...
Agile collaborative practices
Trivial works.com introduction
Agile Development For Rte Systems
Collaborative and agile development of mobile applications
The Business Benefits of a Data-Driven, Self-Service BI Organization
Realtime Reporting using Spark Streaming
The Complete Guide to Embedded Analytics
Agile presentation
Tableau Server Basics
Overview of Agile Methodology
Ad

Similar to Self Service Reporting & Analytics For an Enterprise (20)

PPTX
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
PDF
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
PPTX
Tableau and hadoop
PDF
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
PPTX
Big Data SE vs. SE for Big Data
PDF
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
PDF
18Mar14 Find the Hidden Signal in Market Data Noise Webinar
PDF
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
PDF
Teradata - Presentation at Hortonworks Booth - Strata 2014
PPTX
Big data unit 2
PPTX
AzureDay - Introduction Big Data Analytics.
PPTX
Skillwise Big Data part 2
PDF
SFScon19 - Grazia Cazzin - KNOWAGE the open source answer to the new needs in...
PDF
Architecting Agile Data Applications for Scale
PPTX
No sql and sql - open analytics summit
PPTX
Introduction To Big Data & Hadoop
PPTX
Skilwise Big data
PDF
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
PPT
Kushal Data Warehousing PPT
PDF
Hadoop meets Agile! - An Agile Big Data Model
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Tableau and hadoop
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data SE vs. SE for Big Data
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
18Mar14 Find the Hidden Signal in Market Data Noise Webinar
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
Teradata - Presentation at Hortonworks Booth - Strata 2014
Big data unit 2
AzureDay - Introduction Big Data Analytics.
Skillwise Big Data part 2
SFScon19 - Grazia Cazzin - KNOWAGE the open source answer to the new needs in...
Architecting Agile Data Applications for Scale
No sql and sql - open analytics summit
Introduction To Big Data & Hadoop
Skilwise Big data
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Kushal Data Warehousing PPT
Hadoop meets Agile! - An Agile Big Data Model

Recently uploaded (20)

PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPT
DATA COLLECTION METHODS-ppt for nursing research
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PDF
Introduction to the R Programming Language
PPTX
Modelling in Business Intelligence , information system
PDF
How to run a consulting project- client discovery
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPT
Predictive modeling basics in data cleaning process
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Lecture1 pattern recognition............
PPTX
Managing Community Partner Relationships
PDF
annual-report-2024-2025 original latest.
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
DATA COLLECTION METHODS-ppt for nursing research
Introduction-to-Cloud-ComputingFinal.pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Introduction to the R Programming Language
Modelling in Business Intelligence , information system
How to run a consulting project- client discovery
Galatica Smart Energy Infrastructure Startup Pitch Deck
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Predictive modeling basics in data cleaning process
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Lecture1 pattern recognition............
Managing Community Partner Relationships
annual-report-2024-2025 original latest.

Self Service Reporting & Analytics For an Enterprise

  • 1. Self-Service Analytics – For Enterprise Audience • Sreejith Madhavan – [email protected] – https://www.linkedin.com/in/msreejith
  • 2. Enterprise Analytics Portfolio – Lay of The Land
  • 3. Data Analytics – Basic Concepts • Business Intelligence o Using the available data to make factual business decisions o “WHAT” is happening to your business right now? • Business Analytics o Steps that lead up to business decision o Data Mining - process of looking for trends, patterns, or other useful information within dataset o Diagnostic analytics - “WHY” something is happening right now o Predictive analytics - “WHAT Will” happen in future o Prescriptive analytics - “WHAT Should be Done next”
  • 4. Enterprise Analytics Landscape • Enterprises typically have Users categorized broadly as - o Business users – most interested in current metrics, fiscal trends, dashboards o Engineering users – most interested in diagnostics (find needle-in-haystack), deep-analytics o An enterprise analytics solution stack should cover self-service needs to above broad user-base • Existing Data-stores Have Varying Use-cases o Representing specialized data (application specific) o Organizational units having independent solutions (IT, Engineering, Support etc..) o Data architecture demands (BI tool backend, Datamarts, OLTP/OLAP etc) • Enter Hadoop Datalake… o Answering “Why” you need Hadoop Datalake in your Analytics landscape is critical o What short, long term goals need to be met o Not meant to be a one-stop-shop solution to replace existing Databases and workflows o Enterprise has several types of Users (by broad skill level) – A self service solution stack should cater to broad User base by having mix-of several tools
  • 5. Understanding Existing Data-Stores Structured data of Pre- Computed measures Analytical Cubes Currently SQL Server Business Analytics system Structured data as Star schema with Dims and Facts Datamart Currently Oracle Decision Support system/ Datamart Structured, Semi- structured data per Event granularity Hive, M/R, Datameer Big Data system (Datalake) Original data persisted in its incoming form HDFS(M/R), NFS (Scripts), REST Raw Data Highly granular and complete dataset Lower granularity and subset of source data Good for standard Biz Metrics of current and fiscal trend Good for interactive Adhoc reporting Good for diagnostic mining and general Adhoc reports at scale Useful to do ELT to feed into other data sources Access Interface/Tool Data Characteristic
  • 6. Advanced Users (Data Engineers/Scientists) Enhance and persist data-model, Develop Deep insights workflows Frameworks, APIs Map-reduce, Hive, Pig, Spark, R, Programmatic (JDBC..) Technical Analysts Generate Adhoc and canned reports SQL and Transformation- workflow based Tools Oracle, SQL-Server, Hive, R, Vertica, Teradata, Datameer, Tableau, PDI Exec-users (Non- Technical) Consume predefined metrics, Dashboards, drag-n-drop what-if analysis Visual, Natural language based tools Tableau, OBIEE, PBA, Excel, Microstrategy, Search UI End User Categories and Expectations Usage Characteristics Interface Characteristics Sample Tools In each Vertical
  • 7. User and Use-case Requirement Considerations • Demarcate target Users – Provision right Tool to right Users/Use-cases – Not all users can should be given a Hadoop Datalake interface in self-service model – Not one tool can fit all Use-cases • Get to a Consolidated view of existing Data Sources to cover most common domain objects to target “BI” based self-service model • Data architecture - Data-layout and Data-model for the above “Consolidated view” – Star-schema vs Analytic Cube vs Flat OLTP schema – MPP Analytic Database vs OLAP Cube vs DSS – Traversing and Finding Metadata - Search interface to find entities, attributes and data – Documentation covering data-model and data-dictionary • Performance considerations – High Performance and Concurrency support backend for interfacing BI Tools – Scalable environment for batch, mining use-cases – Interactive programmatic platform for data engineering • Miscellaneous Operational Considerations (slide7)
  • 8. Holistic View For Building E2E Analytics Platform
  • 9. Objectives For Holistic Analytics Platform • Establish a self-service Analytics platform to cover BI and Analytics use-cases for Internal users • Support 3Vs of User types and Access patterns o Volume of data o Variety of Users (Programmatic and Non-technical) o Variety of Queries (Adhoc, Not pre-defined) o Velocity (Interactive query response, Dashboarding) • Design Principles o Embrace ideology of “one-tool doesn’t fit all use-cases and user preferences” o Ease of Use (Front-end interface and Backend Data-model) o Improved Performance to query response times
  • 10. Datalake Analytics Platform – Conceptual View MPP/Analytic Database PUAT Datamart Hive HDFS BI Tool Front-End Spark Hue UI (Hive, Search) DataStore Layer Processing Engine Layer Viz.and Data Access Layer • Focus on Data Processing & Integration frameworks • Adhoc Data mining, complex data transformations, Machine learning • 25-50 Concurrent users • Focus on Visualization & Metrics (not Data Processing) • Support Adhoc and Canned Self-service Reports • 100+ Concurrent users Extended Datamodel Cloudera Search Spark CLI, Hive Jdbc (Programmatic Access) Datameer (Non- Programmatic) Engineering focused Self-serve Reporting (Analysts & Data engineers, Data scientists) Business focused Self-serve Reporting (Analysts, Execs, non-technical Audience) Search Front-End
  • 11. Datalake Analytics Platform – Technology View HDFS (Orig Source) Spark Data Prep FW M/R Daily HDFS Transforms HDFS (Transformed) Hive/Impala Time based SeqFile Layout System based PARQUET Layout Adhoc Query Hue UI/ Edge Node CLI Vertica MPP Analytic DB (12 month window) On-demand Parsed content Datam art Structured Config Feed Cloudera Search Indexing Prep FW SSAS Latest System Snapshot raw Latest Week Raw & Structured Data- Prep/Transform (SnapLogic/Data meer) Cloudera Search Hue UI Tableau/Penta ho BA Spark CLI/MLLib Data-Prep/Filter & Import (SnapLogic) DistributedR Flattened Star-schema ZoomData Raw Data Export Published Extended schema Text search & Search AnalyticsSelf-serve BI Reporting Statistical Analytics Adhoc SQL Queries On-demand Data Transformations Other Sources… Existing Components Processing Workflows New ComponentsOther Legend
  • 12. Evolving Other Operational Requirements Agility and Productivity for End users Monitoring and Governance - Monitor & recover user, system jobs/service failures - Analytics on Analytics – user and system behaviour - Data quality, security etc Ease of access to Data - Abstracting data complexities, Provisioning prep’ed data to cover standard use-cases - Query response times, Data mobility(transfer) issues Understanding the Dataset - Documentation, Catalog, Data Dictionary, Data Exploration
  • 13. External References • https://www.vertica.com/2014/04/18/facebook-and-vertica-a-case-for-mpp-databases/ • https://practicalanalytics.wordpress.com/2015/06/11/databianalytics-evolution-netflix/ • http://www.thebigdatainsightgroup.com/site/sites/default/files/Teradata's%20- %20Big%20Data%20Architecture%20-%20Putting%20all%20your%20eggs%20in%20one%20basket.pdf • http://www.slideshare.net/Dataconomy/hp-vertica-dataconomy • http://www.bryanbrandow.com/2014/05/microstrategy-vs-tableau.html • http://www.experfy.com/blog/pentaho-vs-tableau-comparison-visualization-dashboards/

Editor's Notes

  • #5: Business users (typically from Sales, Product management, Other execs) Engineering users (Developers, QA, Technical support engineers, Analysts, Data scientists)
  • #10: User Types: - Semi/non- technical users – easy to use drag-n-drop interface - advanced users - Programmatic and SQL based interfaces Improved Performance considerations - High Performance and Concurrent platform for user interactions via BI Tools - Scalable environment for batch, mining use-cases - nteractive programmatic platform for data engineering
  • #11: Business users workflows: - Self-service - Answer “What” questions - Analytic Database – consolidate data model supporting quick Vizn, Performance and lower learning curve Engineering users workflows: - Self-service – Answer “Why” and “What next” questions
  • #12: CLI – Command-line Interface MLLib – Machine learning Lib Data Prep FW – Data Preparation framework MPP – Massive Parallel Processing BI – Business Intelligence