SlideShare a Scribd company logo
Welcome
Remember when it looked like this.
 They were all pretty much alike?
It used to be easy…
Now it’s quite
c0nfuZ1nG!
The return of big iron?
But it’s also quite
  exciting!
We Lose: Joe Hellerstein (Berkeley) 2001

 “Databases are commoditised and cornered
to slow-moving, evolving, structure intensive,
applications that require schema evolution.“ …
“The internet companies are lost and we will
remain in the doldrums of the enterprise
space.” …
“As databases are black boxes which require a
lot of coaxing to get maximum performance”
What Happened?
The Web
With a lot of users.
we   changed scale
tack
        ge   d
we ch an
New Approach to Data Access
Simple

Pragmatic

Solved an insoluble problem

Unencumbered by tradition (good & bad)
With this came a Different Focus
Tradition                              No SQL
•  Global consistency                  •  Local consistency
•  Schema driven                       •  Schemaless
•  Reliable Network                    •  Unreliable Network
•  Highly Structured                   •  Semi-structured/
                                          Unstructured


       NoSQL / Big Data technologies really focus on load
       and volume problems by avoiding the complexities
       associated with traditional transactional storage
The ‘Relational Camp’ had
      been busy too
 Realisation that the traditional
 architecture was insufficient for
    various modern workloads
End of an Era Paper - 2007
“Because RDBMSs can be beaten by more
than an order of magnitude on the standard
OLTP benchmark, then there is no market
where they are competitive. As such, they
should be considered as legacy technology
more than a quarter of a century in age, for
which a complete redesign and re-architecting
is the appropriate next step.” – Michael
Stonebraker
No Longer a One-Size-Fits-All
There is a new and impressive breed
•  Products ~ 5 years old
•  Shared nothing (sharded)
•  Designed for SSD’s & 10GE
•  Large address spaces (256GB+)
•  No indexes (column oriented)
•  Dropping traditional tenets (referential integrity
   etc)
•  Surprisingly quick for big queries when
   compared with incumbent technologies.
Both types of solution have
        clear value
..and it’s not really a question of size



TB
     0 1 10    100        1000                10,000




      The majority of us live in the overlap region
More a question of utility …
which this tends to lead to
   composite offerings
Compose Solutions
So what does this mean for the
         enterprise?
80% Enterprise Databases < 1TB




                           This
                           Reference
                           is getting
                           pretty old
                           now, sorry
                           (2009)
Yet we often have a lot of them
Communication is Store & Forward
The outside world
Sometimes we’re a bit more
       organized!
But most of our data is not that accessible

       Core
    Operational
                                  Exposed
       Data
…and sharing is often an afterthought

       Core
    Operational
                             Exposed
       Data
Services can help
But as data is getting bigger and
             heavier
..it can make it hard to join data
            together
So we often we turn to some form of
    Enterprise Data Warehouse
   (or maybe data virtualization)
Big data tech sometimes provides a
    composite solution (or ETL)
Ability to model data is much more of a
        gating factor than raw size

             Dave Campbell
    (Microsoft – VLDB Keynote 2012)
Importing data into a standard
model is a slow and painful processs
An alternative is to use a Late
       Bound Schema
Combining structured & unstructured approaches in a
        layered fashion makes the process more nimble



  Structured                                       Late Bound
Standardisation                                     Schema
     Layer


 Raw Data
We take this kind of approach
•  Grid of machines
•  Late bound schema
•  Sharded, immutable data
•  Low latency (real time) and high throughput
   (grid) use cases
•  All data is observable (an event)
•  Interfaces: Standardised (safe) or Raw
   (uayor)
Both Raw & Standardised data is available
                    Operational        Relational
                  (real time / MR)     Analytics



  Object/SQL
Standardisation


 Raw Data
This helps to loosen the grip of the
    single schema, whilst also
     providing a more iterative
   approach to standardisation
Support for both one standardised and many
  bespoke models in the same technology



                               Raw Facts from
                               different systems
           Standardised
              Model
Next step: to centralise common
        processing tasks



       Standardised      Risk
          Model       Calculation
Are we back to the mainframe?
Thanks



http://www.benstopford.com

More Related Content

PPTX
Big iron 2 (published)
PDF
Beyond The Data Grid: Coherence, Normalisation, Joins and Linear Scalability
PDF
Microservices for a Streaming World
PDF
JAX London Slides
PDF
Strata Software Architecture NY: The Data Dichotomy
PDF
The Power of the Log
PDF
Balancing Replication and Partitioning in a Distributed Java Database
PDF
Putting the Micro into Microservices with Stateful Stream Processing
Big iron 2 (published)
Beyond The Data Grid: Coherence, Normalisation, Joins and Linear Scalability
Microservices for a Streaming World
JAX London Slides
Strata Software Architecture NY: The Data Dichotomy
The Power of the Log
Balancing Replication and Partitioning in a Distributed Java Database
Putting the Micro into Microservices with Stateful Stream Processing

What's hot (20)

PDF
Streaming, Database & Distributed Systems Bridging the Divide
PDF
The Data Dichotomy- Rethinking the Way We Treat Data and Services
PPTX
Database Virtualization: The Next Wave of Big Data
PPTX
Enterprise Architect's view of Couchbase 4.0 with N1QL
PPTX
NoSQL and Couchbase
PDF
No sql3 rmoug
PPTX
MinneBar 2013 - Scaling with Cassandra
PPT
SQL/NoSQL How to choose ?
PPTX
Introduction to NuoDB
PPT
No sql
KEY
NoSQL Databases: Why, what and when
PPTX
Relational databases vs Non-relational databases
PPTX
PPT
RDBMS vs NoSQL
PDF
The Future of Distributed Databases
PPTX
Consistency in NoSQL
PPTX
Presentation on Databases in the Cloud
PPT
SQL Azure Dec Update
PPTX
Introduction to DataStax Enterprise Graph Database
PPTX
Intro to Big Data and NoSQL
Streaming, Database & Distributed Systems Bridging the Divide
The Data Dichotomy- Rethinking the Way We Treat Data and Services
Database Virtualization: The Next Wave of Big Data
Enterprise Architect's view of Couchbase 4.0 with N1QL
NoSQL and Couchbase
No sql3 rmoug
MinneBar 2013 - Scaling with Cassandra
SQL/NoSQL How to choose ?
Introduction to NuoDB
No sql
NoSQL Databases: Why, what and when
Relational databases vs Non-relational databases
RDBMS vs NoSQL
The Future of Distributed Databases
Consistency in NoSQL
Presentation on Databases in the Cloud
SQL Azure Dec Update
Introduction to DataStax Enterprise Graph Database
Intro to Big Data and NoSQL
Ad

Viewers also liked (20)

PDF
Big Data & the Enterprise
PDF
Data Pipelines with Apache Kafka
PDF
A little bit of clojure
PDF
Linux Performance Tools
PDF
Where Does Big Data Meet Big Database - QCon 2012
PDF
Ideas for Distributing Skills Across a Continental Divide
PDF
Test-Oriented Languages: Is it time for a new era?
PPTX
Refactoring tested code - has mocking gone wrong?
PDF
Coherence Implementation Patterns - Sig Nov 2011
PDF
Top Mistakes When Writing Reactive Applications - Scala by the Bay 2016
PPTX
MessageBus vs MessageBus
ODP
Time Manager Workshop at #QGIS2015 Conference in Nodebo
PDF
User Focused Security at Netflix: Stethoscope
PPT
forward and backward chaining
PDF
Taking the friction out of microservice frameworks with Lagom
PDF
Stay productive while slicing up the monolith
PDF
Modernizing Applications with Microservices
PPTX
2016 Tableau in the Cloud - A Netflix Original (AWS Re:invent)
PDF
The state of sbt 0.13, sbt server, and sbt 1.0 (ScalaMatsuri ver)
PDF
Hyperscale Computing, Enterprise Agility with Mesosphere
Big Data & the Enterprise
Data Pipelines with Apache Kafka
A little bit of clojure
Linux Performance Tools
Where Does Big Data Meet Big Database - QCon 2012
Ideas for Distributing Skills Across a Continental Divide
Test-Oriented Languages: Is it time for a new era?
Refactoring tested code - has mocking gone wrong?
Coherence Implementation Patterns - Sig Nov 2011
Top Mistakes When Writing Reactive Applications - Scala by the Bay 2016
MessageBus vs MessageBus
Time Manager Workshop at #QGIS2015 Conference in Nodebo
User Focused Security at Netflix: Stethoscope
forward and backward chaining
Taking the friction out of microservice frameworks with Lagom
Stay productive while slicing up the monolith
Modernizing Applications with Microservices
2016 Tableau in the Cloud - A Netflix Original (AWS Re:invent)
The state of sbt 0.13, sbt server, and sbt 1.0 (ScalaMatsuri ver)
Hyperscale Computing, Enterprise Agility with Mesosphere
Ad

Similar to The return of big iron? (20)

PDF
Database Revolution - Exploratory Webcast
PDF
Database revolution opening webcast 01 18-12
PPT
NO SQL: What, Why, How
PPTX
Big Data Platforms: An Overview
PPTX
Big Data 2107 for Ribbon
PDF
One Size Doesn't Fit All: The New Database Revolution
PPTX
Big Data NoSQL 1017
PPTX
PPTX
Building an Effective Data Warehouse Architecture
KEY
Lessons from lhc
PDF
A beginners guide to Cloudera Hadoop
PPTX
Designing for the Cloud Tutorial - QCon SF 2009
PDF
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
PPTX
Domain oriented development
PDF
Relational Database Stockholm Syndrome (Neal Murray, 6 Point 6) London 2019 C...
ODP
Data massage: How databases have been scaled from one to one million nodes
PPTX
SQL and NoSQL in SQL Server
PPTX
مقدمة عن NoSQL بالعربي
PDF
Simplifying Cloud Architectures with Data Virtualization
PPTX
JasperWorld 2012: Reinventing Data Management by Max Schireson
Database Revolution - Exploratory Webcast
Database revolution opening webcast 01 18-12
NO SQL: What, Why, How
Big Data Platforms: An Overview
Big Data 2107 for Ribbon
One Size Doesn't Fit All: The New Database Revolution
Big Data NoSQL 1017
Building an Effective Data Warehouse Architecture
Lessons from lhc
A beginners guide to Cloudera Hadoop
Designing for the Cloud Tutorial - QCon SF 2009
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Domain oriented development
Relational Database Stockholm Syndrome (Neal Murray, 6 Point 6) London 2019 C...
Data massage: How databases have been scaled from one to one million nodes
SQL and NoSQL in SQL Server
مقدمة عن NoSQL بالعربي
Simplifying Cloud Architectures with Data Virtualization
JasperWorld 2012: Reinventing Data Management by Max Schireson

More from Ben Stopford (16)

PPTX
10 Principals for Effective Event-Driven Microservices with Apache Kafka
PPTX
10 Principals for Effective Event Driven Microservices
PDF
The Future of Streaming: Global Apps, Event Stores and Serverless
PDF
A Global Source of Truth for the Microservices Generation
PDF
Building Event Driven Services with Kafka Streams
PDF
NDC London 2017 - The Data Dichotomy- Rethinking Data and Services with Streams
PDF
Building Event Driven Services with Apache Kafka and Kafka Streams - Devoxx B...
PDF
Building Event Driven Services with Stateful Streams
PDF
Devoxx London 2017 - Rethinking Services With Stateful Streams
PDF
Event Driven Services Part 2: Building Event-Driven Services with Apache Kafka
PDF
Event Driven Services Part 1: The Data Dichotomy
PDF
Event Driven Services Part 3: Putting the Micro into Microservices with State...
PPTX
Advanced databases ben stopford
PDF
A Paradigm Shift: The Increasing Dominance of Memory-Oriented Solutions for H...
PDF
Data Grids with Oracle Coherence
PPT
Architecting for Change: An Agile Approach
10 Principals for Effective Event-Driven Microservices with Apache Kafka
10 Principals for Effective Event Driven Microservices
The Future of Streaming: Global Apps, Event Stores and Serverless
A Global Source of Truth for the Microservices Generation
Building Event Driven Services with Kafka Streams
NDC London 2017 - The Data Dichotomy- Rethinking Data and Services with Streams
Building Event Driven Services with Apache Kafka and Kafka Streams - Devoxx B...
Building Event Driven Services with Stateful Streams
Devoxx London 2017 - Rethinking Services With Stateful Streams
Event Driven Services Part 2: Building Event-Driven Services with Apache Kafka
Event Driven Services Part 1: The Data Dichotomy
Event Driven Services Part 3: Putting the Micro into Microservices with State...
Advanced databases ben stopford
A Paradigm Shift: The Increasing Dominance of Memory-Oriented Solutions for H...
Data Grids with Oracle Coherence
Architecting for Change: An Agile Approach

The return of big iron?

  • 2. Remember when it looked like this. They were all pretty much alike?
  • 3. It used to be easy…
  • 6. But it’s also quite exciting!
  • 7. We Lose: Joe Hellerstein (Berkeley) 2001 “Databases are commoditised and cornered to slow-moving, evolving, structure intensive, applications that require schema evolution.“ … “The internet companies are lost and we will remain in the doldrums of the enterprise space.” … “As databases are black boxes which require a lot of coaxing to get maximum performance”
  • 10. With a lot of users.
  • 11. we changed scale
  • 12. tack ge d we ch an
  • 13. New Approach to Data Access Simple Pragmatic Solved an insoluble problem Unencumbered by tradition (good & bad)
  • 14. With this came a Different Focus Tradition No SQL •  Global consistency •  Local consistency •  Schema driven •  Schemaless •  Reliable Network •  Unreliable Network •  Highly Structured •  Semi-structured/ Unstructured NoSQL / Big Data technologies really focus on load and volume problems by avoiding the complexities associated with traditional transactional storage
  • 15. The ‘Relational Camp’ had been busy too Realisation that the traditional architecture was insufficient for various modern workloads
  • 16. End of an Era Paper - 2007 “Because RDBMSs can be beaten by more than an order of magnitude on the standard OLTP benchmark, then there is no market where they are competitive. As such, they should be considered as legacy technology more than a quarter of a century in age, for which a complete redesign and re-architecting is the appropriate next step.” – Michael Stonebraker
  • 17. No Longer a One-Size-Fits-All
  • 18. There is a new and impressive breed •  Products ~ 5 years old •  Shared nothing (sharded) •  Designed for SSD’s & 10GE •  Large address spaces (256GB+) •  No indexes (column oriented) •  Dropping traditional tenets (referential integrity etc) •  Surprisingly quick for big queries when compared with incumbent technologies.
  • 19. Both types of solution have clear value
  • 20. ..and it’s not really a question of size TB 0 1 10 100 1000 10,000 The majority of us live in the overlap region
  • 21. More a question of utility … which this tends to lead to composite offerings
  • 23. So what does this mean for the enterprise?
  • 24. 80% Enterprise Databases < 1TB This Reference is getting pretty old now, sorry (2009)
  • 25. Yet we often have a lot of them
  • 26. Communication is Store & Forward The outside world
  • 27. Sometimes we’re a bit more organized!
  • 28. But most of our data is not that accessible Core Operational Exposed Data
  • 29. …and sharing is often an afterthought Core Operational Exposed Data
  • 31. But as data is getting bigger and heavier
  • 32. ..it can make it hard to join data together
  • 33. So we often we turn to some form of Enterprise Data Warehouse (or maybe data virtualization)
  • 34. Big data tech sometimes provides a composite solution (or ETL)
  • 35. Ability to model data is much more of a gating factor than raw size Dave Campbell (Microsoft – VLDB Keynote 2012)
  • 36. Importing data into a standard model is a slow and painful processs
  • 37. An alternative is to use a Late Bound Schema
  • 38. Combining structured & unstructured approaches in a layered fashion makes the process more nimble Structured Late Bound Standardisation Schema Layer Raw Data
  • 39. We take this kind of approach •  Grid of machines •  Late bound schema •  Sharded, immutable data •  Low latency (real time) and high throughput (grid) use cases •  All data is observable (an event) •  Interfaces: Standardised (safe) or Raw (uayor)
  • 40. Both Raw & Standardised data is available Operational Relational (real time / MR) Analytics Object/SQL Standardisation Raw Data
  • 41. This helps to loosen the grip of the single schema, whilst also providing a more iterative approach to standardisation
  • 42. Support for both one standardised and many bespoke models in the same technology Raw Facts from different systems Standardised Model
  • 43. Next step: to centralise common processing tasks Standardised Risk Model Calculation
  • 44. Are we back to the mainframe?