SlideShare a Scribd company logo
Hands-on Introduction
& Hackathon Kickoff
Ashvin Agrawal William Markito
@william_markito@aasoj
Powered by
Pivotal Open Source Hub (POSH)
(incubating)
• Hackathon Details
• Apache Geode Introduction
• History
• Key features and components
• Roadmap
• Hands-on lab
• Build & run
• Starting a cluster
• Using docker for clustering
• Your first app
• Q&A
2
Agenda
Hackathon details
Powered by
Pivotal Open Source Hub (POSH)
http://ambitious-apps.challengepost.com/
4
Introduction
A distributed, memory-based data management platform for data
oriented apps that need:
• high performance, scalability, resiliency and continuous
availability
• fast access to critical data set
• location aware distributed data processing
• event driven data architecture
5
Introduction
6
One size fits all ?
Cost of sorting is nlog(n)
7
One size fits all ?
Cost of sorting is nlog(n)
• Data quality and quantity differences
• Eventual consistency
• Response time expectation
• Scalability challenges: disk, memory, network and
external systems
• 1000+ systems in production (real customers)
• Cutting edge use cases
8
Incubating… but rock solid
• 17 billion records in memory
• GE Power & Water's Remote Monitoring & Diagnostics Center
• 3 TB operational data in-memory, 400 TB archived
• China Railways
• 4.6 Million transactions a day / 40K transactions a second
• China Railways
9
Incubating… but rock solid
• Performance optimized persistence
• Configurable consistency
• Elastic capacity
• Latency minimizing distribution
• Heterogenous deployment
Designed for High Performance
10
+/-
L2 ~10 ns, memory ~100 ns, network <1ms, disk ~10ms
• Cache
• Region
• Member
• Client Cache
• Functions
• Listeners
11
Concepts
• Cache
• In-memory storage and management
for your data
• Configurable through XML, Spring,
Java API or CLI
• Collection of Region
12
Concepts
• Region
• Distributed java.util.Map on steroids
(Key/Value)
• Consistent API regardless of where or
how data is stored
• Observable (reactive)
• Highly available, redundant on cache
Member (s).
13
Concepts
• Region
• Local, Replicated or Partitioned
• In-memory or persistent
• Redundant
• LRU
• Overflow
14
Concepts
LOCAL
LOCAL_HEAP_LRU
LOCAL_OVERFLOW
LOCAL_PERSISTENT
LOCAL_PERSISTENT_OVERFLOW
PARTITION
PARTITION_HEAP_LRU
PARTITION_OVERFLOW
PARTITION_PERSISTENT
PARTITION_PERSISTENT_OVERFLOW
PARTITION_PROXY
PARTITION_PROXY_REDUNDANT
PARTITION_REDUNDANT
PARTITION_REDUNDANT_HEAP_LRU
PARTITION_REDUNDANT_OVERFLOW
PARTITION_REDUNDANT_PERSISTENT
PARTITION_REDUNDANT_PERSISTENT_OVERFLOW
REPLICATE
REPLICATE_HEAP_LRU
REPLICATE_OVERFLOW
REPLICATE_PERSISTENT
REPLICATE_PERSISTENT_OVERFLOW
REPLICATE_PROXY
• Persistent Regions
• Durability
• WAL for efficient writing
• Consistent recovery
• Compaction
15
Concepts
Server 1 Server N
• Member
• A process that has a connection to the
system
• A process that has created a cache
• Embeddable within your application
16
Concepts
Client
Locator
Server
• Client cache
• A process connected to the Geode
server(s)
• Can have a local copy of the data
• Can be notified about events on the
servers
17
Concepts
• Functions
• Used for distributed concurrent processing
(Map/Reduce, stored procedure)
• Highly available
• Data oriented
• Member oriented
18
Concepts
19
Concepts
• Functions
• Listeners
• CacheWriter / CacheListener
• AsyncEventListener (queue / batch)
• Parallel or Serial
• Conflation
20
Concepts
Hands on
• Clone & Build
22
Hands-on: Build & run
git clone https://github.com/apache/incubator-geode
cd incubator-geode
./gradlew build -Dskip.tests=true
• Start a server
cd gemfire-assembly/build/install/apache-geode
./bin/gfsh
gfsh> start locator --name=locator
gfsh> start server --name=server
gfsh> create region --name=myRegion --type=REPLICATE
23
Hands-on: Docker
&
• Containers
• FreeBSD Jails (2000)
• Solaris Zones (2004)
• Docker (2013)
• Operating system level virtualization
• Isolated user space instances
24
* https://linuxcontainers.org/
Hands-on: Docker
25
Container vs VM
pervisor abstracts the entire device, containers just abstract the operating
Hands-on: Docker & Compose
26
• Single instance
docker run -it apachegeode/geode:nightly gfsh
• Cluster
docker-compose up
• Scale
docker-compose scale server=3
Hands-on: Application
27
• Teeny URL
• Fast response time
• Statistics
• Hits
• User agent ?
• IPs ?
• URL will last for 5 minutes
• Distribute data & load
• Highly scalable
createURL
getURL
stats
• HDFS Persistence
• Off-heap memory storage
• Lucene Search
• Spark Integration
• Cloud Foundry service
28
Roadmap
• Code
• New features
• Bug fixes
• Writing tests
• Documentation
• Wiki
• Web site
• User guide
29
How to Contribute
• Community
• Join the mailing list
• Ask or answer
• Join our HipChat
• Become a speaker
• Finding bugs
• Testing an RC/Beta
• JIRA
https://issues.apache.org/jira/browse/GEODE
• Wiki
cwiki.apache.org/confluence/display/GEODE
• GitHub
https://github.com/apache/incubator-geode
• Mailing lists
mail-archives.apache.org/mod_mbox/incubator-geode-dev/
30
Links
31
Thank you
http://geode.incubator.apache.org
https://github.com/Pivotal-Open-Source-Hub

More Related Content

PPTX
Getting Started with Apache Geode
PPTX
Spring Data and In-Memory Data Management in Action
PPTX
Apache Geode: an efficient alternative to Kafka-Storm-Spark for Data Analytic
PPTX
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
PDF
What's New in RTView Core 6.5
PDF
A Series of Fortunate Events: Building an Operator in Java
KEY
What Big Data Folks Need to Know About DevOps
PDF
Pivotal CenturyLink Cloud Platform Seminar Presentation: The Developer Experi...
Getting Started with Apache Geode
Spring Data and In-Memory Data Management in Action
Apache Geode: an efficient alternative to Kafka-Storm-Spark for Data Analytic
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
What's New in RTView Core 6.5
A Series of Fortunate Events: Building an Operator in Java
What Big Data Folks Need to Know About DevOps
Pivotal CenturyLink Cloud Platform Seminar Presentation: The Developer Experi...

What's hot (20)

PDF
Staying Ahead of the Curve with Spring and Cassandra 4 (SpringOne 2020)
PDF
StarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
PDF
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
PDF
Build bundles in the cloud - How Cloudyle PaaS+ helps creating OSGi applicati...
PPTX
Why kubernetes matters
PDF
WTF Do We Need a Service Mesh?
PDF
Pivotal CenturyLink Cloud Platform Seminar Presentations: Architecture & Oper...
PDF
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
PPTX
2015 UJUG, Servlet 4.0 portion
PPTX
Replication - Nick Carboni - ManageIQ Design Summit 2016
PPTX
Simplifying Apache Geode with Spring Data
PDF
Kubernetes Multi-cluster without Federation - Kubecon EU 2018
PPTX
Kubernetes: від знайомства до використання у CI/CD
PDF
The Time is Now: Migrating from Oracle Warehouse Builder to Oracle Data Integ...
PPTX
Kubernetes 1.16 and rancher 2.3 enhancements
PDF
Short Lived Tasks in Cloud Foundry #cfdtokyo
PDF
Automating OpenStack Deployment with Fuel
PDF
Project Zen: Improving Apache Spark for Python Users
PDF
Terraform Code Reviews: Supercharged with Conftest
PDF
ОЛЕКСАНДР ЛИПКО «Graceful Shutdown Node.js + k8s» Online WDDay 2021
Staying Ahead of the Curve with Spring and Cassandra 4 (SpringOne 2020)
StarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
The Analytic Platform behind IBM’s Watson Data Platform - Big Data Spain 2017
Build bundles in the cloud - How Cloudyle PaaS+ helps creating OSGi applicati...
Why kubernetes matters
WTF Do We Need a Service Mesh?
Pivotal CenturyLink Cloud Platform Seminar Presentations: Architecture & Oper...
AWS Summit Singapore 2019 | Latest Trends for Cloud-Native Application Develo...
2015 UJUG, Servlet 4.0 portion
Replication - Nick Carboni - ManageIQ Design Summit 2016
Simplifying Apache Geode with Spring Data
Kubernetes Multi-cluster without Federation - Kubecon EU 2018
Kubernetes: від знайомства до використання у CI/CD
The Time is Now: Migrating from Oracle Warehouse Builder to Oracle Data Integ...
Kubernetes 1.16 and rancher 2.3 enhancements
Short Lived Tasks in Cloud Foundry #cfdtokyo
Automating OpenStack Deployment with Fuel
Project Zen: Improving Apache Spark for Python Users
Terraform Code Reviews: Supercharged with Conftest
ОЛЕКСАНДР ЛИПКО «Graceful Shutdown Node.js + k8s» Online WDDay 2021
Ad

Viewers also liked (7)

PDF
Geode on Docker
PDF
How to Contribute to Apache Geode
PDF
ApacheCon 2015 - A Stock Prediction System Using OSS
PPTX
Building a Stock Prediction system with Machine Learning using Geode, SpringX...
PDF
Microservice Architecuture with Event Sourcing @ Sydney JVM Meetup
PDF
Implementing a highly scalable stock prediction system with R, Geode, SpringX...
PDF
Apache Geode Meetup, Cork, Ireland at CIT
Geode on Docker
How to Contribute to Apache Geode
ApacheCon 2015 - A Stock Prediction System Using OSS
Building a Stock Prediction system with Machine Learning using Geode, SpringX...
Microservice Architecuture with Event Sourcing @ Sydney JVM Meetup
Implementing a highly scalable stock prediction system with R, Geode, SpringX...
Apache Geode Meetup, Cork, Ireland at CIT
Ad

Similar to Apache Geode (incubating) Introduction with Docker (20)

PPTX
ApexMeetup Geode - Talk1 2016-03-17
PDF
Pivotal's effort on Apache Geode
PDF
Slides for the Apache Geode Hands-on Meetup and Hackathon Announcement
PPTX
Coherence RoadMap 2018
PDF
Introduction to Apache Geode (Cork, Ireland)
PPTX
Geode introduction
PDF
MySQL & Oracle Linux Keynote at Open Source India 2014
PDF
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
PPTX
Introduction to High Availability with SQL Server
PPTX
Change management in hybrid landscapes
PPTX
Geode Meetup Apachecon
PDF
Escalando Foursquare basado en Checkins y Recomendaciones
PDF
Apache Geode Meetup, London
PDF
Chill, Distill, No Overkill: Best Practices to Stress Test Kafka with Siva Ku...
PPTX
Open Sourcing GemFire - Apache Geode
PPTX
An Introduction to Apache Geode (incubating)
PDF
ECS19 - Patrick Curran, Eric Shupps - SHAREPOINT 24X7X365: ARCHITECTING FOR H...
PDF
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
PPTX
Intro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming
ApexMeetup Geode - Talk1 2016-03-17
Pivotal's effort on Apache Geode
Slides for the Apache Geode Hands-on Meetup and Hackathon Announcement
Coherence RoadMap 2018
Introduction to Apache Geode (Cork, Ireland)
Geode introduction
MySQL & Oracle Linux Keynote at Open Source India 2014
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Introduction to High Availability with SQL Server
Change management in hybrid landscapes
Geode Meetup Apachecon
Escalando Foursquare basado en Checkins y Recomendaciones
Apache Geode Meetup, London
Chill, Distill, No Overkill: Best Practices to Stress Test Kafka with Siva Ku...
Open Sourcing GemFire - Apache Geode
An Introduction to Apache Geode (incubating)
ECS19 - Patrick Curran, Eric Shupps - SHAREPOINT 24X7X365: ARCHITECTING FOR H...
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Intro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming

Recently uploaded (20)

PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Digital Systems & Binary Numbers (comprehensive )
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
System and Network Administraation Chapter 3
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
Operating system designcfffgfgggggggvggggggggg
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
Introduction to Artificial Intelligence
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
PPTX
history of c programming in notes for students .pptx
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PPTX
L1 - Introduction to python Backend.pptx
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Digital Systems & Binary Numbers (comprehensive )
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
System and Network Administraation Chapter 3
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Upgrade and Innovation Strategies for SAP ERP Customers
Operating system designcfffgfgggggggvggggggggg
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Introduction to Artificial Intelligence
Navsoft: AI-Powered Business Solutions & Custom Software Development
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
How to Choose the Right IT Partner for Your Business in Malaysia
Wondershare Filmora 15 Crack With Activation Key [2025
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
history of c programming in notes for students .pptx
wealthsignaloriginal-com-DS-text-... (1).pdf
L1 - Introduction to python Backend.pptx
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus

Apache Geode (incubating) Introduction with Docker

  • 1. Hands-on Introduction & Hackathon Kickoff Ashvin Agrawal William Markito @william_markito@aasoj Powered by Pivotal Open Source Hub (POSH) (incubating)
  • 2. • Hackathon Details • Apache Geode Introduction • History • Key features and components • Roadmap • Hands-on lab • Build & run • Starting a cluster • Using docker for clustering • Your first app • Q&A 2 Agenda
  • 3. Hackathon details Powered by Pivotal Open Source Hub (POSH) http://ambitious-apps.challengepost.com/
  • 5. A distributed, memory-based data management platform for data oriented apps that need: • high performance, scalability, resiliency and continuous availability • fast access to critical data set • location aware distributed data processing • event driven data architecture 5 Introduction
  • 6. 6 One size fits all ? Cost of sorting is nlog(n)
  • 7. 7 One size fits all ? Cost of sorting is nlog(n) • Data quality and quantity differences • Eventual consistency • Response time expectation • Scalability challenges: disk, memory, network and external systems
  • 8. • 1000+ systems in production (real customers) • Cutting edge use cases 8 Incubating… but rock solid
  • 9. • 17 billion records in memory • GE Power & Water's Remote Monitoring & Diagnostics Center • 3 TB operational data in-memory, 400 TB archived • China Railways • 4.6 Million transactions a day / 40K transactions a second • China Railways 9 Incubating… but rock solid
  • 10. • Performance optimized persistence • Configurable consistency • Elastic capacity • Latency minimizing distribution • Heterogenous deployment Designed for High Performance 10 +/- L2 ~10 ns, memory ~100 ns, network <1ms, disk ~10ms
  • 11. • Cache • Region • Member • Client Cache • Functions • Listeners 11 Concepts
  • 12. • Cache • In-memory storage and management for your data • Configurable through XML, Spring, Java API or CLI • Collection of Region 12 Concepts
  • 13. • Region • Distributed java.util.Map on steroids (Key/Value) • Consistent API regardless of where or how data is stored • Observable (reactive) • Highly available, redundant on cache Member (s). 13 Concepts
  • 14. • Region • Local, Replicated or Partitioned • In-memory or persistent • Redundant • LRU • Overflow 14 Concepts LOCAL LOCAL_HEAP_LRU LOCAL_OVERFLOW LOCAL_PERSISTENT LOCAL_PERSISTENT_OVERFLOW PARTITION PARTITION_HEAP_LRU PARTITION_OVERFLOW PARTITION_PERSISTENT PARTITION_PERSISTENT_OVERFLOW PARTITION_PROXY PARTITION_PROXY_REDUNDANT PARTITION_REDUNDANT PARTITION_REDUNDANT_HEAP_LRU PARTITION_REDUNDANT_OVERFLOW PARTITION_REDUNDANT_PERSISTENT PARTITION_REDUNDANT_PERSISTENT_OVERFLOW REPLICATE REPLICATE_HEAP_LRU REPLICATE_OVERFLOW REPLICATE_PERSISTENT REPLICATE_PERSISTENT_OVERFLOW REPLICATE_PROXY
  • 15. • Persistent Regions • Durability • WAL for efficient writing • Consistent recovery • Compaction 15 Concepts Server 1 Server N
  • 16. • Member • A process that has a connection to the system • A process that has created a cache • Embeddable within your application 16 Concepts Client Locator Server
  • 17. • Client cache • A process connected to the Geode server(s) • Can have a local copy of the data • Can be notified about events on the servers 17 Concepts
  • 18. • Functions • Used for distributed concurrent processing (Map/Reduce, stored procedure) • Highly available • Data oriented • Member oriented 18 Concepts
  • 20. • Listeners • CacheWriter / CacheListener • AsyncEventListener (queue / batch) • Parallel or Serial • Conflation 20 Concepts
  • 22. • Clone & Build 22 Hands-on: Build & run git clone https://github.com/apache/incubator-geode cd incubator-geode ./gradlew build -Dskip.tests=true • Start a server cd gemfire-assembly/build/install/apache-geode ./bin/gfsh gfsh> start locator --name=locator gfsh> start server --name=server gfsh> create region --name=myRegion --type=REPLICATE
  • 24. • Containers • FreeBSD Jails (2000) • Solaris Zones (2004) • Docker (2013) • Operating system level virtualization • Isolated user space instances 24 * https://linuxcontainers.org/ Hands-on: Docker
  • 25. 25 Container vs VM pervisor abstracts the entire device, containers just abstract the operating
  • 26. Hands-on: Docker & Compose 26 • Single instance docker run -it apachegeode/geode:nightly gfsh • Cluster docker-compose up • Scale docker-compose scale server=3
  • 27. Hands-on: Application 27 • Teeny URL • Fast response time • Statistics • Hits • User agent ? • IPs ? • URL will last for 5 minutes • Distribute data & load • Highly scalable createURL getURL stats
  • 28. • HDFS Persistence • Off-heap memory storage • Lucene Search • Spark Integration • Cloud Foundry service 28 Roadmap
  • 29. • Code • New features • Bug fixes • Writing tests • Documentation • Wiki • Web site • User guide 29 How to Contribute • Community • Join the mailing list • Ask or answer • Join our HipChat • Become a speaker • Finding bugs • Testing an RC/Beta
  • 30. • JIRA https://issues.apache.org/jira/browse/GEODE • Wiki cwiki.apache.org/confluence/display/GEODE • GitHub https://github.com/apache/incubator-geode • Mailing lists mail-archives.apache.org/mod_mbox/incubator-geode-dev/ 30 Links