SlideShare a Scribd company logo
Scalable Object Storage with
Apache CloudStack and Apache
Hadoop
April 30 2013
Chiradeep Vittal
@chiradeep
Agenda
โ€ข What is CloudStack
โ€ข Object Storage for IAAS
โ€ข Current Architecture and Limitations
โ€ข Requirements for Object Storage
โ€ข Object Storage integrations in CloudStack
โ€ข HDFS for Object Storage
โ€ข Future directions
โ€ข History
โ€ข Incubating in the Apache
Software Foundation since
April 2012
โ€ข Open Source since May
2010
โ€ข In production since 2009
โ€“ Turnkey platform for delivering
IaaS clouds
โ€“ Full featured GUI, end-user API
and admin API
Apache CloudStack
Build your cloud the way the
worldโ€™s most successful
clouds are built
How did Amazon build its cloud?
Commodity
Servers
Commodity
Storage
Networking
Open Source Xen Hypervisor
Amazon Orchestration Software
AWS API (EC2, S3, โ€ฆ)
Amazon eCommerce Platform
How can YOU build a cloud?
Servers StorageNetworking
Open Source Xen Hypervisor
Amazon Orchestration Software
AWS API (EC2, S3, โ€ฆ)
Amazon eCommerce Platform
Hypervisor (Xen/KVM/VMW/)
CloudStack Orchestration Software
Optional Portal
CloudStack or AWS API
Secondary Storage
Image
L3/L2 core
DC Edge
End users
Pod Pod Pod Pod
Zone Architecture
Pod
Access Sw
MySQL
CloudStack
Admin/User API
Primary Storage
NFS/ISCSI/FC
Hypervisor (Xen
/VMWare/KVM)
VM
VM
Snapshot
Snapshot
Image
Disk Disk
VM
Cloud-Style Workloads
โ€ข Low cost
โ€“ Standardized, cookie cutter infrastructure
โ€“ Highly automated and efficient
โ€ข Application owns availability
โ€“ At scale everything breaks
โ€“ Focus on MTTR instead of MTBF
Secondary Storage
Image
L3/L2 core
DC Edge
Pod Pod Pod Pod
At scaleโ€ฆeverything breaks
Pod
Access Sw
Primary Storage
NFS/ISCSI/FC
Hypervisor (Xen
/VMWare/KVM)
VM
VM
Snapshot
Snapshot
Image
Disk Disk
VM
Region โ€œWestโ€
Zone โ€œWest-Alphaโ€
Zone โ€œWest-Betaโ€
Zone โ€œWest-Gammaโ€
Zone โ€œWest-Deltaโ€
Low Latency Backbone
(e.g., SONET ring)
Regions and zones
Region โ€œEastโ€
Region โ€œSouthโ€
Internet
Geographic
separation
Region โ€œWestโ€
Low Latency
Secondary Storage in CloudStack 4.0
โ€ข NFS server default
โ€“ can be mounted by hypervisor
โ€“ Easy to obtain, set up and operate
โ€ข Problems with NFS:
โ€“ Scale: max limits of file systems
โ€ข Solution: CloudStack can manage multiple NFS stores (+
complexity)
โ€“ Performance
โ€ข N hypervisors : 1 storage CPU / 1 network link
โ€“ Wide area suitability for cross-region storage
โ€ข Chatty protocol
โ€“ Lack of replication
Object Storage Technology
Region โ€œWestโ€
Zone โ€œWest-Alphaโ€
Zone โ€œWest-Betaโ€
Zone โ€œWest-Gammaโ€
Zone โ€œWest-Deltaโ€
Object Storage in a region
โ€ข Replication
โ€ข Audit
โ€ข Repair
โ€ข Maintenance
Region โ€œWestโ€
Object Storage enables reliability
Object Storage Technology
Region โ€œWestโ€
Object Storage also enables other
applications
Object Store
API Servers
โ€ข DropBox
โ€ข Static Content
โ€ข Archival
Object Storage characteristics
โ€ข Highly reliable and durable
โ€“ 99.9 % availability for AWS S3
โ€“ 99.999999999 % durability
โ€ข Massive scale
โ€“ 1.3 trillion objects stored across 7 AWS regions [Nov 2012 figures]
โ€“ Throughput: 830,000 requests per second
โ€ข Immutable objects
โ€“ Objects cannot be modified, only deleted
โ€ข Simple API
โ€“ PUT/POST objects, GET objects, DELETE objects
โ€“ No seek / no mutation / no POSIX API
โ€ข Flat namespace
โ€“ Everything stored in buckets.
โ€“ Bucket names are unique
โ€“ Buckets can only contain objects, not other buckets
โ€ข Cheap and getting cheaper
CloudStack S3 API Server
Object Storage Technology
S3
API Servers
MySQL
CloudStack S3 API Server
โ€ข Understands AWS S3 REST-style and SOAP API
โ€ข Pluggable backend
โ€“ Backend storage needs to map simple calls to their
API
โ€ข E.g., createContainer, saveObject, loadObject
โ€“ Default backend is a POSIX filesystem
โ€“ Backend with Caringo Object Store (commercial
vendor) available
โ€“ HDFS backend also available
โ€ข MySQL storage
โ€“ Bucket -> object mapping
โ€“ ACLs, bucket policies
Object Store Integration into
CloudStack
โ€ข For images and snapshots
โ€ข Replacement for NFS secondary storage
Or
Augmentation for NFS secondary storage
โ€ข Integrations available with
โ€“ Riak CS
โ€“ Openstack Swift
โ€ข New in 4.2 (upcoming):
โ€“ Framework for integrating storage providers
What do we want to build ?
โ€ข Open source, ASL licensed object storage
โ€ข Scales to at least 1 billion objects
โ€ข Reliability and durability on par with S3
โ€ข S3 API (or similar, e.g., Google Storage)
โ€ข Tooling around maintenance and
operation, specific to object storage
The following slides are a design
discussion
Architecture of Scalable Object
Storage
API Servers
Auth Servers
Object Servers Replicators/Auditors
Object
Lookup
Servers
Why HDFS
โ€ข ASF Project (Apache Hadoop)
โ€ข Immutable objects, replication
โ€ข Reliability, scale and performance
โ€“ 200 million objects in 1 cluster [Facebook]
โ€“ 100 PB in 1 cluster [Facebook]
โ€ข Simple operation
โ€“ Just add data nodes
HDFS-based Object Storage
S3 API Servers
S3 Auth Servers
Data nodes
Namenode
pair
HDFS API
BUT
โ€ข Name Node Scalability
โ€“ 150 bytes RAM / block
โ€“ GC issues
โ€ข Name Node SPOF
โ€“ Being addressed in the communityโœ”
โ€ข Cross-zone replication
โ€“ Rack-awareness placement โœ”
โ€“ What if the zones are spread a little further apart?
โ€ข Storage for object metadata
โ€“ ACLs, policies, timers
Name Node scalability
โ€ข 1 billion objects = 3 billion blocks (chunks)
โ€“ Average of 5 MB/object = 5 PB (actual), 15
PB (raw)
โ€“ 450 GB of RAM per Name Node
โ€ข 150b x 3 x 10^9
โ€“ 16 TB / node => 1000 Data nodes
โ€ข Requires Name Node federation ?
โ€ข Or an approach like HAR files
Name Node Federation
Extension: Federated NameNodes are HA pairs
Federation issues
โ€ข HA for name nodes
โ€ข Namespace shards
โ€“ Map object -> name node
โ€ข Requires another scalable key-value store
โ€“ HBase?
โ€ข Rebalancing between name nodes
Replication over lossy/slower links
A. Asynchronous replication
โ€“ Use distcp to replicate between clusters
โ€“ 6 copies vs. 3
โ€“ Master/Slave relationship
โ€ข Possibility of loss of data during failover
โ€ข Need coordination logic outside of HDFS
B. Synchronous replication
โ€“ API server writes to 2 clusters and acks only
when both writes are successful
โ€“ Availability compromised when one zone is
down
CAP Theorem
Consistency or Availability during partition
Many nuances
Storage for object metadata
A. Store it in HDFS along with the object
โ€“ Reads are expensive (e.g., to check ACL)
โ€“ Mutable data, needs layer over HDFS
B. Use another storage system (e.g. HBase)
โ€“ Name node federation also requires this.
C. Modify Name Node to store metadata
โ€“ High performance
โ€“ Not extensible
Object store on HDFS Future
โ€ข Viable for small-sized deployments
โ€“ Up to 100-200 million objects
โ€“ Datacenters close together
โ€ข Larger deployments needs development
โ€“ No effort ongoing at this time
Conclusion
โ€ข CloudStack needs object storage for
โ€œcloud-styleโ€ workloads
โ€ข Object Storage is not easy
โ€ข HDFS comes close but not close enough
โ€ข Join the community!
Ad

Recommended

Netflix running Presto in the AWS Cloud
Netflix running Presto in the AWS Cloud
Zhenxiao Luo
ย 
Scaling Traffic from 0 to 139 Million Unique Visitors
Scaling Traffic from 0 to 139 Million Unique Visitors
Yelp Engineering
ย 
Building a unified data pipeline in Apache Spark
Building a unified data pipeline in Apache Spark
DataWorks Summit
ย 
MySQL in the Hosted Cloud
MySQL in the Hosted Cloud
Colin Charles
ย 
Data warehouse solutions
Data warehouse solutions
Tu Pham
ย 
Amazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni Vamvadelis
huguk
ย 
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Joydeep Sen Sarma
ย 
HBaseConAsia2018 Track2-1: Kerberos-based Big Data Security Solution and Prac...
HBaseConAsia2018 Track2-1: Kerberos-based Big Data Security Solution and Prac...
Michael Stack
ย 
Scaling Pinterest
Scaling Pinterest
C4Media
ย 
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
Michael Stack
ย 
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
Joydeep Sen Sarma
ย 
Amazon Athena Hands-On Workshop
Amazon Athena Hands-On Workshop
DoiT International
ย 
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
Michael Stack
ย 
An overview of Amazon Athena
An overview of Amazon Athena
Julien SIMON
ย 
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
ย 
Getting Started with EC2, S3 and EMR
Getting Started with EC2, S3 and EMR
Arun Sirimalla
ย 
Oracle Databases on AWS - Getting the Best Out of RDS and EC2
Oracle Databases on AWS - Getting the Best Out of RDS and EC2
Maris Elsins
ย 
Hadoop @ eBay: Past, Present, and Future
Hadoop @ eBay: Past, Present, and Future
Ryan Hennig
ย 
Autoscaling Spark on AWS EC2 - 11th Spark London meetup
Autoscaling Spark on AWS EC2 - 11th Spark London meetup
Rafal Kwasny
ย 
CosmosDB for DBAs & Developers
CosmosDB for DBAs & Developers
Niko Neugebauer
ย 
Presto Fast SQL on Anything
Presto Fast SQL on Anything
Alluxio, Inc.
ย 
CloudStack Object Storage Framework & Demo
CloudStack Object Storage Framework & Demo
ShapeBlue
ย 
OpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
Hostway|HOSTING
ย 
Storage as a service OpenStack
Storage as a service OpenStack
openstackindia
ย 
Storing your data in the cloud: doing right reversim 2018
Storing your data in the cloud: doing right reversim 2018
Orit Wasserman
ย 
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
ย 
Apache CloudStack from API to UI
Apache CloudStack from API to UI
CloudStack - Open Source Cloud Computing Project
ย 
Hacking apache cloud stack
Hacking apache cloud stack
Nitin Mehta
ย 
Cloud storage: the right way OSS EU 2018
Cloud storage: the right way OSS EU 2018
Orit Wasserman
ย 
Initial presentation of swift (for montreal user group)
Initial presentation of swift (for montreal user group)
Marcos Garcรญa
ย 

More Related Content

What's hot (13)

Scaling Pinterest
Scaling Pinterest
C4Media
ย 
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
Michael Stack
ย 
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
Joydeep Sen Sarma
ย 
Amazon Athena Hands-On Workshop
Amazon Athena Hands-On Workshop
DoiT International
ย 
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
Michael Stack
ย 
An overview of Amazon Athena
An overview of Amazon Athena
Julien SIMON
ย 
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
ย 
Getting Started with EC2, S3 and EMR
Getting Started with EC2, S3 and EMR
Arun Sirimalla
ย 
Oracle Databases on AWS - Getting the Best Out of RDS and EC2
Oracle Databases on AWS - Getting the Best Out of RDS and EC2
Maris Elsins
ย 
Hadoop @ eBay: Past, Present, and Future
Hadoop @ eBay: Past, Present, and Future
Ryan Hennig
ย 
Autoscaling Spark on AWS EC2 - 11th Spark London meetup
Autoscaling Spark on AWS EC2 - 11th Spark London meetup
Rafal Kwasny
ย 
CosmosDB for DBAs & Developers
CosmosDB for DBAs & Developers
Niko Neugebauer
ย 
Presto Fast SQL on Anything
Presto Fast SQL on Anything
Alluxio, Inc.
ย 
Scaling Pinterest
Scaling Pinterest
C4Media
ย 
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
Michael Stack
ย 
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
Joydeep Sen Sarma
ย 
Amazon Athena Hands-On Workshop
Amazon Athena Hands-On Workshop
DoiT International
ย 
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
Michael Stack
ย 
An overview of Amazon Athena
An overview of Amazon Athena
Julien SIMON
ย 
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
ย 
Getting Started with EC2, S3 and EMR
Getting Started with EC2, S3 and EMR
Arun Sirimalla
ย 
Oracle Databases on AWS - Getting the Best Out of RDS and EC2
Oracle Databases on AWS - Getting the Best Out of RDS and EC2
Maris Elsins
ย 
Hadoop @ eBay: Past, Present, and Future
Hadoop @ eBay: Past, Present, and Future
Ryan Hennig
ย 
Autoscaling Spark on AWS EC2 - 11th Spark London meetup
Autoscaling Spark on AWS EC2 - 11th Spark London meetup
Rafal Kwasny
ย 
CosmosDB for DBAs & Developers
CosmosDB for DBAs & Developers
Niko Neugebauer
ย 
Presto Fast SQL on Anything
Presto Fast SQL on Anything
Alluxio, Inc.
ย 

Similar to Scalable Object Storage with Apache CloudStack and Apache Hadoop (20)

CloudStack Object Storage Framework & Demo
CloudStack Object Storage Framework & Demo
ShapeBlue
ย 
OpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
Hostway|HOSTING
ย 
Storage as a service OpenStack
Storage as a service OpenStack
openstackindia
ย 
Storing your data in the cloud: doing right reversim 2018
Storing your data in the cloud: doing right reversim 2018
Orit Wasserman
ย 
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
ย 
Apache CloudStack from API to UI
Apache CloudStack from API to UI
CloudStack - Open Source Cloud Computing Project
ย 
Hacking apache cloud stack
Hacking apache cloud stack
Nitin Mehta
ย 
Cloud storage: the right way OSS EU 2018
Cloud storage: the right way OSS EU 2018
Orit Wasserman
ย 
Initial presentation of swift (for montreal user group)
Initial presentation of swift (for montreal user group)
Marcos Garcรญa
ย 
Deploying Apache CloudStack from API to UI
Deploying Apache CloudStack from API to UI
Joe Brockmeier
ย 
Apache CloudStack: API to UI (STLLUG)
Apache CloudStack: API to UI (STLLUG)
Joe Brockmeier
ย 
Architecting An Enterprise Storage Platform Using Object Stores
Architecting An Enterprise Storage Platform Using Object Stores
Niraj Tolia
ย 
OpenStack 101: Introduction & Technical Overview
OpenStack 101: Introduction & Technical Overview
Thang Man
ย 
Storage as a service and OpenStack Cinder
Storage as a service and OpenStack Cinder
openstackindia
ย 
Txlf2012
Txlf2012
Joe Brockmeier
ย 
Power of OpenStack & Hadoop
Power of OpenStack & Hadoop
Tuan Yang
ย 
HDFS Tiered Storage: Mounting Object Stores in HDFS
HDFS Tiered Storage: Mounting Object Stores in HDFS
DataWorks Summit/Hadoop Summit
ย 
Building a Resilient, Scalable, Storage System with OpenStack
Building a Resilient, Scalable, Storage System with OpenStack
Cloudian
ย 
Data storage in clouds
Data storage in clouds
Thanassis Parathyras
ย 
CloudStack Object Storage Framework & Demo
CloudStack Object Storage Framework & Demo
ShapeBlue
ย 
OpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
Hostway|HOSTING
ย 
Storage as a service OpenStack
Storage as a service OpenStack
openstackindia
ย 
Storing your data in the cloud: doing right reversim 2018
Storing your data in the cloud: doing right reversim 2018
Orit Wasserman
ย 
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
ย 
Hacking apache cloud stack
Hacking apache cloud stack
Nitin Mehta
ย 
Cloud storage: the right way OSS EU 2018
Cloud storage: the right way OSS EU 2018
Orit Wasserman
ย 
Initial presentation of swift (for montreal user group)
Initial presentation of swift (for montreal user group)
Marcos Garcรญa
ย 
Deploying Apache CloudStack from API to UI
Deploying Apache CloudStack from API to UI
Joe Brockmeier
ย 
Apache CloudStack: API to UI (STLLUG)
Apache CloudStack: API to UI (STLLUG)
Joe Brockmeier
ย 
Architecting An Enterprise Storage Platform Using Object Stores
Architecting An Enterprise Storage Platform Using Object Stores
Niraj Tolia
ย 
OpenStack 101: Introduction & Technical Overview
OpenStack 101: Introduction & Technical Overview
Thang Man
ย 
Storage as a service and OpenStack Cinder
Storage as a service and OpenStack Cinder
openstackindia
ย 
Power of OpenStack & Hadoop
Power of OpenStack & Hadoop
Tuan Yang
ย 
HDFS Tiered Storage: Mounting Object Stores in HDFS
HDFS Tiered Storage: Mounting Object Stores in HDFS
DataWorks Summit/Hadoop Summit
ย 
Building a Resilient, Scalable, Storage System with OpenStack
Building a Resilient, Scalable, Storage System with OpenStack
Cloudian
ย 
Open stack in sina
Open stack in sina
Hui Cheng
ย 
Ad

More from buildacloud (20)

The Future of SDN in CloudStack by Chiradeep Vittal
The Future of SDN in CloudStack by Chiradeep Vittal
buildacloud
ย 
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
buildacloud
ย 
L4-L7 services for SDN and NVF by Youcef Laribi
L4-L7 services for SDN and NVF by Youcef Laribi
buildacloud
ย 
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
buildacloud
ย 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirch
buildacloud
ย 
Guaranteeing Storage Performance by Mike Tutkowski
Guaranteeing Storage Performance by Mike Tutkowski
buildacloud
ย 
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
buildacloud
ย 
Introduction to Apache CloudStack by David Nalley
Introduction to Apache CloudStack by David Nalley
buildacloud
ย 
Managing infrastructure with Application Policy by Mike Cohen
Managing infrastructure with Application Policy by Mike Cohen
buildacloud
ย 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirch
buildacloud
ย 
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
buildacloud
ย 
Rest api design by george reese
Rest api design by george reese
buildacloud
ย 
Enterprise grade firewall and ssl termination to ac by will stevens
Enterprise grade firewall and ssl termination to ac by will stevens
buildacloud
ย 
State of the cloud by reuven cohen
State of the cloud by reuven cohen
buildacloud
ย 
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
buildacloud
ย 
DevCloud - Setup and Demo on Apache CloudStack
DevCloud - Setup and Demo on Apache CloudStack
buildacloud
ย 
Cloud Network Virtualization with Juniper Contrail
Cloud Network Virtualization with Juniper Contrail
buildacloud
ย 
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
buildacloud
ย 
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
buildacloud
ย 
CloudStack University by Sebastien Goasguen
CloudStack University by Sebastien Goasguen
buildacloud
ย 
The Future of SDN in CloudStack by Chiradeep Vittal
The Future of SDN in CloudStack by Chiradeep Vittal
buildacloud
ย 
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
buildacloud
ย 
L4-L7 services for SDN and NVF by Youcef Laribi
L4-L7 services for SDN and NVF by Youcef Laribi
buildacloud
ย 
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
buildacloud
ย 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirch
buildacloud
ย 
Guaranteeing Storage Performance by Mike Tutkowski
Guaranteeing Storage Performance by Mike Tutkowski
buildacloud
ย 
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
buildacloud
ย 
Introduction to Apache CloudStack by David Nalley
Introduction to Apache CloudStack by David Nalley
buildacloud
ย 
Managing infrastructure with Application Policy by Mike Cohen
Managing infrastructure with Application Policy by Mike Cohen
buildacloud
ย 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirch
buildacloud
ย 
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
buildacloud
ย 
Rest api design by george reese
Rest api design by george reese
buildacloud
ย 
Enterprise grade firewall and ssl termination to ac by will stevens
Enterprise grade firewall and ssl termination to ac by will stevens
buildacloud
ย 
State of the cloud by reuven cohen
State of the cloud by reuven cohen
buildacloud
ย 
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
buildacloud
ย 
DevCloud - Setup and Demo on Apache CloudStack
DevCloud - Setup and Demo on Apache CloudStack
buildacloud
ย 
Cloud Network Virtualization with Juniper Contrail
Cloud Network Virtualization with Juniper Contrail
buildacloud
ย 
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
buildacloud
ย 
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
buildacloud
ย 
CloudStack University by Sebastien Goasguen
CloudStack University by Sebastien Goasguen
buildacloud
ย 
Ad

Recently uploaded (20)

FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Alliance
ย 
Data Validation and System Interoperability
Data Validation and System Interoperability
Safe Software
ย 
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
Precisely
ย 
Supporting the NextGen 911 Digital Transformation with FME
Supporting the NextGen 911 Digital Transformation with FME
Safe Software
ย 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
ย 
Providing an OGC API Processes REST Interface for FME Flow
Providing an OGC API Processes REST Interface for FME Flow
Safe Software
ย 
Kubernetes Security Act Now Before Itโ€™s Too Late
Kubernetes Security Act Now Before Itโ€™s Too Late
Michael Furman
ย 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
ย 
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Puppy jhon
ย 
Edge-banding-machines-edgeteq-s-200-en-.pdf
Edge-banding-machines-edgeteq-s-200-en-.pdf
AmirStern2
ย 
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance
ย 
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
NTT DATA Technology & Innovation
ย 
Artificial Intelligence in the Nonprofit Boardroom.pdf
Artificial Intelligence in the Nonprofit Boardroom.pdf
OnBoard
ย 
June Patch Tuesday
June Patch Tuesday
Ivanti
ย 
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Alliance
ย 
AI vs Human Writing: Can You Tell the Difference?
AI vs Human Writing: Can You Tell the Difference?
Shashi Sathyanarayana, Ph.D
ย 
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Alliance
ย 
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC
ย 
Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025
Safe Software
ย 
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
ย 
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Seminar: Evolving Landscape of Post-Quantum Cryptography.pptx
FIDO Alliance
ย 
Data Validation and System Interoperability
Data Validation and System Interoperability
Safe Software
ย 
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
Precisely
ย 
Supporting the NextGen 911 Digital Transformation with FME
Supporting the NextGen 911 Digital Transformation with FME
Safe Software
ย 
Security Tips for Enterprise Azure Solutions
Security Tips for Enterprise Azure Solutions
Michele Leroux Bustamante
ย 
Providing an OGC API Processes REST Interface for FME Flow
Providing an OGC API Processes REST Interface for FME Flow
Safe Software
ย 
Kubernetes Security Act Now Before Itโ€™s Too Late
Kubernetes Security Act Now Before Itโ€™s Too Late
Michael Furman
ย 
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
Tech-ASan: Two-stage check for Address Sanitizer - Yixuan Cao.pdf
caoyixuan2019
ย 
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Viral>Wondershare Filmora 14.5.18.12900 Crack Free Download
Puppy jhon
ย 
Edge-banding-machines-edgeteq-s-200-en-.pdf
Edge-banding-machines-edgeteq-s-200-en-.pdf
AmirStern2
ย 
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance
ย 
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
Can We Use Rust to Develop Extensions for PostgreSQL? (POSETTE: An Event for ...
NTT DATA Technology & Innovation
ย 
Artificial Intelligence in the Nonprofit Boardroom.pdf
Artificial Intelligence in the Nonprofit Boardroom.pdf
OnBoard
ย 
June Patch Tuesday
June Patch Tuesday
Ivanti
ย 
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Seminar: New Data: Passkey Adoption in the Workforce.pptx
FIDO Alliance
ย 
AI vs Human Writing: Can You Tell the Difference?
AI vs Human Writing: Can You Tell the Difference?
Shashi Sathyanarayana, Ph.D
ย 
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Alliance
ย 
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC and Open Hackathons Monthly Highlights June 2025
OpenACC
ย 
Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025
Safe Software
ย 
AI VIDEO MAGAZINE - June 2025 - r/aivideo
AI VIDEO MAGAZINE - June 2025 - r/aivideo
1pcity Studios, Inc
ย 

Scalable Object Storage with Apache CloudStack and Apache Hadoop

  • 1. Scalable Object Storage with Apache CloudStack and Apache Hadoop April 30 2013 Chiradeep Vittal @chiradeep
  • 2. Agenda โ€ข What is CloudStack โ€ข Object Storage for IAAS โ€ข Current Architecture and Limitations โ€ข Requirements for Object Storage โ€ข Object Storage integrations in CloudStack โ€ข HDFS for Object Storage โ€ข Future directions
  • 3. โ€ข History โ€ข Incubating in the Apache Software Foundation since April 2012 โ€ข Open Source since May 2010 โ€ข In production since 2009 โ€“ Turnkey platform for delivering IaaS clouds โ€“ Full featured GUI, end-user API and admin API Apache CloudStack Build your cloud the way the worldโ€™s most successful clouds are built
  • 4. How did Amazon build its cloud? Commodity Servers Commodity Storage Networking Open Source Xen Hypervisor Amazon Orchestration Software AWS API (EC2, S3, โ€ฆ) Amazon eCommerce Platform
  • 5. How can YOU build a cloud? Servers StorageNetworking Open Source Xen Hypervisor Amazon Orchestration Software AWS API (EC2, S3, โ€ฆ) Amazon eCommerce Platform Hypervisor (Xen/KVM/VMW/) CloudStack Orchestration Software Optional Portal CloudStack or AWS API
  • 6. Secondary Storage Image L3/L2 core DC Edge End users Pod Pod Pod Pod Zone Architecture Pod Access Sw MySQL CloudStack Admin/User API Primary Storage NFS/ISCSI/FC Hypervisor (Xen /VMWare/KVM) VM VM Snapshot Snapshot Image Disk Disk VM
  • 7. Cloud-Style Workloads โ€ข Low cost โ€“ Standardized, cookie cutter infrastructure โ€“ Highly automated and efficient โ€ข Application owns availability โ€“ At scale everything breaks โ€“ Focus on MTTR instead of MTBF
  • 8. Secondary Storage Image L3/L2 core DC Edge Pod Pod Pod Pod At scaleโ€ฆeverything breaks Pod Access Sw Primary Storage NFS/ISCSI/FC Hypervisor (Xen /VMWare/KVM) VM VM Snapshot Snapshot Image Disk Disk VM
  • 9. Region โ€œWestโ€ Zone โ€œWest-Alphaโ€ Zone โ€œWest-Betaโ€ Zone โ€œWest-Gammaโ€ Zone โ€œWest-Deltaโ€ Low Latency Backbone (e.g., SONET ring) Regions and zones
  • 11. Secondary Storage in CloudStack 4.0 โ€ข NFS server default โ€“ can be mounted by hypervisor โ€“ Easy to obtain, set up and operate โ€ข Problems with NFS: โ€“ Scale: max limits of file systems โ€ข Solution: CloudStack can manage multiple NFS stores (+ complexity) โ€“ Performance โ€ข N hypervisors : 1 storage CPU / 1 network link โ€“ Wide area suitability for cross-region storage โ€ข Chatty protocol โ€“ Lack of replication
  • 12. Object Storage Technology Region โ€œWestโ€ Zone โ€œWest-Alphaโ€ Zone โ€œWest-Betaโ€ Zone โ€œWest-Gammaโ€ Zone โ€œWest-Deltaโ€ Object Storage in a region โ€ข Replication โ€ข Audit โ€ข Repair โ€ข Maintenance
  • 14. Object Storage Technology Region โ€œWestโ€ Object Storage also enables other applications Object Store API Servers โ€ข DropBox โ€ข Static Content โ€ข Archival
  • 15. Object Storage characteristics โ€ข Highly reliable and durable โ€“ 99.9 % availability for AWS S3 โ€“ 99.999999999 % durability โ€ข Massive scale โ€“ 1.3 trillion objects stored across 7 AWS regions [Nov 2012 figures] โ€“ Throughput: 830,000 requests per second โ€ข Immutable objects โ€“ Objects cannot be modified, only deleted โ€ข Simple API โ€“ PUT/POST objects, GET objects, DELETE objects โ€“ No seek / no mutation / no POSIX API โ€ข Flat namespace โ€“ Everything stored in buckets. โ€“ Bucket names are unique โ€“ Buckets can only contain objects, not other buckets โ€ข Cheap and getting cheaper
  • 16. CloudStack S3 API Server Object Storage Technology S3 API Servers MySQL
  • 17. CloudStack S3 API Server โ€ข Understands AWS S3 REST-style and SOAP API โ€ข Pluggable backend โ€“ Backend storage needs to map simple calls to their API โ€ข E.g., createContainer, saveObject, loadObject โ€“ Default backend is a POSIX filesystem โ€“ Backend with Caringo Object Store (commercial vendor) available โ€“ HDFS backend also available โ€ข MySQL storage โ€“ Bucket -> object mapping โ€“ ACLs, bucket policies
  • 18. Object Store Integration into CloudStack โ€ข For images and snapshots โ€ข Replacement for NFS secondary storage Or Augmentation for NFS secondary storage โ€ข Integrations available with โ€“ Riak CS โ€“ Openstack Swift โ€ข New in 4.2 (upcoming): โ€“ Framework for integrating storage providers
  • 19. What do we want to build ? โ€ข Open source, ASL licensed object storage โ€ข Scales to at least 1 billion objects โ€ข Reliability and durability on par with S3 โ€ข S3 API (or similar, e.g., Google Storage) โ€ข Tooling around maintenance and operation, specific to object storage
  • 20. The following slides are a design discussion
  • 21. Architecture of Scalable Object Storage API Servers Auth Servers Object Servers Replicators/Auditors Object Lookup Servers
  • 22. Why HDFS โ€ข ASF Project (Apache Hadoop) โ€ข Immutable objects, replication โ€ข Reliability, scale and performance โ€“ 200 million objects in 1 cluster [Facebook] โ€“ 100 PB in 1 cluster [Facebook] โ€ข Simple operation โ€“ Just add data nodes
  • 23. HDFS-based Object Storage S3 API Servers S3 Auth Servers Data nodes Namenode pair HDFS API
  • 24. BUT โ€ข Name Node Scalability โ€“ 150 bytes RAM / block โ€“ GC issues โ€ข Name Node SPOF โ€“ Being addressed in the communityโœ” โ€ข Cross-zone replication โ€“ Rack-awareness placement โœ” โ€“ What if the zones are spread a little further apart? โ€ข Storage for object metadata โ€“ ACLs, policies, timers
  • 25. Name Node scalability โ€ข 1 billion objects = 3 billion blocks (chunks) โ€“ Average of 5 MB/object = 5 PB (actual), 15 PB (raw) โ€“ 450 GB of RAM per Name Node โ€ข 150b x 3 x 10^9 โ€“ 16 TB / node => 1000 Data nodes โ€ข Requires Name Node federation ? โ€ข Or an approach like HAR files
  • 26. Name Node Federation Extension: Federated NameNodes are HA pairs
  • 27. Federation issues โ€ข HA for name nodes โ€ข Namespace shards โ€“ Map object -> name node โ€ข Requires another scalable key-value store โ€“ HBase? โ€ข Rebalancing between name nodes
  • 28. Replication over lossy/slower links A. Asynchronous replication โ€“ Use distcp to replicate between clusters โ€“ 6 copies vs. 3 โ€“ Master/Slave relationship โ€ข Possibility of loss of data during failover โ€ข Need coordination logic outside of HDFS B. Synchronous replication โ€“ API server writes to 2 clusters and acks only when both writes are successful โ€“ Availability compromised when one zone is down
  • 29. CAP Theorem Consistency or Availability during partition Many nuances
  • 30. Storage for object metadata A. Store it in HDFS along with the object โ€“ Reads are expensive (e.g., to check ACL) โ€“ Mutable data, needs layer over HDFS B. Use another storage system (e.g. HBase) โ€“ Name node federation also requires this. C. Modify Name Node to store metadata โ€“ High performance โ€“ Not extensible
  • 31. Object store on HDFS Future โ€ข Viable for small-sized deployments โ€“ Up to 100-200 million objects โ€“ Datacenters close together โ€ข Larger deployments needs development โ€“ No effort ongoing at this time
  • 32. Conclusion โ€ข CloudStack needs object storage for โ€œcloud-styleโ€ workloads โ€ข Object Storage is not easy โ€ข HDFS comes close but not close enough โ€ข Join the community!

Editor's Notes

  • #4: Need a better slide than this
  • #5: Frequently require CCNA , Vmwareceritification, EMC training, etc etc. But they chose commondity systems. And simple networking.Can also sell cheaply since they use their own commerce platform.
  • #6: The key here is the API on top of the infrastructure. This is the disruptive piece for the industry. Forget about CCNA, Vmware cert, now people can programmatically control their infrastructure as well as the VMs on top of it.