This document provides an overview of spatial databases and the components of a spatial database management system (SDBMS). It discusses how SDBMSs compare to traditional database management systems (DBMSs) and geographic information systems (GISs). The key components of an SDBMS include the spatial data model, query language, query processing techniques using spatial indexes and optimization, and file organization. Spatial queries allow users to query and analyze spatial data and relationships.
Spatial databases are designed to store and analyze spatial data more efficiently than traditional databases. Spatial data represents objects in geometric space and includes points, lines, and polygons. Spatial databases use spatial indexes and spatial query languages to optimize storage and retrieval of spatial data types and allow spatial queries and analysis. Common spatial database operations include measurements, functions, and predicates on geometric objects.
This document provides an overview of spatial data and spatial databases. It discusses key topics like the types of spatial data, properties of spatial data, spatial data types, spatial databases and their characteristics. It also describes spatial database management systems (SDBMS) and their three-layer architecture. Additionally, it covers spatial indices, creation of spatial data, geographic information systems (GIS), advantages and disadvantages of spatial databases, and applications of spatial databases such as urban planning and military operations.
The document discusses geographic information systems (GIS) databases and data concepts. It describes how GIS databases organize spatial and attribute data into different data types. Spatial data represents location and shape, while attribute data describes characteristics. Metadata provides information about the data. GIS databases can represent data digitally as raster or vector formats. Raster data uses a grid of cells, while vector data uses points, lines, and polygons. The document also discusses relational database management systems and how they are well-suited for GIS applications due to flexibility and efficient data storage.
The document discusses spatial data and spatial databases. It defines spatial data as data related to space, including location, shape, size and orientation of objects. It discusses the types of spatial data like points, lines, polygons and pixels. It also discusses non-spatial data and how spatial data is organized using coordinates. The key properties of spatial data are geometry, distribution of objects in space, temporal changes and data volume. Spatial databases allow for efficient storage and querying of spatial data through the use of spatial data types and indexes.
SDBMS is a database system for handling spatial data like points, lines, surfaces and higher dimension objects used in applications like CAD, cartography and GIS. It stores spatial data describing attributes of objects like length, area and volume. SDBMS allows for spatial indexing to retrieve objects in an area without scanning the entire space, spatial joins to efficiently join multiple spatial objects, and querying spatial data based on topological, directional and metric relationships.
Database management unit 6 of computer engineeringwivax28493
This document provides information about emerging databases and complex data types covered in a database management systems course. It discusses topics like active databases, main memory databases, semantic databases, semi-structured data, nested data types like JSON and XML, object-relational databases, and spatial data types. It also provides references for additional reading.
Building a Spatial Database in PostgreSQLKudos S.A.S
The document discusses building a spatial database in PostgreSQL using PostGIS. It provides an introduction to spatial data and databases, and explains why PostGIS was created - to provide an open source spatial extension for PostgreSQL that is compliant with OpenGIS standards. It covers key topics like spatial data types, spatial relationships, spatial indexing and functions, and implementing the OpenGIS specification in PostgreSQL.
The document discusses spatial query languages and how they differ from standard SQL in handling geometric data types and operations, emphasizing the need for specific spatial queries like intersections and lengths. It introduces user-defined spatial data types and operations in SQL3 and details examples of spatial queries, including those using multiple tables and aggregate functions. The document also highlights the integration of spatial data into databases, supported by various vendors through the OGIS standard.
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapVikas Jagtap
The document discusses advanced database management system concepts, focusing primarily on spatial data management and web-based systems. It explains various spatial database features, including how spatial queries work, different architectures (2-tier, 3-tier, and n-tier), and mobile databases, emphasizing their relevance in modern applications. Additionally, it outlines multimedia databases and their components, showcasing the increasing integration of such systems in various industries.
PostGIS is a spatial extension for PostgreSQL that aims to make it compliant with the OpenGIS Simple Features for SQL standard. It adds spatial data types and functions to PostgreSQL, allowing spatial objects to be stored and manipulated like other database objects. PostGIS indexes spatial data using GiST to enable fast spatial queries in SQL. The open source PostGIS software helps enable more open and affordable spatial databases and GIS applications.
Spot db consistency checking and optimization in spatial databasePratik Udapure
This document discusses optimizing spatial databases. It covers spatial indexes like grids, z-order, octrees, quadtrees, UB-trees, R-trees, and kd-trees that are used to optimize spatial queries by decreasing search time. Spatial queries allow processing data types like geometry and geography and consider spatial relationships between objects. Examples of SQL queries on spatial data and features of spatial databases like spatial measurements and functions are also provided.
Enterprise geodatabase sql access and administrationbrentpierce
The document provides an overview of accessing and administering an enterprise geodatabase through SQL and Python. It discusses how the geodatabase is based on relational database principles with user data stored in tables and system metadata stored in system tables. It describes how spatial types store geometry data and the benefits of using SQL to access and edit geodatabase content. The document also outlines how Python can be used for geodatabase administration tasks like schema creation, maintenance, and publishing tools.
Building A Spatial Database In Postgresql (Ppt).pdfssuser0ab1a4
The document discusses PostGIS, a spatial extension for PostgreSQL that enables the handling of spatial data in a relational database. It highlights the advantages of using spatial databases, such as support for complex spatial relationships, SQL querying for spatial operations, and integration with other data types. Additionally, it addresses the importance of OpenGIS standards and the significance of PostGIS in open-source geographic information systems.
Temporal and spatial databases allow for the storage and querying of data that involves time or spatial aspects. Conventional databases only provide a snapshot of data, while temporal databases can maintain historical information and access past states of data. Spatial databases support spatial data types like polygons and spatial queries involving operations like overlap and distance. Key components of temporal and spatial databases are their data models, query languages, indexing structures like R-trees, and optimized algorithms for processing queries over time-series and geospatial data.
TYBSC IT PGIS Unit II Chapter I Data Management and Processing SystemsArti Parab Academics
This document discusses geographic information systems (GIS). It defines GIS as hardware and software used to process, store, and transfer geographic data. It describes how GIS has evolved from using analog data and manual processing to increased use of digital data, computers, and software. It also discusses key GIS concepts like spatial data capture and analysis, data storage and management, and data presentation.
Spatial Database Systems Design Implementation And Project Management 1st Edi...uahpirsahrin
Spatial Database Systems Design Implementation And Project Management 1st Edition Albert Kw Yeung
Spatial Database Systems Design Implementation And Project Management 1st Edition Albert Kw Yeung
Spatial Database Systems Design Implementation And Project Management 1st Edition Albert Kw Yeung
Spatial SQL is a variant of SQL that is designed to handle spatial data stored in a database. It provides functions and capabilities for working with spatial data types like points, lines, and polygons. These functions allow for spatial queries, transformations between coordinate systems, and other operations that are useful for GIS applications but difficult to perform with standard SQL. The key advantages of Spatial SQL are that it allows large volumes of spatial data to be stored and queried efficiently using specialized indexes and functions optimized for spatial data.
Spatial databases allow for storage and querying of spatial data types like points, lines, and regions. A spatial database management system (SDBMS) extends a regular DBMS with spatial data types, models, and query languages. It supports spatial indexing and algorithms for spatial queries. Common spatial queries include selection queries to find objects within a region, nearest neighbor queries to find closest objects, and spatial joins to find object relationships. Spatial indexing uses access methods like R-trees that represent regions as minimum bounding rectangles to enable efficient filtering and refinement of query results.
The document discusses advancements in hardware and software trends related to geographic information systems (GIS), highlighting the capabilities for data management, manipulation, and analysis of spatial data. It covers the architecture of GIS, the importance of spatial data infrastructure (SDI), and the role of database management systems (DBMS) for storing and retrieving geospatial information. Furthermore, it elaborates on the relational data model, querying methods, and the integration of spatial data within relational databases to enhance functionality and user access.
The document summarizes a seminar on spatial databases. It defines spatial data as data that identifies geographic locations and features on Earth. It then defines a spatial database as a database optimized to store and query geometric objects like points, lines and polygons. It lists some key advantages of spatial databases like transactions, security and concurrent users. It also lists some disadvantages as being complex to design and requiring suitable hardware. It concludes that spatial databases are valuable for applications like military planning, insurance and healthcare by allowing storage, querying and sharing of spatial data.
This document provides a summary of a technical workshop on administering Microsoft SQL Server geodatabases. The presentation covers topics such as configuring SQL Server to support geodatabases, creating databases and geodatabases, choosing spatial data storage options, organizing and controlling access to data, ensuring data safety, and improving performance. It also lists related demo theater presentations on working with SQL Server Express geodatabases and using SQL Server Profiler to troubleshoot.
The document discusses spatial SQL and databases. It provides an agenda for installing software, building a database, importing shapefiles, and writing queries. It then defines spatial SQL, discusses drivers for increased use of location data, and lists databases that support spatial SQL like Oracle, MySQL, SQL Server, Spatial Lite, and PostGIS. Finally, it covers functionality of spatial databases and shapefiles, and capabilities of PostGIS like spatial indexing.
The document discusses different types of geodatabases and their benefits. It explains that geodatabases store both geospatial and attribute data together, unlike shapefiles which store them separately. The main types discussed are file geodatabases, personal geodatabases, and ArcSDE/enterprise databases. Functionality like feature datasets, topology, and versioning are explained. Normalization and SQL queries are also covered.
METHODS OF DATA COLLECTION (Research methodology)anwesha248
The important methods of data collection along with advantages and disadvantages are published in this power point presentation.
published by- Prateeti Anwesha Bordoloi, Research Scholar
Mahapurusha Srimanta Sankaradeva Vishwavidyalaya
June 2025
Veilig en vlot fietsen in Oost-Vlaanderen: Fietssnelwegen geoptimaliseerd met...jacoba18
Dit project, in samenwerking met de provincie Oost-Vlaanderen, heeft als doel de veiligheid op de F45 tussen Oudenaarde en Gent te verbeteren door netwerkdata aan te vullen met snelheidslimieten, toegangsbeperkingen en voorrangsregels. Met mobile mapping en FME werden verkeersborden en kruispunten geanalyseerd, waarna de resultaten in ArcGIS inzicht boden in toegankelijke wegsegmenten en verkeersregels.
More Related Content
Similar to Intro to spatial databases: GIS, Postgres. (20)
The document discusses spatial query languages and how they differ from standard SQL in handling geometric data types and operations, emphasizing the need for specific spatial queries like intersections and lengths. It introduces user-defined spatial data types and operations in SQL3 and details examples of spatial queries, including those using multiple tables and aggregate functions. The document also highlights the integration of spatial data into databases, supported by various vendors through the OGIS standard.
ADVANCE DATABASE MANAGEMENT SYSTEM CONCEPTS & ARCHITECTURE by vikas jagtapVikas Jagtap
The document discusses advanced database management system concepts, focusing primarily on spatial data management and web-based systems. It explains various spatial database features, including how spatial queries work, different architectures (2-tier, 3-tier, and n-tier), and mobile databases, emphasizing their relevance in modern applications. Additionally, it outlines multimedia databases and their components, showcasing the increasing integration of such systems in various industries.
PostGIS is a spatial extension for PostgreSQL that aims to make it compliant with the OpenGIS Simple Features for SQL standard. It adds spatial data types and functions to PostgreSQL, allowing spatial objects to be stored and manipulated like other database objects. PostGIS indexes spatial data using GiST to enable fast spatial queries in SQL. The open source PostGIS software helps enable more open and affordable spatial databases and GIS applications.
Spot db consistency checking and optimization in spatial databasePratik Udapure
This document discusses optimizing spatial databases. It covers spatial indexes like grids, z-order, octrees, quadtrees, UB-trees, R-trees, and kd-trees that are used to optimize spatial queries by decreasing search time. Spatial queries allow processing data types like geometry and geography and consider spatial relationships between objects. Examples of SQL queries on spatial data and features of spatial databases like spatial measurements and functions are also provided.
Enterprise geodatabase sql access and administrationbrentpierce
The document provides an overview of accessing and administering an enterprise geodatabase through SQL and Python. It discusses how the geodatabase is based on relational database principles with user data stored in tables and system metadata stored in system tables. It describes how spatial types store geometry data and the benefits of using SQL to access and edit geodatabase content. The document also outlines how Python can be used for geodatabase administration tasks like schema creation, maintenance, and publishing tools.
Building A Spatial Database In Postgresql (Ppt).pdfssuser0ab1a4
The document discusses PostGIS, a spatial extension for PostgreSQL that enables the handling of spatial data in a relational database. It highlights the advantages of using spatial databases, such as support for complex spatial relationships, SQL querying for spatial operations, and integration with other data types. Additionally, it addresses the importance of OpenGIS standards and the significance of PostGIS in open-source geographic information systems.
Temporal and spatial databases allow for the storage and querying of data that involves time or spatial aspects. Conventional databases only provide a snapshot of data, while temporal databases can maintain historical information and access past states of data. Spatial databases support spatial data types like polygons and spatial queries involving operations like overlap and distance. Key components of temporal and spatial databases are their data models, query languages, indexing structures like R-trees, and optimized algorithms for processing queries over time-series and geospatial data.
TYBSC IT PGIS Unit II Chapter I Data Management and Processing SystemsArti Parab Academics
This document discusses geographic information systems (GIS). It defines GIS as hardware and software used to process, store, and transfer geographic data. It describes how GIS has evolved from using analog data and manual processing to increased use of digital data, computers, and software. It also discusses key GIS concepts like spatial data capture and analysis, data storage and management, and data presentation.
Spatial Database Systems Design Implementation And Project Management 1st Edi...uahpirsahrin
Spatial Database Systems Design Implementation And Project Management 1st Edition Albert Kw Yeung
Spatial Database Systems Design Implementation And Project Management 1st Edition Albert Kw Yeung
Spatial Database Systems Design Implementation And Project Management 1st Edition Albert Kw Yeung
Spatial SQL is a variant of SQL that is designed to handle spatial data stored in a database. It provides functions and capabilities for working with spatial data types like points, lines, and polygons. These functions allow for spatial queries, transformations between coordinate systems, and other operations that are useful for GIS applications but difficult to perform with standard SQL. The key advantages of Spatial SQL are that it allows large volumes of spatial data to be stored and queried efficiently using specialized indexes and functions optimized for spatial data.
Spatial databases allow for storage and querying of spatial data types like points, lines, and regions. A spatial database management system (SDBMS) extends a regular DBMS with spatial data types, models, and query languages. It supports spatial indexing and algorithms for spatial queries. Common spatial queries include selection queries to find objects within a region, nearest neighbor queries to find closest objects, and spatial joins to find object relationships. Spatial indexing uses access methods like R-trees that represent regions as minimum bounding rectangles to enable efficient filtering and refinement of query results.
The document discusses advancements in hardware and software trends related to geographic information systems (GIS), highlighting the capabilities for data management, manipulation, and analysis of spatial data. It covers the architecture of GIS, the importance of spatial data infrastructure (SDI), and the role of database management systems (DBMS) for storing and retrieving geospatial information. Furthermore, it elaborates on the relational data model, querying methods, and the integration of spatial data within relational databases to enhance functionality and user access.
The document summarizes a seminar on spatial databases. It defines spatial data as data that identifies geographic locations and features on Earth. It then defines a spatial database as a database optimized to store and query geometric objects like points, lines and polygons. It lists some key advantages of spatial databases like transactions, security and concurrent users. It also lists some disadvantages as being complex to design and requiring suitable hardware. It concludes that spatial databases are valuable for applications like military planning, insurance and healthcare by allowing storage, querying and sharing of spatial data.
This document provides a summary of a technical workshop on administering Microsoft SQL Server geodatabases. The presentation covers topics such as configuring SQL Server to support geodatabases, creating databases and geodatabases, choosing spatial data storage options, organizing and controlling access to data, ensuring data safety, and improving performance. It also lists related demo theater presentations on working with SQL Server Express geodatabases and using SQL Server Profiler to troubleshoot.
The document discusses spatial SQL and databases. It provides an agenda for installing software, building a database, importing shapefiles, and writing queries. It then defines spatial SQL, discusses drivers for increased use of location data, and lists databases that support spatial SQL like Oracle, MySQL, SQL Server, Spatial Lite, and PostGIS. Finally, it covers functionality of spatial databases and shapefiles, and capabilities of PostGIS like spatial indexing.
The document discusses different types of geodatabases and their benefits. It explains that geodatabases store both geospatial and attribute data together, unlike shapefiles which store them separately. The main types discussed are file geodatabases, personal geodatabases, and ArcSDE/enterprise databases. Functionality like feature datasets, topology, and versioning are explained. Normalization and SQL queries are also covered.
METHODS OF DATA COLLECTION (Research methodology)anwesha248
The important methods of data collection along with advantages and disadvantages are published in this power point presentation.
published by- Prateeti Anwesha Bordoloi, Research Scholar
Mahapurusha Srimanta Sankaradeva Vishwavidyalaya
June 2025
Veilig en vlot fietsen in Oost-Vlaanderen: Fietssnelwegen geoptimaliseerd met...jacoba18
Dit project, in samenwerking met de provincie Oost-Vlaanderen, heeft als doel de veiligheid op de F45 tussen Oudenaarde en Gent te verbeteren door netwerkdata aan te vullen met snelheidslimieten, toegangsbeperkingen en voorrangsregels. Met mobile mapping en FME werden verkeersborden en kruispunten geanalyseerd, waarna de resultaten in ArcGIS inzicht boden in toegankelijke wegsegmenten en verkeersregels.
apidays New York 2025 - The Future of Small Business Lending with Open Bankin...apidays
The Future of Small Business Lending with Open Banking – Bridging the $750 Billion Funding Gap
Charles Groome, Vice President of Growth Strategy at Biz2Credit
apidays New York 2025
API Management for Surfing the Next Innovation Waves: GenAI and Open Banking
May 14 & 15, 2025
------
Check out our conferences at https://www.apidays.global/
Do you want to sponsor or talk at one of our conferences?
https://apidays.typeform.com/to/ILJeAaV8
Learn more on APIscene, the global media made by the community for the community:
https://www.apiscene.io
Explore the API ecosystem with the API Landscape:
https://apilandscape.apiscene.io/
apidays New York 2025 - Building Scalable AI Systems by Sai Prasad Veluru (Ap...apidays
Building Scalable AI Systems: Cloud Architecture for Performance
Sai Prasad Veluru, Software Engineer at Apple Inc
apidays New York 2025
API Management for Surfing the Next Innovation Waves: GenAI and Open Banking
May 14 & 15, 2025
------
Check out our conferences at https://www.apidays.global/
Do you want to sponsor or talk at one of our conferences?
https://apidays.typeform.com/to/ILJeAaV8
Learn more on APIscene, the global media made by the community for the community:
https://www.apiscene.io
Explore the API ecosystem with the API Landscape:
https://apilandscape.apiscene.io/
apidays New York 2025 - Fast, Repeatable, Secure: Pick 3 with FINOS CCC by Le...apidays
Fast, Repeatable, Secure: Pick 3 with FINOS CCC
Leigh Capili, Kubernetes Contributor at Control Plane
apidays New York 2025
API Management for Surfing the Next Innovation Waves: GenAI and Open Banking
May 14 & 15, 2025
------
Check out our conferences at https://www.apidays.global/
Do you want to sponsor or talk at one of our conferences?
https://apidays.typeform.com/to/ILJeAaV8
Learn more on APIscene, the global media made by the community for the community:
https://www.apiscene.io
Explore the API ecosystem with the API Landscape:
https://apilandscape.apiscene.io/
THE FRIEDMAN TEST ( Biostatics B. Pharm)JishuHaldar
The Friedman Test is a valuable non-parametric alternative to the
Repeated Measures ANOVA, allowing for the comparison of three or
more related groups when data is ordinal or not normally distributed.
By ranking data instead of using raw values, the test overcomes the
limitations of parametric tests, making it ideal for small sample sizes and
real-world applications in medicine, psychology, pharmaceutical
sciences, and education. However, while it effectively detects differences
among groups, it does not indicate which specific groups differ, requiring
further post-hoc analysis.
apidays New York 2025 - Beyond Webhooks: The Future of Scalable API Event Del...apidays
Beyond Webhooks: The Future of Scalable API Event Delivery
Phil Leggetter, Head of Developer Experience at Hookdeck
apidays New York 2025
API Management for Surfing the Next Innovation Waves: GenAI and Open Banking
May 14 & 15, 2025
------
Check out our conferences at https://www.apidays.global/
Do you want to sponsor or talk at one of our conferences?
https://apidays.typeform.com/to/ILJeAaV8
Learn more on APIscene, the global media made by the community for the community:
https://www.apiscene.io
Explore the API ecosystem with the API Landscape:
https://apilandscape.apiscene.io/
What is FinOps as a Service and why is it Trending?Amnic
The way we build and scale companies today has changed forever because of cloud adoption. However, this flexibility introduces unpredictability, which often results in overspending, inefficiencies, and a lack of cost accountability.
FinOps as a Service is a modern approach to cloud cost management that combines powerful tooling with expert advisory to bring financial visibility, governance, and optimization into the cloud operating model, without slowing down the engineering team. FinOps empowers the engineering team, finance, and leadership/management as they make data-informed decisions about cost, together.
In this presentation, we will break down what FinOps is, why it matters more than ever, and a little about how a managed FinOps service can help organizations:
- Optimize cloud spend - without slowing down dev
- Create visibility into the cost per team, service, or feature
- Set financial guardrails while allowing autonomy in engineering
- Drive cultural alignment between finance, engineering, and product
This will guide and help whether you are a cloud-native startup or a scaling enterprise, and convert cloud cost into a strategic advantage.
apidays New York 2025 - Using GraphQL SDL files as executable API Contracts b...apidays
Using GraphQL SDL files as executable API Contracts
Hari Krishnan, Co-founder & CTO at Specmatic
apidays New York 2025
API Management for Surfing the Next Innovation Waves: GenAI and Open Banking
May 14 & 15, 2025
------
Check out our conferences at https://www.apidays.global/
Do you want to sponsor or talk at one of our conferences?
https://apidays.typeform.com/to/ILJeAaV8
Learn more on APIscene, the global media made by the community for the community:
https://www.apiscene.io
Explore the API ecosystem with the API Landscape:
https://apilandscape.apiscene.io/
FME Beyond Data Processing: Creating a Dartboard Accuracy Appjacoba18
Bij Nordend vroegen we ons af of FME ons kan helpen bepalen waar we moeten mikken voor de hoogste score, gebaseerd op onze nauwkeurigheid. We tonen hoe we met FME Flow, een zelfgemaakte WMS-dartbordlaag en geanalyseerde gooiresultaten optimale mikpunten berekenden.
Report based on the findings of a quantitative research conducted by the research agency New Image Marketing Group, commissioned by the NGO Detector Media, compiled by PhD in Sociology Marta Naumova
Unlock the power of data with “SQL Demystified” — your perfect beginner-friendly guide to mastering Structured Query Language (SQL)! Whether you're a student, aspiring analyst, or a professional stepping into the world of data, this detailed and practical PowerPoint presentation simplifies the fundamentals and builds your SQL skills step-by-step.
📘 What’s Inside?
Introduction to SQL and its real-world importance
Understanding database structures: tables, columns & relationships
Core SQL queries: SELECT, FROM, WHERE, ORDER BY
Hands-on examples and practice exercises
Advanced SQL: JOIN, subqueries, and aggregate functions
Data manipulation: INSERT, UPDATE, DELETE
Performance tips and indexing basics
SQL security best practices and learning paths for future growth
🧠 Who is it for?
Beginners eager to learn SQL
Data science and computer students
Business professionals and analysts
Anyone curious about how databases work
✅ Features:
Clear visuals and examples
Easy-to-follow explanations
Real-life applications in banking, healthcare, e-commerce, and more
Reusable and editable presentation file (PPTX format)
Take your first step toward data mastery with confidence. Learn SQL the smart way — with structure, clarity, and purpose.
1. Chapter 1: Introduction to Spatial Databases
1.1 Overview
1.2 Application domains
1.3 Compare a SDBMS with a GIS
1.4 Categories of Users
1.5 An example of an SDBMS application
1.6 A Stroll though a spatial database
1.6.1 Data Models, 1.6.2 Query Language, 1.6.3 Query Processing,
1.6.4 File Organization and Indices, 1.6.5 Query Optimization,
1.6.6 Data Mining
2. Learning Objectives
Learning Objectives (LO)
LO1 : Understand the value of SDBMS
• Application domains
• users
• How is different from a DBMS?
LO2: Understand the concept of spatial databases
LO3: Learn about the Components of SDBMS
Mapping Sections to learning objectives
LO1 - 1.1, 1.2, 1.4
LO2 - 1.3, 1.5
LO3 - 1.6
3. Value of SDBMS
Traditional (non-spatial) database management systems provide:
Persistence across failures
Allows concurrent access to data
Scalability to search queries on very large datasets which do not
fit inside main memories of computers
Efficient for non-spatial queries, but not for spatial queries
Non-spatial queries:
List the names of all bookstore with more than ten thousand
titles.
List the names of ten customers, in terms of sales, in the year
2001
Spatial Queries:
List the names of all bookstores with ten miles of Minneapolis
List all customers who live in Tennessee and its adjoining states
4. Value of SDBMS – Spatial Data Examples
Examples of non-spatial data
Names, phone numbers, email addresses of people
Examples of Spatial data
Census Data
NASA satellites imagery - terabytes of data per day
Weather and Climate Data
Rivers, Farms, ecological impact
Medical Imaging
Exercise: Identify spatial and non-spatial data items in
A phone book
A cookbook with recipes
5. Value of SDBMS – Users, Application Domains
Many important application domains have spatial data and
queries. Some Examples follow:
Army Field Commander: Has there been any significant
enemy troop movement since last night?
Insurance Risk Manager: Which homes are most likely to
be affected in the next great flood on the Mississippi?
Medical Doctor: Based on this patient's MRI, have we
treated somebody with a similar condition ?
Molecular Biologist:Is the topology of the amino acid
biosynthesis gene in the genome found in any other
sequence feature map in the database ?
Astronomer:Find all blue galaxies within 2 arcmin of
quasars.
Exercise: List two ways you have used spatial data. Which
software did you use to manipulate spatial data?
6. Learning Objectives
Learning Objectives (LO)
LO1 : Understand the value of SDBMS
LO2: Understand the concept of spatial
databases
• What is a SDBMS?
• How is it different from a GIS?
LO3: Learn about the Components of SDBMS
Sections for LO2
Section 1.5 provides an example SDBMS
Section 1.1 and 1.3 compare SDBMS with DBMS and GIS
7. What is a SDBMS ?
A SDBMS is a software module that
can work with an underlying DBMS
supports spatial data models, spatial abstract data types
(ADTs) and a query language from which these ADTs are
callable
supports spatial indexing, efficient algorithms for processing
spatial operations, and domain specific rules for query
optimization
Example: Oracle Spatial data cartridge, ESRI SDE
can work with Oracle 8i DBMS
Has spatial data types (e.g. polygon), operations (e.g.
overlap) callable from SQL3 query language
Has spatial indices, e.g. R-trees
8. SDBMS Example
Consider a spatial dataset with:
County boundary (dashed white line)
Census block - name, area,
population, boundary (dark line)
Water bodies (dark polygons)
Satellite Imagery (gray scale pixels)
Storage in a SDBMS table:
create table census_blocks (
name string,
area float,
population number,
boundary polygon );
Fig 1.2
9. Modeling Spatial Data in Traditional DBMS
•A row in the table census_blocks (Figure 1.3)
• Question: Is Polyline datatype supported in DBMS?
Figure 1.3
10. Spatial Data Types and Traditional Databases
Traditional relational DBMS
Support simple data types, e.g. number, strings, date
Modeling Spatial data types is tedious
Example: Figure 1.4 shows modeling of polygon using numbers
Three new tables: polygon, edge, points
• Note: Polygon is a polyline where last point and first point are
same
A simple unit sqaure represented as 16 rows across 3 tables
Simple spatial operators, e.g. area(), require joining tables
Tedious and computationally inefficient
Question. Name post-relational database management systems
which facilitate modeling of spatial data types, e.g. polygon.
13. Spatial Data Types and Post-relational Databases
Post-relational DBMS
Support user defined abstract data types
Spatial data types (e.g. polygon) can be added
Choice of post-relational DBMS
Object oriented (OO) DBMS
Object relational (OR) DBMS
A spatial database is a collection of spatial data types, operators,
indices, processing strategies, etc. and can work with many
post-relational DBMS as well as programming languages like
Java, Visual Basic etc.
14. How is a SDBMS different from a GIS ?
GIS is a software to visualize and analyze spatial
data using spatial analysis functions such as
Search Thematic search, search by region, (re-)classification
Location analysis Buffer, corridor, overlay
Terrain analysis Slope/aspect, catchment, drainage network
Flow analysis Connectivity, shortest path
Distribution Change detection, proximity, nearest neighbor
Spatial analysis/Statistics Pattern, centrality, autocorrelation,
indices of similarity, topology: hole description
Measurements Distance, perimeter, shape, adjacency, direction
GIS uses SDBMS
to store, search, query, share large spatial data
sets
15. How is a SDBMS different from a GIS ?
SDBMS focusses on
Efficient storage, querying, sharing of large spatial datasets
Provides simpler set based query operations
Example operations: search by region, overlay, nearest
neighbor, distance, adjacency, perimeter etc.
Uses spatial indices and query optimization to speedup
queries over large spatial datasets.
SDBMS may be used by applications other than GIS
Astronomy, Genomics, Multimedia information systems, ...
Will one use a GIS or a SDBM to answer the following:
How many neighboring countries does USA have?
Which country has highest number of neighbors?
16. Evolution of acronym “GIS”
Fig 1.1
Geographic Information Systems (1980s)
Geographic Information Science (1990s)
Geographic Information Services (2000s)
17. Three meanings of the acronym GIS
Geographic Information Services
Web-sites and service centers for casual users, e.g. travelers
Example: Service (e.g. AAA, mapquest) for route planning
Geographic Information Systems
Software for professional users, e.g. cartographers
Example: ESRI Arc/View software
Geographic Information Science
Concepts, frameworks, theories to formalize use and
development of geographic information systems and services
Example: design spatial data types and operations for
querying
Exercise: Which meaning of the term GIS is closest to the focus of
the book titled “Spatial Databases: A Tour”?
18. Learning Objectives
Learning Objectives (LO)
LO1 : Understand the value of SDBMS
LO2: Understand the concept of spatial databases
LO3: Learn about the Components of SDBMS
• Architecture choices
• SDBMS components:
– data model, query languages,
– query processing and optimization
– File organization and indices
– Data Mining
Chapter Sections
1.5 second half
1.6 – entire section
19. Components of a SDBMS
Recall: a SDBMS is a software module that
can work with an underlying DBMS
supports spatial data models, spatial ADTs and a query
language from which these ADTs are callable
supports spatial indexing, algorithms for processing
spatial operations, and domain specific rules for query
optimization
Components include
spatial data model, query language, query processing, file
organization and indices, query optimization, etc.
Figure 1.6 shows these components
We discuss each component briefly in chapter 1.6 and in
more detail in later chapters.
21. 1.6.1 Spatial Taxonomy, Data Models
Spatial Taxonomy:
multitude of descriptions available to organize space.
Topology models homeomorphic relationships, e.g. overlap
Euclidean space models distance and direction in a plane
Graphs models connectivity, Shortest-Path
Spatial data models
rules to identify identifiable objects and properties of space
Object model help manage identifiable things, e.g.
mountains, cities, land-parcels etc.
Field model help manage continuous and amorphous
phenomenon, e.g. wetlands, satellite imagery, snowfall etc.
More details in chapter 2.
22. 1.6.2 Spatial Query Language
• Spatial query language
• Spatial data types, e.g. point, linestring, polygon, …
• Spatial operations, e.g. overlap, distance, nearest neighbor,
…
• Callable from a query language (e.g. SQL3) of underlying
DBMS
SELECT S.name
FROM Senator S
WHERE S.district.Area() > 300
• Standards
• SQL3 (a.k.a. SQL 1999) is a standard for query languages
• OGIS is a standard for spatial data types and operators
• Both standards enjoy wide support in industry
• More details in chapters 2 and 3
23. Multi-scan Query Example
• Spatial join example
SELECT S.name FROM Senator S, Business B
WHERE S.district.Area() > 300 AND Within(B.location, S.district)
• Non-Spatial Join example
SELECT S.name FROM Senator S, Business B
WHERE S.soc-sec = B.soc-sec AND S.gender = ‘Female’
Fig 1.7
24. 1.6.3 Query Processing
• Efficient algorithms to answer spatial queries
• Common Strategy - filter and refine
• Filter Step:Query Region overlaps with MBRs of B,C and D
• Refine Step: Query Region overlaps with B and C
Fig 1.8
25. Query Processing of Join Queries
Fig 1.9
•Example - Determining pairs of intersecting rectangles
• (a):Two sets R and S of rectangles, (b): A rectangle with 2 opposite corners
marked, (c ): Rectangles sorted by smallest X coordinate value
• Plane sweep filter identifies 5 pairs out of 12 for refinement step
•Details of plane sweep algorithm on page 15
26. 1.6.4 File Organization and Indices
Fig 1.10
• A difference between GIS and SDBMS assumptions
•GIS algorithms: dataset is loaded in main memory (Fig. 1.10(a))
•SDBMS: dataset is on secondary storage e.g disk (Fig. 1.10(b))
•SDBMS uses space filling curves and spatial indices
•to efficiently search disk resident large spatial datasets
27. Organizing spatial data with space filling curves
Fig 1.11
•Issue:
•Sorting is not naturally defined on spatial data
•Many efficient search methods are based on sorting datasets
•Space filling curves
•Impose an ordering on the locations in a multi-dimensional space
•Examples: row-order (Fig. 1.11(a), z-order (Fig 1.11(b))
• Allow use of traditional efficient search methods on spatial data
28. Spatial Indexing: Search Data-Structures
Fig. 1.13: R- tree
Fig 1.12: B-tree
•Choice for spatial indexing:
•B-tree is a hierarchical collection of ranges of linear keys, e.g. numbers
•B-tree index is used for efficient search of traditional data
•B-tree can be used with space filling curve on spatial data
•R-tree provides better search performance yet!
•R-tree is a hierarchical collection of rectangles
•More details in chapter 4
29. 1.6.5 Query Optimization
•Query Optimization
• A spatial operation can be processed using different strategies
• Computation cost of each strategy depends on many parameters
•Query optimization is the process of
•ordering operations in a query and
•selecting efficient strategy for each operation
•based on the details of a given dataset
•Example Query:
SELECT S.name FROM Senator S, Business B
WHERE S.soc-sec = B.soc-sec AND S.gender = ‘Female’
•Optimization decision examples
•Process (S.gender = ‘Female’) before (S.soc-sec = B.soc-sec )
•Do not use index for processing (S.gender = ‘Female’)
30. 1.6.6 Data Mining
• Analysis of spatial data is of many types
• Deductive Querying, e.g. searching, sorting, overlays
• Inductive Mining, e.g. statistics, correlation, clustering,classification, …
• Data mining is a systematic and semi-automated search for
interesting non-trivial patterns in large spatial databases
•Example applications include
•Infer land-use classification from satellite imagery
•Identify cancer clusters and geographic factors with high correlation
•Identify crime hotspots to assign police patrols and social workers
31. 1.7 Summary
SDBMS is valuable to many important applications
SDBMS is a software module
works with an underlying DBMS
provides spatial ADTs callable from a query language
provides methods for efficient processing of spatial
queries
Components of SDBMS include
spatial data model, spatial data types and operators,
spatial query language, processing and optimization
spatial data mining
SDBMS is used to store, query and share spatial
data for GIS as well as other applications