The GROUP BY
clause in PL/SQL is a powerful tool used to organize data into aggregated groups based on one or more columns. It is essential for performing summary operations on large datasets, enabling efficient data analysis by grouping rows that share common values.
In this article, We will learn about PL/SQL GROUP BY Clause with the help of various examples and so on in detail.
PL/SQL GROUP BY Clause
- The
GROUP BY
clause in PL/SQL is used to group rows with the same values in specified columns into aggregated data.
- This clause is commonly used in SQL queries to aggregate data, which means summarizing or performing calculations on sets of rows.
- When using
GROUP BY
, we typically pair it with aggregate functions like SUM()
, COUNT()
, AVG()
, MAX()
, and MIN()
to compute summary statistics or perform other calculations on grouped data
Syntax:
SELECT expression1, expression2, ... expression_n,
aggregate_function (aggregate_expression)
FROM tables
WHERE conditions
GROUP BY expression1, expression2, ... expression_n;
Parameters:
- expression1, expression2, ... expression_n: It specifies the expressions that are not encapsulated within aggregate function. These expressions must be included in GROUP BY clause.
- aggregate_function: It specifies the aggregate functions i.e. SUM, COUNT, MIN, MAX or AVG functions.
- aggregate_expression: It specifies the column or expression on that the aggregate function is based on.
- tables: It specifies the table from where you want to retrieve records.
- conditions: It specifies the conditions that must be fulfilled for the record to be selected.
Examples of PL/SQL GROUP BY Clause
To illustrate how the GROUP BY clause works in PL/SQL, we first need to create a table and insert some sample data. This setup will help us demonstrate various operations using GROUP BY to avoid redundant creation.
We'll start by creating a table named student with columns for name, course_fee, and course. This table will hold information about students, the fees they pay, and the courses they are enrolled in.
Create Student Table
Let’s assume that we have Student Sample Table as follows, we will do some specific operations to learn about GROUP BY Clause in PL/SQL. Next, we'll insert some example records into the student table. Each record represents a student, their course fee, and the course they are taking. Here’s the sample data we’ll use:
CREATE TABLE student (
name VARCHAR(50),
course_fee DECIMAL(10,2),
course VARCHAR(50)
);
Insert Sample Data
INSERT INTO student (name, course_fee, course) VALUES
('Aleena', 5000.00, 'Mathematics'),
('Ben', 6000.50, 'English'),
('Chris', 7500.75, 'Science'),
('Darla', 4500.25, 'Mathematics'),
('Erie', 8000.00, 'English'),
('Monty', 6500.00, 'Science'),
('Aleena', 5500.50, 'Mathematics'),
('Chris', 7200.75, 'Science'),
('Darla', 4800.60, 'Mathematics'),
('Ben', 8300.00, 'English');
Output:
name | course_fee | course |
---|
Aleena | 5000.00 | Mathematics |
Ben | 6000.50 | English |
Chris | 7500.75 | Science |
Darla | 4500.25 | Mathematics |
Erie | 8000.00 | English |
Monty | 6500.00 | Science |
Aleena | 5500.50 | Mathematics |
Chris | 7200.75 | Science |
Darla | 4800.60 | Mathematics |
Ben | 8300.00 | English |
Example 1: Using SUM Function
Let's look at an PL/SQL GROUP BY query example that uses the SUM function. This PL/SQL GROUP BY example uses the SUM function to return the name of the student and the total course_fee (of the student).
The query groups the data by the name column, meaning it combines all records with the same student name. For each student, it then calculates the total of their course_fee values using the SUM function. The result is a list of student names along with the total fees they have paid.
Query:
SELECT name, SUM(course_fee) FROM student
GROUP BY name;
Output:
name | SUM(course_fee) |
---|
Aleena | 10500.50 |
Ben | 14300.50 |
Chris | 14701.50 |
Darla | 9300.85 |
Erie | 8000.00 |
Monty | 6500.00 |
Explanation: As we can see in the above output, the rows with duplicate NAMEs are grouped under the same NAME and their corresponding COURSE_FEE is the sum of the COURSE_FEE of duplicate rows. The SUM() function of PL/SQL is used here to calculate the sum. The NAMES that are added are Aleena, Chris, Darla, Ben.
Example 2: Using Count Function
The query groups the records in the Student table by course and name and then counts the number of occurrences (rows) for each combination of course and name. The COUNT(*) function counts the number of rows for each unique combination of name and course.
In this example, we use the COUNT function with the GROUP BY clause to count the number of records for each combination of course and name. This allows you to see how many times each student appears in the table for each course.
Query:
SELECT Name, course, Count(*)
FROM Student
GROUP BY Course, Name;
Output:
name | course | Count(*) |
---|
Aleena | Mathematics | 2 |
Ben | English | 2 |
Chris | Science | 2 |
Darla | Mathematics | 2 |
Erie | English | 1 |
Monty | Science | 1 |
Explanation:
The output shows each student’s name, the course they are enrolled in, and the count of how many times that student appears in the data for that course. This helps identify how many records there are for each student-course combination.
Example 3: Using MIN Function
Let's next look at how we could use the GROUP BY clause with the MIN function. The MIN(course_fee) function calculates the lowest course fee for each student within their respective group. Whereas in this example GROUP BY uses the MIN function to return the name of each Student and the minimum course_fee of the student.
Query:
SELECT Name, MIN(course_fee) AS "Lowest course_fee"
FROM student
GROUP BY name;
Output:
name | Lowest course_fee |
---|
Aleena | 5000.00 |
Ben | 6000.50 |
Chris | 7200.75 |
Darla | 4500.25 |
Erie | 8000.00 |
Monty | 6500.00 |
Explanation:
The output displays each student's name and the smallest course fee they have been charged. For example, if a student has multiple records with different fees, this query shows the lowest fee they paid. This helps to quickly identify the minimum fee each student has encountered.
Example 4: Using MAX Function
The GROUP BY clause groups the records by the course column. This means the query processes the data for each unique course separately. The MAX(course_fee) function calculates the highest course fee recorded for each course group.
It identifies the largest fee any student has paid for each course. Hence, Using the MAX function to find the maximum course_fee for each student grouped by the course. This query as an output returns the course and the highest course_fee recorded in that subject.
Query:
SELECT course, MAX(course_fee) AS "Highest course_fee"
FROM student
GROUP BY course;
Output:
course | Highest course_fee |
---|
Mathematics | 5500.50 |
English | 8300.00 |
Science | 7500.75 |
Explanation:
The output shows the maximum course fee for each course offered. For each course, it highlights the highest fee that any student has paid. For example, if the highest fee for the 'Mathematics' course is $5500.50, this value will be shown next to the course name. This helps to see the maximum expenditure for each course and understand which course has the highest fee paid by any student.
Key Features
- Order of Execution: The GROUP BY clause is placed after the WHERE clause and before the ORDER BY and HAVING clauses in a SQL query.
- Data Aggregation: The GROUP BY clause groups rows that have the same values in specified columns, enabling the use of aggregate functions like SUM, COUNT, AVG, MIN, and MAX to calculate summary values for each group.
- Multiple Columns Grouping: You can group data by one or more columns, allowing for more detailed and granular analysis.
- HAVING Clause Compatibility: Works with the HAVING clause, which is used to filter groups based on aggregate functions or conditions.
- NULL Value Grouping: All NULL values in a column are treated as a single group.
- Performance Optimization: Helps optimize queries when dealing with large datasets by reducing the number of rows processed through grouping.
- Subqueries: Can be used in subqueries for complex data analysis, allowing for nested grouping and aggregation.
- Result Order: The GROUP BY clause does not inherently order the results. If you need ordered results, you must use the ORDER BY clause.
Conclusion
The GROUP BY clause in PL/SQL is essential for organizing and summarizing data within a database. Through its ability to group records based on specified columns, it enables efficient aggregation of data, allowing us to compute totals, counts, minimums, and maximums across various categories.
In the context of the student table, the GROUP BY clause was instrumental in revealing insights such as the total course_fee per student, the highest and lowest course_fee in each course, and the frequency of occurrences within different categories.
Similar Reads
SQL Tutorial Structured Query Language (SQL) is the standard language used to interact with relational databases. Mainly used to manage data. Whether you want to create, delete, update or read data, SQL provides the structure and commands to perform these operations. Widely supported across various database syst
8 min read
Basics
What is SQL?Structured Query Language (SQL) is the standard language used to interact with relational databases. Allows users to store, retrieve, update, and manage data efficiently through simple commands. Known for its user-friendly syntax and powerful capabilities, SQL is widely used across industries. How D
6 min read
SQL Data TypesIn SQL, each column must be assigned a data type that defines the kind of data it can store, such as integers, dates, text, or binary values. Choosing the correct data type is crucial for data integrity, query performance and efficient indexing.Benefits of using the right data type:Memory-efficient
3 min read
SQL OperatorsSQL operators are symbols or keywords used to perform operations on data in SQL queries. Perform operations like calculations, comparisons, and logical checks.Enable filtering, calculating, and updating data in databases.Essential for query optimization and accurate data management.Types of SQL Oper
5 min read
SQL Commands | DDL, DQL, DML, DCL and TCL CommandsSQL commands are the fundamental building blocks for communicating with a database management system (DBMS). It is used to interact with the database with some operations. It is also used to perform specific tasks, functions, and queries of data. SQL can perform various tasks like creating a table,
7 min read
SQL Database OperationsSQL databases or relational databases are widely used for storing, managing and organizing structured data in a tabular format. These databases store data in tables consisting of rows and columns. SQL is the standard programming language used to interact with these databases. It enables users to cre
3 min read
SQL CREATE TABLECreating a table is one of the first and most important steps in building a database. The CREATE TABLE command in SQL defines how your data will be stored, including the table name, column names, data types, and rules (constraints) such as NOT NULL, PRIMARY KEY, and CHECK.Defines a new table in the
3 min read
Queries & Operations
SQL SELECT QuerySQL SELECT is used to retrieve data from one or more tables, either all records or specific results based on conditions. It returns the output in a tabular format of rows and columns.Extracts data from tables.Targets specific or all columns (*).Supports filtering, sorting, grouping, and joins.Result
3 min read
SQL INSERT INTO StatementThe INSERT INTO statement in SQL is used to add new rows to an existing table, whether for all columns, specific columns or by copying from another table. It is an essential command for populating databases with relevant records like customers, employees, or students.Insert data into all or selected
4 min read
SQL UPDATE StatementThe UPDATE statement in SQL is used to modify existing records in a table without deleting them. It allows updating one or multiple columns, with or without conditions, to keep data accurate and consistent.Change specific column values in selected rowsApply targeted updates using WHEREUpdate single
4 min read
SQL DELETE StatementThe SQL DELETE statement is used to remove specific rows from a table while keeping the table structure intact. It is different from DROP, which deletes the entire table.Removes rows based on conditions.Retains table schema, constraints, and indexes.Can delete a single row or all rows.Useful for cle
3 min read
SQL | WHERE ClauseIn SQL, the WHERE clause is used to filter rows based on specific conditions. Whether you are retrieving, updating, or deleting data, WHERE ensures that only relevant records are affected. Without it, your query applies to every row in the table! The WHERE clause helps you:Filter rows that meet cert
3 min read
SQL | AliasesIn SQL, aliases are temporary names given to columns or tables to make queries easier to read and write. They donât change the actual names in the database and exist only for the duration of that query.Make long or complex names readableSimplify joins and subqueriesImprove clarity in result setsAvoi
3 min read
SQL Joins & Functions
SQL Joins (Inner, Left, Right and Full Join)SQL joins are fundamental tools for combining data from multiple tables in relational databases. For example, consider two tables where one table (say Student) has student information with id as a key and other table (say Marks) has information about marks of every student id. Now to display the mar
4 min read
SQL CROSS JOINIn SQL, the CROSS JOIN is a unique join operation that returns the Cartesian product of two or more tables. This means it matches each row from the left table with every row from the right table, resulting in a combination of all possible pairs of records. In this article, we will learn the CROSS JO
3 min read
SQL | Date Functions (Set-1)SQL Date Functions are essential for managing and manipulating date and time values in SQL databases. They provide tools to perform operations such as calculating date differences, retrieving current dates and times and formatting dates. From tracking sales trends to calculating project deadlines, w
5 min read
SQL | String functionsSQL String Functions are powerful tools that allow us to manipulate, format, and extract specific parts of text data in our database. These functions are essential for tasks like cleaning up data, comparing strings, and combining text fields. Whether we're working with names, addresses, or any form
7 min read
Data Constraints & Aggregate Functions
SQL NOT NULL ConstraintIn SQL, constraints are used to enforce rules on data, ensuring the accuracy, consistency, and integrity of the data stored in a database. One of the most commonly used constraints is the NOT NULL constraint, which ensures that a column cannot have NULL values. This is important for maintaining data
3 min read
SQL PRIMARY KEY ConstraintThe PRIMARY KEY constraint in SQL is one of the most important constraints used to ensure data integrity in a database table. A primary key uniquely identifies each record in a table, preventing duplicate or NULL values in the specified column(s). Understanding how to properly implement and use the
5 min read
SQL Count() FunctionIn the world of SQL, data analysis often requires us to get counts of rows or unique values. The COUNT() function is a powerful tool that helps us perform this task. Whether we are counting all rows in a table, counting rows based on a specific condition, or even counting unique values, the COUNT()
7 min read
SQL SUM() FunctionThe SUM() function in SQL is one of the most commonly used aggregate functions. It allows us to calculate the total sum of a numeric column, making it essential for reporting and data analysis tasks. Whether we're working with sales data, financial figures, or any other numeric information, the SUM(
5 min read
SQL MAX() FunctionThe MAX() function in SQL is a powerful aggregate function used to retrieve the maximum (highest) value from a specified column in a table. It is commonly employed for analyzing data to identify the largest numeric value, the latest date, or other maximum values in various datasets. The MAX() functi
4 min read
AVG() Function in SQLSQL is an RDBMS system in which SQL functions become very essential to provide us with primary data insights. One of the most important functions is called AVG() and is particularly useful for the calculation of averages within datasets. In this, we will learn about the AVG() function, and its synta
4 min read
Advanced SQL Topics
SQL SubqueryA subquery in SQL is a query nested within another SQL query. It allows you to perform complex filtering, aggregation, and data manipulation by using the result of one query inside another. Subqueries are often found in the WHERE, HAVING, or FROM clauses and are supported in SELECT, INSERT, UPDATE,
5 min read
Window Functions in SQLSQL window functions are essential for advanced data analysis and database management. It is a type of function that allows us to perform calculations across a specific set of rows related to the current row. These calculations happen within a defined window of data and they are particularly useful
6 min read
SQL Stored ProceduresStored procedures are precompiled SQL statements that are stored in the database and can be executed as a single unit. SQL Stored Procedures are a powerful feature in database management systems (DBMS) that allow developers to encapsulate SQL code and business logic. When executed, they can accept i
7 min read
SQL TriggersA trigger is a stored procedure in adatabase that automatically invokes whenever a special event in the database occurs. By using SQL triggers, developers can automate tasks, ensure data consistency, and keep accurate records of database activities. For example, a trigger can be invoked when a row i
7 min read
SQL Performance TuningSQL performance tuning is an essential aspect of database management that helps improve the efficiency of SQL queries and ensures that database systems run smoothly. Properly tuned queries execute faster, reducing response times and minimizing the load on the serverIn this article, we'll discuss var
8 min read
SQL TRANSACTIONSSQL transactions are essential for ensuring data integrity and consistency in relational databases. Transactions allow for a group of SQL operations to be executed as a single unit, ensuring that either all the operations succeed or none of them do. Transactions allow us to group SQL operations into
8 min read
Database Design & Security
Introduction of ER ModelThe Entity-Relationship Model (ER Model) is a conceptual model for designing a databases. This model represents the logical structure of a database, including entities, their attributes and relationships between them. Entity: An objects that is stored as data such as Student, Course or Company.Attri
10 min read
Introduction to Database NormalizationNormalization is an important process in database design that helps improve the database's efficiency, consistency, and accuracy. It makes it easier to manage and maintain the data and ensures that the database is adaptable to changing business needs.Database normalization is the process of organizi
6 min read
SQL InjectionSQL Injection is a security flaw in web applications where attackers insert harmful SQL code through user inputs. This can allow them to access sensitive data, change database contents or even take control of the system. It's important to know about SQL Injection to keep web applications secure.In t
7 min read
SQL Data EncryptionIn todayâs digital era, data security is more critical than ever, especially for organizations storing the personal details of their customers in their database. SQL Data Encryption aims to safeguard unauthorized access to data, ensuring that even if a breach occurs, the information remains unreadab
5 min read
SQL BackupIn SQL Server, a backup, or data backup is a copy of computer data that is created and stored in a different location so that it can be used to recover the original in the event of a data loss. To create a full database backup, the below methods could be used : 1. Using the SQL Server Management Stu
4 min read
What is Object-Relational Mapping (ORM) in DBMS?Object-relational mapping (ORM) is a key concept in the field of Database Management Systems (DBMS), addressing the bridge between the object-oriented programming approach and relational databases. ORM is critical in data interaction simplification, code optimization, and smooth blending of applicat
7 min read