Mastering SQL Queries: Grouping, Aggregates, and Subqueries

Table of Contents:

SQL Queries, Ordered Results
Aggregate Functions in SQL, Aggregate Examples
Eliminating Duplicates, Computing Counts
Grouping and Aggregates, Filtering Tuples & Results
The HAVING Clause
Nested Subqueries, Kinds of Subqueries
Subqueries in WHERE Clause
Comparison with Subquery Result
Set Membership Tests, Correlated Subqueries
Subqueries in FROM Clause

Overview

This concise guide by Donnie Pinkston focuses on practical techniques for writing robust SQL queries that summarize, filter, and nest data retrieval logic. It emphasizes grouping and aggregate functions, the design and use of subqueries (including derived relations and correlated subqueries), and performance-aware patterns you can apply across common DBMS platforms.

What you will learn

How to summarize data using GROUP BY with aggregate functions like COUNT, SUM, and AVG.
When and how to use HAVING versus WHERE to filter groups or rows.
How to construct nested subqueries in WHERE, FROM, and SELECT clauses, and when correlated subqueries are appropriate.
Techniques for creating derived relations (subqueries in the FROM clause) to simplify complex logic.
Basic portability and syntax differences across DBMS implementations, and practical tips to avoid common performance pitfalls.

Core topics covered (in plain language)

The guide explores grouping and aggregation as foundational tools for turning row-level data into summary insights—such as counts by city or summed sales by category—and shows how HAVING filters those summaries after aggregation. It then progresses to nested subqueries: using inner queries to feed conditions or to build temporary result sets that the main query can treat like a table. Derived relations are presented as a way to break complex problems into manageable steps, improving clarity and maintainability.

Practical examples demonstrate how to replace inefficient nested constructs with joins or common table expressions where appropriate, and the text notes DBMS-specific behaviors that affect portability and execution. Throughout, the emphasis is on clear, testable queries that balance readability with performance.

Learning outcomes

Write grouped queries to aggregate and summarize data for reporting and analytics.
Apply HAVING and WHERE correctly to achieve desired filtering semantics.
Design and use nested and correlated subqueries for conditional logic that depends on other query results.
Create derived relations to decompose complex queries into logical stages.
Recognize common performance issues with subqueries and learn alternatives (joins, CTEs, indexing strategies).

Practical applications and examples

Examples are contextualized for real-world use: sales aggregation by product category, customer counts by city, inventory monitoring, and employee performance summaries. Each example includes a runnable SQL template (for instance, SELECT product_category, SUM(sales_amount) FROM sales GROUP BY product_category;) and suggestions for adapting the pattern to similar problems.

How to use this guide effectively

Start with the key concepts and glossary to align terminology.
Type and run the provided examples in your DBMS to observe results and behavior.
Refactor examples into your own schemas and datasets to practice portability and optimization.
Compare nested subqueries with joins and CTEs to evaluate readability and performance for your workload.

Who should read this

This guide is suitable for beginners who want clear, example-driven explanations and for intermediate practitioners seeking better query patterns and performance awareness. It’s particularly useful for analysts, developers, and database students who regularly work with aggregate reporting or need to express multi-step data logic in SQL.

Exercises and project ideas

The guide includes hands-on exercises such as counting customers by region with filters, building a sales analysis query set for dashboarding, and creating derived relations to simplify multi-stage reports. Each exercise encourages experimentation with both nested queries and join-based alternatives to build intuition about trade-offs.

Quick glossary (selected)

Aggregation: Summarizing rows with functions like SUM or COUNT.
Derived relation: A temporary table produced by a subquery in the FROM clause.
Correlated subquery: A subquery that references columns from the outer query and executes per row.

Final note

Clear examples and practical advice make this guide a useful reference for improving everyday SQL: from writing accurate grouped reports to safely using nested queries without sacrificing performance.