What is the primary difference between WHERE and HAVING clauses in SQL?

The WHERE clause filters rows before any groupings are made, making it suitable for row-level operations. In contrast, the HAVING clause filters groups after aggregation has been performed.

How can you use the QUALIFY clause with window functions in SQL?

The QUALIFY clause is used with window functions to filter their results after computation. It allows for precise data retrieval by applying conditions to the output of window functions such as ROW_NUMBER(), RANK(), and DENSE_RANK().

What are the key syntax elements of the WHERE clause?

The key syntax elements of the WHERE clause include the SELECT statement to specify the columns to retrieve, the FROM clause to indicate the source table, and the WHERE clause to define the conditions that records must meet to be included in the results.

When should you use the WHERE clause instead of the HAVING clause for filtering data?

The WHERE clause should be used instead of the HAVING clause for filtering data when the conditions apply to individual rows before grouping or aggregation occurs.

What are common errors to avoid when using clauses in SQL?

Common errors to avoid when using clauses in SQL include using the HAVING clause without a GROUP BY clause, misordering SQL statements, incorrectly using window functions without the QUALIFY clause, unnecessarily using SELECT *, which retrieves all columns and can degrade performance, and failing to index columns used in WHERE conditions.

How can you optimize SQL queries for better performance using the EXPLAIN statement?

You can optimize SQL queries for better performance using the EXPLAIN statement by analyzing the execution plan provided by the database. The EXPLAIN statement reveals how the database processes a query, highlighting potential bottlenecks and inefficiencies.

Which SQL clause is used to filter records?

The WHERE clause is used to filter records in SQL. It allows you to specify conditions that individual rows must meet to be included in the result set. If you need to filter grouped data after using aggregate functions, you should use the HAVING clause instead.

A Complete Guide to WHERE, HAVING, and QUALIFY in SQL

Google BigQuery

SQL

Anna Panchenko

,

Digital Analyst

Vadym Kramarenko

,

Growth Marketing Manager

April 23, 2025

Filtering data in SQL is essential for data analysts, SQL developers, and BI professionals, and understanding the differences between the WHERE, HAVING, and QUALIFY clauses in an SQL statement can enhance your ability to write efficient and accurate queries.

Mastering WHERE, HAVING, and QUALIFY clauses boosts query performance and ensures accurate data retrieval. This guide offers practical insights for data analysts, SQL developers, engineers, and BI professionals to improve filtering, optimize queries, and work more efficiently with complex datasets.

Note: This article on SQL filtering clauses, WHERE, HAVING, and QUALIFY, was originally published in May 2024 and fully updated in April 2025 to reflect the latest syntax rules, BigQuery support, and best practices for writing efficient SQL queries.

Uses of SQL Filtering Clauses and Operators

SQL filtering clauses help narrow down results by applying conditions to rows or groups of data. Understanding how the operators work is key to writing efficient queries.

The WHERE clause is used to filter individual rows before any grouping or aggregation takes place.
In contrast, the HAVING clause filters grouped records after aggregation. HAVING is often used alongside aggregate functions like SUM, COUNT, or AVG, allowing you to include only those groups that meet specific summary conditions.
The QUALIFY clause is used to filter results after window functions are applied. It lets you write cleaner queries by avoiding subqueries when filtering on ROW_NUMBER(), RANK(), or other window function results.

Understanding the WHERE Clause

The WHERE clause filters individual rows in an SQL query before grouping. It specifies the conditions that must be met for the records to be included in the result set. The WHERE clause is crucial when you need to fetch data that meets specific conditions from a table. By filtering rows early in the query process, WHERE helps in reducing the data set size, thus improving query performance. Tools that support features like SQL Server autocomplete can further streamline this process by reducing syntax errors and speeding up query writing.

The evaluation of a query with a WHERE clause typically follows this order:

FROM
WHERE
GROUP BY and aggregation
HAVING
WINDOW
QUALIFY
DISTINCT
ORDER BY
LIMIT

WHERE Clause is applicable in SELECT, UPDATE, and DELETE statements, making it one of SQL’s most fundamental and frequently used clauses.

Syntax of WHERE Clause

SELECT column1, column2, ... columnN
FROM table_name
WHERE condition;

Here:

SELECT column1, column2, ... columnN: This part of the query specifies the columns you want to retrieve from the table.
FROM table_name: Indicates the table from which to retrieve the data.
WHERE condition: Applies a condition that each row must meet to be included in the result set. You can use comparison operators like =, <, >, LIKE, IN, and others to filter the data.

Example of WHERE Clause

The WHERE clause filters records based on specified conditions. For example, we want to retrieve the names and salaries of employees who earn more than 3,000.

SELECT 
employee_id, name, salary
FROM 
`owox-analytics.myDataset.employee_data`
WHERE salary > 3000;

Here:

SELECT employee_id, name, salary: Specifies that we want to retrieve the employee name and salary columns from the table.
FROM employees: Specifies the employees table as the data source, where each row represents an employee record.
WHERE salary > 3000: Filters the rows to include only those with a salary greater than 3,000. This ensures that only employees with a salary above this threshold are included in the result set.

This example demonstrates how the WHERE clause efficiently filters data, returning only employees with salaries exceeding 3,000. Using the WHERE clause helps streamline data retrieval and enhances query performance.

Exploring the HAVING Clause

The HAVING clause filters results after the data has been grouped in an SQL query. Unlike WHERE, which filters rows before grouping, HAVING applies conditions to the grouped results. It's useful when working with aggregate functions, such as SUM, COUNT, or AVG.

Typically, SQL evaluates queries with a HAVING clause in this order:

FROM
WHERE
GROUP BY and aggregation
HAVING
WINDOW
QUALIFY
DISTINCT
ORDER BY
LIMIT

Applying HAVING allows you to refine your results based on aggregated data, making it a powerful tool for complex queries involving grouped data.

Syntax of HAVING Clause

The syntax of the HAVING clause allows you to apply conditions to groups of data created by the GROUP BY clause.

SELECT column1, function_name(column2)
FROM table_name
WHERE condition
GROUP BY column1, column2
HAVING condition

Here:

SELECT column1, function_name(column2): Specifies the columns to retrieve and the aggregate function to apply, such as COUNT, SUM, AVG, MAX, or MIN.
FROM table_name: Indicates the table from which to retrieve the data.
WHERE condition: Filters rows before grouping, similar to the standard WHERE clause.
GROUP BY column1, column2: Groups the result set by one or more columns.
HAVING condition: Filters grouped data based on the results of aggregate functions and is used only in SELECT statements.

Example of HAVING Clause

Let’s consider an example where we want to find departments with more than 3 employees, showing the department name and the number of employees.

SELECT department, COUNT(employee_id)
FROM `owox-analytics.myDataset.employee_data`
WHERE status = 'active'
GROUP BY department
HAVING COUNT(employee_id) > 3;

Illustration showing the application of the HAVING clause in SQL to filter aggregated data.

Here:

SELECT department, COUNT(employee_id): Retrieves the department name and the count of employees in each department.
FROM owox-analytics.myDataset.employee_data: Specifies that the data is being retrieved from the employees table.
WHERE status = ‘active’: Filters the rows to include only active employees before any grouping.
GROUP BY department: Groups the results by the department column.
HAVING COUNT(employee_id) > 3: Filters the grouped results to include only those departments with more than 3 employees.

This example shows how the HAVING clause filters grouped data to display only departments with more than three active employees. It’s a powerful tool for refining results based on aggregate functions, ensuring the output includes only relevant groups for analysis.

Mastering the QUALIFY Clause

The QUALIFY SQL clause filters the results of window functions. It is particularly useful in databases like BigQuery. Unlike WHERE and HAVING, which filter rows and groups, QUALIFY works on the results generated by window functions, providing an additional layer of row-level filtering. In other words, QUALIFY is to window functions what HAVING is to GROUP BY and WHERE is to FROM.

The steps in running a query with a QUALIFY clause usually go like this:

FROM clause
WHERE clause
GROUP BY clause
HAVING clause.
WINDOW functions
QUALIFY clause
DISTINCT clause
ORDER BY clause
LIMIT clause

Applying the QUALIFY clause allows you to efficiently filter and rank data within partitions, enabling more complex and precise data analysis.

Syntax of QUALIFY Clause

SELECT column1, window_function(column2) 
     OVER (partition_order_clause)
FROM table_name
WHERE condition
QUALIFY bool_expression;

Here:

SELECT: Specifies the columns and window functions you want to retrieve.
column1: Regular column to be selected.
window_function(column2) OVER (partition_order_clause): A window function applied to column2 with specific partitioning and ordering as defined in partition_order_clause.
FROM: Specifies the table from which to retrieve the data.
table_name: The name of the table.
WHERE: Filters rows before applying window functions.
condition: A condition to filter rows before window functions are applied.
QUALIFY: Filters the results of window functions based on a boolean expression.
bool_expression: A boolean expression used to filter rows based on the result of window functions.

The BigQuery syntax shows that the QUALIFY clause comes after HAVING and before WINDOW, highlighting its role in filtering results from window functions.

Example of QUALIFY Clause

SELECT 
region, salesperson, total_sales, 
RANK() OVER (
    PARTITION BY region 
    ORDER BY total_sales DESC) 
    as sales_rank
FROM `owox-analytics.myDataset.salesperson_data`
QUALIFY sales_rank = 1;

Image demonstrating the use of the QUALIFY clause in SQL for filtering query results.

The QUALIFY clause filters the results of window functions in SQL queries. Here is an example demonstrating its usage to retrieve the top salesperson in each region based on total sales.

Here:

SELECT region, salesperson, total_sales, RANK() OVER (PARTITION BY region ORDER BY total_sales DESC) as sales_rank: Selects the region, salesperson, and total sales, and ranks the salespeople within each region based on total sales in descending order.
FROM owox-analytics.myDataset.salesperson_data: Specifies the salesperson_data table as the data source.
QUALIFY sales_rank = 1: Filters the results to include only the top-ranked salesperson (highest total sales) in each region.

This example demonstrates how the QUALIFY clause can be used to filter window function results, retrieving only the top-ranked salesperson in each region based on their total sales. The QUALIFY clause simplifies complex filtering tasks and enhances the precision of your data analysis.

Key Differences Between WHERE VS. HAVING VS. QUALIFY

Understanding the differences between the WHERE, HAVING, and QUALIFY clauses is crucial for writing efficient and accurate SQL queries. Knowing how each works, especially the WHERE vs HAVING SQL logic, helps you apply the right filter at the right stage of query execution.

WHERE Clause

Usage: Filters records in a table based on a condition.
Operations: Implements row operations.
Statements Applicable: Can be used with SELECT, UPDATE, and DELETE statements.
Requirement for GROUP BY: Not required.
Names Used: Use the original name of a field.

HAVING Clause

Usage: Filters groups defined by a GROUP BY clause based on a condition.
Operations: Implements column operations on aggregated data.
Statements Applicable: Can only be used with a SELECT statement.
Requirement for GROUP BY: Required.
Names Used: Can use the original name or an alias of a field.

QUALIFY Clause

Usage: Filters the results of window functions.
Operations: Implements operations on the results of window functions.
Statements Applicable: Can only be used with a SELECT statement.
Requirement for GROUP BY: Not required.
Names Used:
- Use an alias of a field if the window function is used in a SELECT statement.
- Use the original name if the window function is used in the QUALIFY clause.

Basic Applications of WHERE, HAVING, and QUALIFY

Each SQL filtering clause- WHERE, HAVING, and QUALIFY serves a distinct purpose in refining data. Knowing their basic uses helps write more accurate and faster queries.

Using the WHERE Clause for Filtering Numeric Data

The WHERE clause filters numeric data by setting specific conditions that numeric fields must meet to be included in the results. It helps refine datasets to include only records matching the defined numerical criteria, ensuring the retrieved data is relevant and precise based on the specified conditions.

Example:

Suppose we want to retrieve all employees from a company database who earn a salary greater than 3,000. Here's how you can do it using the WHERE clause:

SELECT 
employee_id, name, salary
FROM 
`owox-analytics.myDataset.employee_data`
WHERE salary > 3000;

Here:

SELECT employee_id, name, salary: Specifies the columns to be retrieved.
FROM owox-analytics.myDataset.employee_data: Specifies the table from which to retrieve the data.
WHERE salary > 3000: Applies the condition to filter records where the salary is greater than 3,000.

In this example, the query will return a list of all employees who earn more than 3,000, displaying their name, employee ID, and salary. This type of filtering is essential for data analysis tasks that require focusing on specific numeric criteria.

Using the WHERE Clause for Filtering Text Data

The WHERE clause also filters text data by specifying conditions that textual fields must meet to be included in the results. This is particularly useful for narrowing down datasets based on specific text patterns or exact matches. By using conditions such as equality, LIKE, or even pattern matching with wildcards, the WHERE clause helps in retrieving records that contain particular text strings or match specific text criteria.

Example:

SQL query in using the WHERE clause for filtering text data

Suppose we want to find all employees whose job title includes 'Manager'. We can write the following SQL query using the LIKE operator condition for the WHERE clause:

SELECT *
FROM `owox-analytics.myDataset.employee_data`
WHERE job_title LIKE '%Manager%';

Here:

SELECT *: Selects all columns from the table.
FROM owox-analytics.myDataset.employee_data: Specifies the table from which to retrieve the data.
WHERE job_title LIKE '%Manager%': Applies the condition to filter records where the job title contains the word 'Manager'. The % sign allows any characters before and after 'Manager'.

In this example, the query will return all employees whose job titles include the word 'Manager'. This type of filtering is particularly useful for identifying records based on specific text patterns within a dataset.

Using Qualify with ROW_NUMBER()

The QUALIFY clause filters the results of window functions like ROW_NUMBER() in SQL queries. By applying QUALIFY, you can narrow down the results based on the ranking or ordering defined by the window function. This is particularly useful for retrieving specific rows from each data partition, such as the top-ranked or bottom-ranked entries.

Example:

Suppose we want to find each customer's top three sales transactions based on the transaction amount. We can achieve this using the ROW_NUMBER() function combined with the BigQuery QUALIFY clause.

SELECT userId, transaction_id, sales_amount, transaction_date
FROM `owox-analytics.myDataset.orders_data`
QUALIFY ROW_NUMBER() 
OVER (PARTITION BY userId 
    ORDER BY sales_amount DESC) <= 3;

The application of the QUALIFY clause with the ROW_NUMBER() function.

Here:

SELECT userId, transaction_id, sales_amount, transaction_date: Selects the columns to be retrieved.
FROM owox-analytics.myDataset.orders_data: Specifies the table from which to retrieve the data.
QUALIFY ROW_NUMBER() OVER (PARTITION BY userId ORDER BY sales_amount DESC) <= 3: Assigns a row number to each transaction for a customer, ordered by transaction amount in descending order, and filters to include only the top three transactions per customer.

In this example, the query will return each customer's top three sales transactions based on the highest transaction amounts. This type of filtering is useful for analyzing the most significant transactions within each customer group.

Advanced Techniques for Applying WHERE, HAVING, and QUALIFY

Advanced techniques for applying the WHERE, HAVING, and QUALIFY clauses can significantly enhance the efficiency and precision of your SQL queries. By understanding and leveraging these advanced methods, you can perform more complex data filtering, aggregation, and ranking operations, leading to more insightful and optimized data analysis.

Using Multiple Aggregate Functions in HAVING Clause

The HAVING clause can handle multiple aggregate functions simultaneously to refine group-based data analysis. You can filter groups more precisely by applying conditions on multiple aggregates, such as COUNT, SUM, and AVG. This allows for more complex queries, ensuring that only groups meeting all specified aggregate criteria are included in the results.

Example:

Let's analyze department performance in a company by calculating the average and total salary per department. We want to identify departments with total salary greater than 16,000 but an average salary per employee less than 3,500.

SELECT department,
COUNT(*) AS number_of_employees,
SUM(salary) AS total_sales,
AVG(salary) AS average_sales_per_employee
FROM `owox-analytics.myDataset.employee_data`
GROUP BY department
HAVING SUM(salary) > 16000 AND AVG(salary) < 3500;

Visualization of multiple aggregate functions applied within the HAVING clause in SQL.

Here:

SELECT department, COUNT(*) AS number_of_employees, SUM(salary) AS total_sales, AVG(salary) AS average_sales_per_employee: Retrieves the department name, counts the number of employees, sums up the salary, and calculates the average salary per employee.
FROM owox-analytics.myDataset.employee_data: Specifies the table from which to retrieve the data.
GROUP BY department: Groups the results by department.
HAVING SUM(salary) > 16000 AND AVG(salary) < 3500: Filters the groups to include only those departments where the total salary are greater than 16,000 and the average salary per employee are less than 3,500.

In this example, the query will return departments that meet both criteria, providing a more nuanced view of department performance based on salary metrics.

Using WHERE and HAVING in the Same SQL Query

Using both WHERE and HAVING makes filtering more accurate and efficient. WHERE filters rows before grouping, and HAVING filters after aggregation. The key difference is in the order: WHERE filters rows, HAVING filters grouped results.

Example:

Let’s perform a query to get the total salary and average salary per employee for departments where the employee age is 32 or more and that have more than 2 employees:

SELECT department,
COUNT(*) AS number_of_employees,
SUM(salary) AS total_sales,
AVG(salary) AS average_sales_per_employee
FROM `owox-analytics.myDataset.employee_data`
WHERE age >= 32
GROUP BY department
HAVING COUNT(*) > 2;

Here:

SELECT department, COUNT(*) AS number_of_employees, SUM(salary) AS total_sales, AVG(salary) AS average_sales_per_employee: Retrieves the department name, counts the number of employees, sums up the salary, and calculates the average salary per employee.
FROM owox-analytics.myDataset.employee_data: Specifies the table from which to retrieve the data.
WHERE age >= 32: Filters the rows to include only those where the employee age is 32 or more.
GROUP BY department: Groups the results by department.
HAVING COUNT(*) > 2: Filters the groups to include only those departments with more than two employees.

In this example, the query will return departments where the employee age is 32 or more with more than two employees and their total and average salary, into department performance within that specific age.

Using Subqueries with HAVING Clause

Subqueries can be embedded within the HAVING clause to perform calculations or retrieve aggregated values that can be used to refine the final results. This method enables you to compare group-level aggregates against dynamically calculated thresholds, providing a powerful way to apply complex conditions to grouped data.

Example:

SELECT
  region,
  COUNT(*) AS number_of_orders,
  SUM(sales_amount) AS total_sales,
  AVG(sales_amount) AS average_sales_per_order
FROM `owox-analytics.myDataset.orders_data`
GROUP BY region
HAVING SUM(sales_amount) < (
  SELECT
    SUM(sales_amount) / COUNT(DISTINCT region)
  FROM `owox-analytics.myDataset.orders_data`);

The use of subqueries with the HAVING clause for filtering results in SQL.

Let's build a query that returns departments with total sales lower than the average sales. The subquery obtains the average sales.

Here:

SELECT region, COUNT(*) AS number_of_employees, SUM(sales_amount) AS total_sales, AVG(sales_amount) AS average_sales_per_order: Retrieves the region, counts the number of orders, sums up the sales, and calculates the average sales per order.
FROM owox-analytics.myDataset.orders_data: Specifies the table from which to retrieve the data.
GROUP BY region: Groups the results by region.
HAVING SUM(sales_amount) < (SELECT SUM(sales_amount) / COUNT(distinct region) FROM owox-analytics.myDataset.orders_data): Filters the groups to include only those regions with total sales lower than the average sales, as calculated by the subquery.

In this example, the query will return region with total sales lower than the average sales, providing insights into underperforming regions.

Using QUALIFY with RANK or PERCENTILE_CONT

The QUALIFY clause can be used with window functions like RANK or PERCENTILE_CONT to filter the results based on specific ranking or percentile conditions. This lets you identify top performers, outliers, or any specific percentile group within your data. By applying QUALIFY, you can efficiently narrow down the dataset to include only those rows that meet the desired criteria, enhancing the precision and relevance of your query results.

Example:

Using the RANK function, let's write a query that returns the top 5 highest-paid employees within each department.

SELECT 
employee_id, department, salary,
RANK() 
OVER (PARTITION BY department ORDER BY salary DESC) 
AS salary_rank
FROM `owox-analytics.myDataset.employee_data`
QUALIFY salary_rank <= 5;

Using the QUALIFY clause with the RANK or PERCENTILE_CONT functions in SQL.

Here:

SELECT employee_id, department, salary, RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS salary_rank: Selects the employee id, department, salary, and ranks employees within each department based on their salary in descending order.
FROM owox-analytics.myDataset.employee_data: Specifies the table from which to retrieve the data.
QUALIFY salary_rank <= 5: Filters the results to include only the top 5 employees with the highest salaries in each department.

In this example, the query will return the top 5 highest-paid employees for each department, providing insights into the highest earners within each department. Using the QUALIFY clause with the RANK function, you can focus on key data points, making your analysis more targeted and actionable.

Using QUALIFY with ROW_NUMBER() in a Subquery

The QUALIFY clause can be effectively used with the ROW_NUMBER() function in a subquery to filter specific rows based on their ranking within a partition. This approach allows for precise control over which rows to include in the final result set, such as selecting the first occurrence of each group or identifying specific rank positions.

Example:

Let's say you have a large table of sales transactions and need to retrieve each customer's most recent transaction within the last 6 months. A direct query filtering all data would be inefficient, so you use a subquery with ROW_NUMBER() and QUALIFY.

SELECT
    Customers.user_id, Customers.name, 
    Orders.transaction_id, Orders.transaction_date, 
    Orders.sales_amount
FROM
(SELECT user_id, name
FROM `owox-analytics.myDataset.customers`)  
    AS Customers
LEFT JOIN
(SELECT userId, transaction_id, transaction_date, sales_amount
FROM `owox-analytics.myDataset.orders_data`
WHERE transaction_date BETWEEN CURRENT_DATE() - 
INTERVAL 180 DAY AND CURRENT_DATE()
QUALIFY ROW_NUMBER() 
OVER (PARTITION BY userId ORDER BY transaction_date DESC) = 1) 
AS Orders
ON
Customers.user_id=Orders.userId;

The use of the QUALIFY clause with the ROW_NUMBER() function in a subquery.

Here:

SELECT user_id, name FROM owox-analytics.myDataset.customers AS Customers: Selects all user_id and name columns from the customers table with an alias Customers.
LEFT JOIN (SELECT userId, transaction_id, transaction_date, sales_amount FROM `owox-analytics.myDataset.orders_data` WHERE transaction_date BETWEEN CURRENT_DATE() - INTERVAL 180 DAY AND CURRENT_DATE(): Selects sales transactions that occurred within the last 180 days by filtering the transaction_date to be between the current date minus 180 days and the current date.
QUALIFY ROW_NUMBER() OVER (PARTITION BY userId ORDER BY transaction_date DESC) = 1) AS Orders: The ROW_NUMBER() function assigns a unique row number to each transaction per customer, ordered by the transaction_date in descending order.
ON Customers.user_id = Orders.userId: Joins the customers table with the subquery result on the user id.

In this example, the query retrieves each customer's most recent sales transaction within the last 6 months, optimizing performance and ensuring efficient data handling. This technique is particularly useful for dealing with large datasets and complex filtering requirements.

Common Errors with WHERE, HAVING, and QUALIFY Clauses

Using WHERE, HAVING, and QUALIFY effectively requires understanding their distinct roles in SQL queries. Misusing them can lead to inefficiencies or incorrect results. Knowing when and how each clause filters data, whether at the row, group, or window function level, helps ensure the accuracy of queries.

Correct Order of SQL Statements

Ensuring the correct order of SQL statements is crucial for accurate query results. Placing WHERE, GROUP BY, and HAVING clauses in the wrong order can lead to syntax errors or incorrect outcomes.

⚠️ Error: Placing SQL clauses in the wrong order can cause errors. WHERE should come before GROUP BY, and HAVING should follow it. Getting this wrong may lead to syntax or logic issues.

📖 Example: Consider a scenario where you need to filter rows based on a condition, group them by a column, and then apply an aggregate function condition. The correct order of the SQL statements ensures the query runs properly.

SELECT column1, COUNT(column2) 
    as count_column2
FROM my_table
HAVING COUNT(column2) > 10
WHERE condition
GROUP BY column1
ORDER BY column1;

In this incorrect example, the HAVING clause is placed before the WHERE and GROUP BY clauses, which is not valid SQL syntax.

✅ Solution: To avoid these common errors, it is essential to follow the correct sequence of SQL statements. This not only ensures accurate data filtering and aggregation but also enhances query performance and readability.

Always ensure your query follows this correct order:

SELECT
FROM
WHERE
GROUP BY
HAVING
ORDER BY

SELECT column1, 
COUNT(column2) as count_column2
FROM my_table
WHERE condition
GROUP BY column1
HAVING COUNT(column2) > 10
ORDER BY column1;

Grouping Data Correctly

When using the HAVING clause, it is crucial to group your data correctly using the GROUP BY clause. Failing to do so will result in errors and prevent your query from executing properly.

⚠️ Error: When applying the HAVING clause without first grouping the data with GROUP BY, the query will fail. The HAVING clause is intended to filter aggregated results, and without grouping, there are no aggregates to filter.

📖 Example: Consider a scenario where you need to count the number of products in each category and filter categories with more than five products. The query must include GROUP BY to work correctly.

SELECT category, 
COUNT(product_id) as product_count
FROM products
HAVING COUNT(product_id) > 5;

In this incorrect example, the HAVING clause is used without the GROUP BY clause, resulting in an error.

✅ Solution: To avoid such errors, it is essential to remember that the HAVING clause is meant to work with grouped data. Always ensure that the HAVING clause is paired with the GROUP BY clause. This ensures that the data is correctly grouped before applying the aggregate condition.

SELECT category, 
COUNT(product_id) as product_count
FROM products
GROUP BY category
HAVING COUNT(product_id) > 5;

Ensuring Consistency Between SELECT and GROUP BY

Matching columns in the SELECT statement with those in the GROUP BY clause is essential for accurate data aggregation. Failing to include nonaggregated columns from the SELECT statement in the GROUP BY clause will result in errors.

⚠️ Error: The nonaggregated columns in the SELECT statement must be included in the GROUP BY clause. If they are not, the query will fail because SQL cannot aggregate the results correctly.

📖 Example: Consider a scenario where you need to count the number of products in each category. The query must include the category column in both the SELECT and GROUP BY clauses to work correctly.

SELECT category, 
COUNT(product_id) as product_count
FROM products
GROUP BY product_id;

In this incorrect example, the category column is not included in the GROUP BY clause, resulting in an error.

✅ Solution: To avoid such errors, ensure that all nonaggregated columns in the SELECT statement are included in the GROUP BY clause. This ensures consistency and allows SQL to aggregate the data correctly.

SELECT category, 
COUNT(product_id) as product_count
FROM products
GROUP BY category;

Correct Use of HAVING Conditions

Using the HAVING clause correctly is crucial for filtering results based on aggregated values. The HAVING clause should only be used with aggregate functions to ensure the query follows best practices and maintains clarity.

⚠️ Error: The HAVING clause should only be used with aggregated values. Using it with non-aggregated columns can cause confusion and lead to inefficient queries.

📖 Example: Consider a scenario where you need to count the number of products in each category and filter the categories. Using HAVING incorrectly to filter a non-aggregated column.

SELECT category, 
COUNT(product_id) as product_count
FROM products
GROUP BY category
HAVING category = 'Electronics';

In this incorrect example, the HAVING clause is used to filter based on a non-aggregated column, which is not the best practice.

✅ Solution: To avoid such errors, use the WHERE clause for filtering non-aggregated columns and reserve the HAVING clause for conditions on aggregated values. This approach aligns with best practices and enhances the clarity of your SQL code.

SELECT category, 
COUNT(product_id) as product_count
FROM products
WHERE category = 'Electronics'
GROUP BY category
HAVING COUNT(product_id) > 5;

Distinguishing Between WHERE and HAVING

Remembering the distinct roles of WHERE and HAVING is crucial for accurate query results. WHERE filters rows before any grouping occurs, while HAVING filters groups after aggregation.

⚠️ Error: Confusing WHERE and HAVING can lead to mistakes. Use WHERE to filter individual rows before grouping, and HAVING to filter grouped data after aggregation. Mixing them up causes inefficient and incorrect queries.

📖 Example: Consider a scenario where you need to filter products by category and count the number of products in each category. Using HAVING to filter rows instead of groups.

SELECT category, 
COUNT(product_id) as product_count
FROM products
HAVING category = 'Electronics'
GROUP BY category;

In this incorrect example, HAVING is used to filter rows by category, which should be done with WHERE.

✅ Solution:To avoid such errors, remember that WHERE is for filtering individual rows, and HAVING is for filtering groups after aggregation. This distinction helps ensure that your queries are both efficient and clear.

SELECT category, 
COUNT(product_id) as product_count
FROM products
WHERE category = 'Electronics'
GROUP BY category;

Including Aggregate Functions in HAVING Clause

When filtering based on aggregated values, it is essential to include the aggregate function in the HAVING clause.

⚠️ Error: The HAVING clause is designed to work with aggregate functions to filter groups based on aggregated values. Failing to include the aggregate function in the HAVING clause will result in incorrect query logic.

📖 Example: Consider a scenario where you need to count the number of products in each category and filter categories with more than five products. Incorrectly excluding the aggregate function.

SELECT category, COUNT(product_id)
FROM products
GROUP BY category
HAVING product_id > 5;

In this incorrect example, the HAVING clause is incorrectly filtering based on product_id without using the aggregate function.

✅ Solution: To avoid such errors, always include the aggregate function in the HAVING clause when filtering based on aggregated values. This ensures the query logic correctly applies the aggregation condition.

SELECT category, COUNT(product_id)
FROM products
GROUP BY category
HAVING COUNT(product_id) > 5;

Incorrect Use of QUALIFY Without Window Functions

The QUALIFY clause is specifically designed to work with window functions like ROW_NUMBER(), RANK(), and others. Using QUALIFY without a window function leads to incorrect queries and potential syntax errors.

⚠️ Error: Errors occur when the QUALIFY clause is used without any accompanying window function. QUALIFY is intended to filter the results of window functions, and without them, the clause cannot function correctly. This misuse can cause confusion and lead to invalid query logic.

📖 Example: Consider a scenario where you mistakenly use QUALIFY without a window function.

SELECT employee_name, department, salary
FROM employees
QUALIFY salary > 50000;

In this incorrect example, QUALIFY is used without a window function, which is not valid SQL syntax.

✅ Solution: To avoid such errors, ensure that QUALIFY is always used with window functions. Window functions like ROW_NUMBER(), RANK(), and others provide the necessary context for QUALIFY to filter the results effectively.

SELECT employee_name, department, salary,
  ROW_NUMBER() 
    OVER (
      PARTITION BY department 
      ORDER BY salary DESC) 
        as salary_rank
FROM employees
QUALIFY salary_rank = 1;

Best Practices for Query Optimization

Query optimization ensures that your SQL queries run efficiently and return results quickly. By following best practices, you can minimize the load on your database and improve the overall performance of your applications. Here are some key strategies for optimizing your SQL queries:

Use Search Indexes

Indexes act like internal guides for the database to locate specific information quickly. Identify frequently used columns in WHERE clauses and create indexes on those columns. However, creating too many indexes can slow down adding and updating data, so use them strategically.

The database must maintain the indexes and the main table data, adding some overhead. It's important to strike a balance and only create indexes on columns that will improve search speed.

Use WHERE Clause Instead of HAVING

WHERE clauses help improve performance because they filter out unnecessary rows before any grouping or aggregation happens. On the other hand, HAVING filters data after the grouping is done.

When you use WHERE instead of HAVING for row-level conditions, your query runs faster since it works with a smaller dataset from the beginning. This reduces the amount of data the database needs to group and calculate, making the entire process more efficient and less time-consuming.

Avoiding Queries Inside a Loop

Running queries inside a loop will significantly slow down the execution time. In most cases, you can insert and update data in bulk, which is a far better approach than running queries inside a loop.

Iterative patterns, such as those in for, while, and do-while loops, take much time to execute, affecting performance and scalability. All queries should be made outside of loops to avoid this, improving efficiency.

Using SELECT Instead of SELECT *

One of the best ways to enhance efficiency is to reduce the load on the database. This can be done by limiting the amount of information retrieved from each query. Running queries with SELECT * retrieves all the columns from a table, including unnecessary data, which increases load and processing time.

A SELECT statement with defined parameters decreases the database load and enhances performance. Specifying only the necessary columns will streamline the data retrieval process and improve query speed.

Use QUALIFY instead of GROUP BY and JOIN

When optimizing SQL queries, it's crucial to minimize complexity and improve efficiency. One common scenario involves finding specific rows within groups, such as the earliest sale for each salesperson.

Traditionally, this might be achieved using GROUP BY and JOIN operations, but these methods can be cumbersome and inefficient. In this guide, we explore how the QUALIFY clause offers a simpler, more efficient alternative for filtering results when using window functions.

Example:

Imagine we have a table of sales data with columns sales_person, sale_date, and amount. We want to find the earliest sale date and the corresponding amount for each salesperson.

WITH first_sale AS (
  SELECT
    sales_person,
    MIN(sale_date) AS earliest_date
  FROM
    sales
  GROUP BY
    sales_person
)

SELECT
  s.sales_person,
  s.amount,
  f.earliest_date
FROM
  sales s
JOIN
  first_sale f
ON
  s.sales_person = f.sales_person
  AND s.sale_date = f.earliest_date;

The join can result in duplicate rows if there are multiple sales on the earliest date, leading to inaccuracies in the results. Additionally, the query is complex and inefficient due to the need for intermediate result sets and the join operation, which increases the overall execution time and resource usage.

Solution :

We can use the QUALIFY clause to simplify the query and improve performance.

SELECT
  sales_person,
  sale_date,
  amount
FROM
  sales
QUALIFY
  ROW_NUMBER() OVER (PARTITION BY sales_person ORDER BY sale_date) = 1;

‍

Deepen Your Understanding with Some More BigQuery Functions

Beyond WHERE, HAVING, and QUALIFY, BigQuery offers powerful built-in functions that enhance data transformation, analysis, and maintenance. Mastering these can significantly boost query flexibility and insight generation. Below are key function categories and their uses:

ARRAY Functions: Useful for analyzing complex or nested data like customer activity or product attributes. These functions let you flatten, transform, and filter array values efficiently.
DATE Functions: Used to work with calendar dates (year, month, day). Ideal for filtering data by specific dates or calculating the difference between two dates.
DATETIME Functions: Handle both date and time components (without time zone). Perfect for scheduling events, tracking transactions, or timestamping events in business applications.
TIMESTAMP Functions: Include date, time, and time zone data. Useful for global applications where accurate time comparisons across regions are required, such as logs or real-time analytics.
STRING Functions: Helpful for cleaning and analyzing text such as reviews or descriptions. You can search, format, replace, or split strings to structure unstructured data.
DROP Statements: Safely removes obsolete or temporary tables and views. It’s especially helpful when cleaning up resources after migrations or analysis tasks.

Boost Your Data Analysis with OWOX Reports Extension for Google Sheets

Enhance your data analysis capabilities using advanced SQL clauses and tools like OWOX Reports Extension for Google Sheets to gain deeper insights and optimize your queries.

You can apply filters, use window functions, and schedule automated updates directly within Sheets. Whether you're a data analyst or a business user, OWOX Reports helps you work faster, reduce errors, and make informed decisions with live BigQuery data.

FAQ

You might also like

March 17, 2025

BigQuery String Functions: Syntax and Usage Examples

March 14, 2024

BigQuery Data Manipulation Language (DML): The Essential Guide

March 17, 2025

Conversion Functions in BigQuery: Detailed Analysis and Examples

April 12, 2024

A Closer Look at BigQuery Aggregate Functions

Automated Reporting, Unmatched Precision

I can finally automate reporting across my business with the precision and flexibility that I could previously only get from a team of 20 people

OWOX helps us to structure the marketing and website data into Google BigQuery

OWOX BI helps us make strategic and managerial decisions based on accurate, high-quality data. The advantage is that you just have to set it up once for all

Great Service!

We needed a solution that allowed us to keep a copy of all the analytics hits we were tracking. OWOX helped us do exactly that.

Transforming Data into Insightful Revenue

The OWOX team helped us go deep into modeling our company data so that we can look at our revenue metrics in a way that truly reflects how we see our business

All testimonials

WHERE vs. HAVING vs. QUALIFY: SQL Filter Operators Explained

Uses of SQL Filtering Clauses and Operators

Understanding the WHERE Clause

Syntax of WHERE Clause

Example of WHERE Clause

Get BigQuery Reports in Seconds

Exploring the HAVING Clause

Syntax of HAVING Clause

Example of HAVING Clause

Mastering the QUALIFY Clause

Syntax of QUALIFY Clause

Example of QUALIFY Clause

Simplify BigQuery Reporting in Sheets

Key Differences Between WHERE VS. HAVING VS. QUALIFY

Basic Applications of WHERE, HAVING, and QUALIFY

Using the WHERE Clause for Filtering Numeric Data

Using the WHERE Clause for Filtering Text Data

Using Qualify with ROW_NUMBER()

Simplify BigQuery Reporting in Sheets

Advanced Techniques for Applying WHERE, HAVING, and QUALIFY

Using Multiple Aggregate Functions in HAVING Clause

Using WHERE and HAVING in the Same SQL Query

Using Subqueries with HAVING Clause

Using QUALIFY with RANK or PERCENTILE_CONT

Get BigQuery Reports in Seconds

Using QUALIFY with ROW_NUMBER() in a Subquery

Common Errors with WHERE, HAVING, and QUALIFY Clauses

Correct Order of SQL Statements

Grouping Data Correctly

Ensuring Consistency Between SELECT and GROUP BY

Correct Use of HAVING Conditions

Distinguishing Between WHERE and HAVING

Including Aggregate Functions in HAVING Clause

Incorrect Use of QUALIFY Without Window Functions

Simplify BigQuery Reporting in Sheets

Best Practices for Query Optimization

Use Search Indexes

Use WHERE Clause Instead of HAVING

Avoiding Queries Inside a Loop

Using SELECT Instead of SELECT *

Use QUALIFY instead of GROUP BY and JOIN

Deepen Your Understanding with Some More BigQuery Functions

Boost Your Data Analysis with OWOX Reports Extension for Google Sheets

FAQ

You might also like

2,000 companies rely on us

Subscribe to the OWOX Insider: your analytics digest