Skip to content

LeetCode 586 - Customer Placing the Largest Number of Orders

Database Language: SQL Server

Difficulty: ⭐

Problem Description

Input

Table: Orders

Column Name Type
order_number int
customer_number int

order_number is the primary key (column with unique values) for this table. This table contains information about the order ID and the customer ID.

Requirement

Write a solution to find the customer_number for the customer who has placed the largest number of orders.

The test cases are generated so that exactly one customer will have placed more orders than any other customer.

The result format is in the following example.

Examples

Example 1

Input

Orders table:

order_number customer_number
1 1
2 2
3 3
4 3
Output
customer_number
3
Explanation

The customer with number 3 has two orders, which is greater than either customer 1 or 2 because each of them only has one order. So the result is customer_number 3.

SQL Schema

CREATE TABLE orders (order_number INT PRIMARY KEY, customer_number INT);

TRUNCATE TABLE orders;
INSERT INTO orders (order_number, customer_number) values ('1', '1');
INSERT INTO orders (order_number, customer_number) values ('2', '2');
INSERT INTO orders (order_number, customer_number) values ('3', '3');
INSERT INTO orders (order_number, customer_number) values ('4', '3');

Solution

To find the customer who has placed the largest number of orders, the number of orders placed by each customer needs to be determined first. This can be determined by using the COUNT() aggregate function:

SELECT customer_number, COUNT(order_number) AS order_count
FROM Orders
GROUP BY customer_number
customer_number order_count
1 1
2 1
3 2

Since the question wants the customer who placed the largest number of orders, the output of the previous query needs to be sorted by the order_count in descending order:

SELECT customer_number, COUNT(order_number) AS order_count
FROM Orders
GROUP BY customer_number
ORDER BY order_count DESC
customer_number order_count
3 2
1 1
2 1

The query above returned all customers but the question only wants a single customer who has placed the most orders. To address the requirement, the output needs to be limited to just 1 row using the LIMIT 1 clause:

SELECT TOP 1 customer_number, COUNT(order_number) AS order_count
FROM Orders
GROUP BY customer_number
ORDER BY order_count DESC
customer_number order_count
3 2

Lastly, the output needed is just the customer_number without the number of orders made by that customer so the order_count column in the SELECT clause needs to be removed:

SELECT TOP 1 customer_number
FROM Orders
GROUP BY customer_number
ORDER BY order_count DESC

But this generates the following error because the order_count column is being referenced in the ORDER BY clause:

Query 1 ERROR: Msg: 207, Line 4, State: 1, Level: 16
Invalid column name 'order_count'.

To resolve this, the previous expression used to create the order_count column will be used in the ORDER BY clause:

# Final Solution Query
SELECT TOP 1 customer_number
FROM Orders
GROUP BY customer_number
ORDER BY COUNT(order_number) DESC
customer_number
3

Here's the query plan generated by SQL Server for this query:

  |--Sort(TOP 1, ORDER BY:([Expr1002] DESC))
    |--Compute Scalar(DEFINE:([Expr1002]=CONVERT_IMPLICIT(int,[Expr1005],0)))
            |--Stream Aggregate(GROUP BY:([leetcode].[dbo].[Orders].[customer_number]) DEFINE:([Expr1005]=Count(*)))
                |--Sort(ORDER BY:([leetcode].[dbo].[Orders].[customer_number] ASC))
                    |--Clustered Index Scan(OBJECT:([leetcode].[dbo].[Orders].[PK_Orders]))

And here's the fastest runtime for this query:

  • Runtime: 446ms
  • Beats: 93.21% as of July 27, 2024