How to Use the GROUPBY Function in Excel
Excel's GROUPBY function: powerful data grouping and aggregation tools
Excel's GROUPBY
function allows you to group and aggregate data based on specific fields in the data table. It also provides parameters that allow you to sort and filter the data so that you can customize the output to your specific needs.
GROUPBY
function syntax
The GROUPBY
function contains eight parameters:
<code>=GROUPBY(a,b,c,d,e,f,g,h)</code>
Parameters a to c are required:
- a (row field): A range (one column or multiple columns) containing the value or category to which the data is grouped.
- b (value): The range of values containing aggregated data (one column or multiple columns).
- c (function): a function used to aggregate the values in parameter b .
Parameters d to h are optional, and you can learn more about these parameters in the last part of this article:
- d (field title): A number that specifies whether you selected the title in parameters a and b , and whether they should be displayed in the output.
- e (Total Depth): A number that determines whether the output should display the total.
- f (sorting order): a number that indicates how the results are sorted.
- g (Filter Array): An array-oriented formula used to filter out unnecessary information.
- h (field relationship): A number that specifies the field relationship when multiple columns are provided in parameter a .
GROUPBY
function practical: only the required parameters are used
If you are overwhelmed by a large number of parameters of the GROUPBY
function, it is important to note that GROUPBY
function works perfectly even if you only fill in the parameters a , b and c . So first, I'll show you how GROUPBY
function can use only these three parameters.
Suppose you have a restaurant chain that serves different dishes from different cuisines and you have calculated the total sales and average customer ratings for each cuisine-dish combination.
While these data are useful, you may be more interested in comparisons of different categories of data. Specifically, you might want to know the total revenue per cuisine and the average customer rating for each dish.
Then, since you want to see the total sales for each cuisine, select the cell that contains these data and add another comma:
<code>=GROUPBY(TabFood[Cuisine],TabFood[Sales],</code>
The last required parameter is the function used for aggregated data. In this example, since you want to find out the total sales of each cuisine, you need to insert the SUM
function and turn off the brackets:
<code>=GROUPBY(TabFood[Cuisine],TabFood[Sales],SUM)</code>
After pressing Enter, Excel calculates the average customer rating for each dish type. Again, without any optional parameters, the data is sorted alphabetically by default by the values in the column on the left, with a convenient total row at the bottom.
Since the values in column J are decimal averages, you can organize the displayed decimal places by clicking the "Increase Decimal places" and "Decrease Decimal places" buttons in the "Numbers" group on the Start tab.
GROUPBY
function practical: use optional parameters
Although the GROUPBY
function has five optional parameters in addition to the three required parameters, which makes it more complicated, these additional options are really just to help you create output that is more in line with your needs. More importantly, you can choose which optional parameters to use and skip unwanted parameters.
Below, I'll cover each optional parameter so you can see how they will affect your data when selecting to include them.
Field title
In my above example, I manually typed the output column headers because by default they are not included in the result. However, if you want the output data to contain the column title and the data it contains, use the field title parameter.
First type your GROUPBY
formula, including the first three (required) parameters. In this case, let's assume that you want to group the cuisines by average customer rating:
<code>=GROUPBY(A1:A21,D1:D21,AVERAGE</code>
Note that the title line is included in the selection. In fact, when selecting data for the first two parameters, you should consider in advance whether you want to output the data copy title in the table.
Benefits of including field titles | Disadvantages of including field titles |
---|---|
If you change the title in the original table, the output title will take these changes. | If you want to make the output title more specific than the original table title, you can't change the output title. |
Total depth
The Total Depth parameter allows you to decide whether you want the results to display a total, and if so, whether they should be at the top or bottom of the data. This parameter also allows you to choose whether to display a subtotal.
For Total Depth Parameters, type:
- 0, if you do not want any totals or subtotals to be displayed,
- 1. If you just want to display the total at the bottom of the result,
- 2. If you want the subtotal to appear at the bottom of each result category and display the total at the bottom of the entire result,
- -1, if you just want to display the total at the top of the result,
- -2, if you want the subtotal to appear at the top of each result category and display the total at the top of the entire result.
Sort order
The Sort Order field allows you to tell Excel whether and how to sort the results. Using this parameter does highlight why the GROUPBY
function is more useful than using a pivot table: as long as you change any data in the original table, the entire output data is reordered according to the sort order parameters, and the pivot table needs to be refreshed manually.
The number you enter for this parameter represents the column in the result. For example, if you type 1, this will sort the results for the first column in ascending or alphabetical order. On the other hand, typing -1 will sort the results of the first column in descending or inverse alphabetical order.
In this example, I've typed:
<code>=GROUPBY(A1:A21,C1:C21,SUM,,,,-2)</code>
This will sort the second column (sales) in descending order.
Filter arrays
Filtering array parameters are unlikely to be used like the previous optional parameters, although it can help if your original data table contains rows that may break the data.
In this example, the years in cells A2, A8, and A17 interrupt the result of GROUPBY
function.
I can use the filter array parameter to tell Excel to ignore any cell containing numbers in column A via the ISNUMBER
function:
<code>=GROUPBY(A1:A24,C1:C24,SUM,,,,ISNUMBER(A1:A24)=FALSE)</code>
Field Relationship
Finally, the field relation parameter controls how the data is grouped when the row field parameter refers to multiple columns.
In this example, when the field relationship parameter contains 0 (the default value if the parameter is omitted), GROUPBY
will return a hierarchical result table, where each column is represented by a separate data row.
<code>=GROUPBY(A1:B21,C1:C21,SUM,,,3,,0)</code>
On the other hand, when the field relationship parameter contains 1, GROUPBY
will return a result table that ignores the hierarchy and sorts each column independently. In other words, categories are not nested, which is why you can't include subtotals in the result when you select this field relationship option.
<code>=GROUPBY(A1:B21,C1:C21,SUM,,,3,,1)</code>
In addition to using SUM
and AVERAGE
in the GROUPBY
function parameters, you can also use the PERENTOF
function, which converts data into percentages to show the proportion of the subset that makes up the entire dataset.
The above is the detailed content of How to Use the GROUPBY Function in Excel. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics











Quick link Why should tables be named in Excel How to name a table in Excel Excel table naming rules and techniques By default, tables in Excel are named Table1, Table2, Table3, and so on. However, you don't have to stick to these tags. In fact, it would be better if you don't! In this quick guide, I will explain why you should always rename tables in Excel and show you how to do this. Why should tables be named in Excel While it may take some time to develop the habit of naming tables in Excel (if you don't usually do this), the following reasons illustrate today

Excel Overflow Range Operator (#) enables formulas to be automatically adjusted to accommodate changes in overflow range size. This feature is only available for Microsoft 365 Excel for Windows or Mac. Common functions such as UNIQUE, COUNTIF, and SORTBY can be used in conjunction with overflow range operators to generate dynamic sortable lists. The pound sign (#) in the Excel formula is also called the overflow range operator, which instructs the program to consider all results in the overflow range. Therefore, even if the overflow range increases or decreases, the formula containing # will automatically reflect this change. How to list and sort unique values in Microsoft Excel

Use formula conditional formatting to handle overflow arrays in Excel Direct formatting of overflow arrays in Excel can cause problems, especially when the data shape or size changes. Formula-based conditional formatting rules allow automatic formatting to be adjusted when data parameters change. Adding a dollar sign ($) before a column reference applies a rule to all rows in the data. In Excel, you can apply direct formatting to the values or background of a cell to make the spreadsheet easier to read. However, when an Excel formula returns a set of values (called overflow arrays), applying direct formatting will cause problems if the size or shape of the data changes. Suppose you have this spreadsheet with overflow results from the PIVOTBY formula,

This tutorial shows you how to quickly apply, modify, and remove Excel table styles while preserving all table functionalities. Want to make your Excel tables look exactly how you want? Read on! After creating an Excel table, the first step is usual

This tutorial explains how to use MATCH function in Excel with formula examples. It also shows how to improve your lookup formulas by a making dynamic formula with VLOOKUP and MATCH. In Microsoft Excel, there are many different lookup/ref

Quick Links The AGGREGATE Syntax

The tutorial shows how to compare text strings in Excel for case-insensitive and exact match. You will learn a number of formulas to compare two cells by their values, string length, or the number of occurrences of a specific character, a
