Pandas dataframe.aggregate() | Python (original) (raw)

Last Updated : 11 Apr, 2025

Dataframe.aggregate() function is used to apply some aggregation across one or more columns. Aggregate using callable, string, dict or list of string/callables.

The most frequently used aggregations are:

**sum: Return the sum of the values for the requested axis
**min: Return the minimum of the values for the requested axis
**max: Return the maximum of the values for the requested axis

Syntax

DataFrame.aggregate(func, axis=0, *args, **kwargs)

**Parameters:

**func : callable, string, dictionary or list of string/callables. Function to use for aggregating the data. If a function, must either work when passed a DataFrame or when passed to DataFrame.apply. For a DataFrame, can pass a dict, if the keys are DataFrame column names.
**axis : (default 0) {0 or ‘index’, 1 or ‘columns’} 0 or ‘index’: apply function to each column. 1 or ‘columns’: apply function to each row.

**Return Type: Returns Aggregated DataFrame.

Importing Pandas and Reading CSV File

For link to CSV file Used in Code, click

Python `

importing pandas package

import pandas as pd

making data frame from csv file

df = pd.read_csv("nba.csv")

printing the first 10 rows of the dataframe

df[:10]

**Output :

Examples of dataframe.aggregate()

Below, we are discussing how to add values of Excel in Python using Pandas

Example 1: Aggregating Data Across All Numeric Columns

We can Aggregate data across all numeric columns using built-in functions such as ‘sum’ and ‘min’.

Python `

df.select_dtypes(include='number').aggregate(['sum', 'min'])

**Output:

For each column which are having numeric values, minimum and sum of all values has been found. For Pandas Dataframe df , we have four such columns Number, Age, Weight, Salary.

Example 2: Aggregating Specific Columns

In Pandas, we can also apply different aggregation functions across different columns. For that, we need to pass a dictionary with key containing the column names and values containing the list of aggregation functions for any specific column.

Python `

df.aggregate({"Number":['sum', 'min'], "Age":['max', 'min'], "Weight":['min', 'sum'],
"Salary":['sum']})

**Output:

Separate aggregation has been applied to each column, if any specific aggregation is not applied on a column then it has NaN value corresponding to it.