pandas.core.groupby.DataFrameGroupBy.agg — pandas 3.0.0.dev0+2095.g2e141aaf99 documentation (original) (raw)

DataFrameGroupBy.agg(func=None, *args, engine=None, engine_kwargs=None, **kwargs)[source]#

Aggregate using one or more operations.

The aggregate function allows the application of one or more aggregation operations on groups of data within a DataFrameGroupBy object. It supports various aggregation methods, including user-defined functions and predefined functions such as ‘sum’, ‘mean’, etc.

Parameters:

funcfunction, str, list, dict or None

Function to use for aggregating the data. If a function, must either work when passed a DataFrame or when passed to DataFrame.apply.

Accepted combinations are:

function
string function name
list of functions and/or function names, e.g. [np.sum, 'mean']
dict of index labels -> functions, function names or list of such.
None, in which case **kwargs are used with Named Aggregation. Here the output has one column for each element in **kwargs. The name of the column is keyword, whereas the value determines the aggregation used to compute the values in the column.
Can also accept a Numba JIT function withengine='numba' specified. Only passing a single function is supported with this engine.
If the 'numba' engine is chosen, the function must be a user defined function with values and index as the first and second arguments respectively in the function signature. Each group’s index will be passed to the user defined function and optionally available for use.

*args

Positional arguments to pass to func.

enginestr, default None

'cython' : Runs the function through C-extensions from cython.
'numba' : Runs the function through JIT compiled code from numba.
NoneDefaults to 'cython' or globally setting
compute.use_numba

engine_kwargsdict, default None

For 'cython' engine, there are no accepted engine_kwargs
For 'numba' engine, the engine can accept nopython, nogiland parallel dictionary keys. The values must either be True orFalse. The default engine_kwargs for the 'numba' engine is{'nopython': True, 'nogil': False, 'parallel': False} and will be applied to the function

**kwargs

If func is None, **kwargs are used to define the output names and aggregations via Named Aggregation. See func entry.
Otherwise, keyword arguments to be passed into func.

Returns:

DataFrame

Aggregated DataFrame based on the grouping and the applied aggregation functions.