BUG: Groupby min/max with nullable dtypes by jbrockmendel · Pull Request #42567 · pandas-dev/pandas (original) (raw)
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good!
I agree with Brock on having consistent use of mask
being better than some mask
and some mask_in
. I think having a function here with both mask
and result_mask
, I think it's clear that mask
is about the input.
@jbrockmendel here or for a follow-up, but it might be good to add a benchmark for this. Based on a quick check, it seems that GroupByMethods
and GroupByCythonAgg
target those algos (both in benchmarks/groupby.py
), and are not yet parametrized with a nullable dtype.