What’s new in 2.1.2 (October 26, 2023) — pandas 2.2.3 documentation
These are the changes in pandas 2.1.2. See Release notes for a full changelog including other versions of pandas.
Deprecations#
- Reverted deprecation of fill_method=None in DataFrame.pct_change(), Series.pct_change(), DataFrameGroupBy.pct_change(), and SeriesGroupBy.pct_change(); the values 'backfill', 'bfill', 'pad', and 'ffill' are still deprecated (GH 53491)
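As a minimal sketch of what this means in practice, the example below calls pct_change() with fill_method=None and, separately, fills gaps explicitly before computing the change instead of passing one of the still-deprecated string values. The sample data and the variable names are illustrative assumptions, not part of the release notes.

```python
import pandas as pd

# Illustrative data with one missing observation.
s = pd.Series([100.0, None, 110.0, 121.0])

# fill_method=None leaves missing values untouched, so any change
# computed against a missing neighbour is itself missing.
no_fill = s.pct_change(fill_method=None)

# To get forward-filled behaviour without the deprecated 'ffill'/'pad'
# values, fill explicitly first and then compute the change.
filled = s.ffill().pct_change(fill_method=None)

print(no_fill)
print(filled)
```

Filling explicitly keeps the imputation step visible in the calling code rather than hidden inside pct_change().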
Fixed regressions#
- Fixed regression in DataFrame.join() where result has missing values and dtype is arrow backed string (GH 55348)
- Fixed regression in rolling() where a non-nanosecond index or on column would produce incorrect results (GH 55026, GH 55106, GH 55299)
- Fixed regression in DataFrame.resample() which was extrapolating back to origin when origin was outside its bounds (GH 55064)
- Fixed regression in DataFrame.sort_index() which was not sorting correctly when the index was a sliced MultiIndex (GH 55379)
- Fixed regression in DataFrameGroupBy.agg() and SeriesGroupBy.agg() where if the option compute.use_numba was set to True, groupby methods not supported by the numba engine would raise a TypeError (GH 55520)
- Fixed performance regression with wide DataFrames, typically involving methods where all columns were accessed individually (GH 55256, GH 55245)
- Fixed regression in merge_asof() raising TypeError for by with datetime and timedelta dtypes (GH 55453)
- Fixed regression in read_parquet() when reading a file with a string column consisting of more than 2 GB of string data and using the "string" dtype (GH 55606)
- Fixed regression in DataFrame.to_sql() not roundtripping datetime columns correctly for sqlite when using detect_types (GH 55554)
- Fixed regression in construction of certain DataFrame or Series subclasses (GH 54922)
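To make one of these regressions concrete, the sketch below shows the kind of call the DataFrame.sort_index() entry refers to: sorting a frame whose index is a slice of a larger MultiIndex. The data, level names, and variable names are illustrative assumptions and are not taken from the linked issue.

```python
import pandas as pd

# Build a deliberately unsorted two-level index.
mi = pd.MultiIndex.from_product([[2, 1], ["b", "a"]], names=["num", "let"])
df = pd.DataFrame({"value": range(4)}, index=mi)

# Slicing keeps a view of the original index: (2, a), (1, b), (1, a).
sliced = df.iloc[1:]

# On a fixed version the slice is sorted lexicographically by both levels.
result = sliced.sort_index()
print(result)
```

A quick way to check an installation is to compare list(result.index) against the same tuples sorted in plain Python.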
Bug fixes#
- Fixed bug in DataFrameGroupBy reductions not preserving object dtype when infer_string is set (GH 55620)
- Fixed bug in SeriesGroupBy.value_counts() returning incorrect dtype for string columns (GH 55627)
- Fixed bug in Categorical.equals() if other has arrow backed string dtype (GH 55364)
- Fixed bug in DataFrame.__setitem__() not inferring string dtype for zero-dimensional array with infer_string=True (GH 55366)
- Fixed bug in DataFrame.idxmin() and DataFrame.idxmax() raising for arrow dtypes (GH 55368)
- Fixed bug in DataFrame.interpolate() raising incorrect error message (GH 55347)
- Fixed bug in Index.insert() raising when inserting None into Index with dtype="string[pyarrow_numpy]" (GH 55365)
- Fixed bug in Series.all() and Series.any() not treating missing values correctly for dtype="string[pyarrow_numpy]" (GH 55367)
- Fixed bug in Series.floordiv() for ArrowDtype (GH 55561)
- Fixed bug in Series.mode() not sorting values for arrow backed string dtype (GH 55621)
- Fixed bug in Series.rank() for string[pyarrow_numpy] dtype (GH 55362)
- Fixed bug in Series.str.extractall() for ArrowDtype dtype being converted to object (GH 53846)
- Fixed bug where PDEP-6 warning about setting an item of an incompatible dtype was being shown when creating a new conditional column (GH 55025)
- Silence Period[B] warnings introduced by GH 53446 during normal plotting activity (GH 55138)
- Fixed bug in Series constructor not inferring string dtype when NA is the first value and infer_string is set (GH 55655)
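The PDEP-6 entry above concerns creating a new column through a boolean mask. The following is a rough sketch of that pattern, assuming an illustrative frame and column names; it records any FutureWarning raised during the assignment so the behaviour of the installed version can be inspected, and it is not the reproducer from the linked issue.

```python
import warnings

import pandas as pd

# Illustrative data; the column names are hypothetical.
df = pd.DataFrame({"score": [35, 80, 62]})

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    # "grade" does not exist yet, so this creates a new conditional column:
    # rows matching the mask get "pass", the rest are left missing.
    df.loc[df["score"] >= 60, "grade"] = "pass"

# Collect only FutureWarnings, the category used for PDEP-6 notices.
future_warnings = [w for w in caught if issubclass(w.category, FutureWarning)]
print(df)
print(len(future_warnings))
```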
Other#
- Fixed non-working installation of optional dependency group output_formatting. Replacing the underscore _ with a dash - fixes broken dependency resolution. The correct way to install it is now pip install pandas[output-formatting].
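One way to check which spelling an installed pandas declares is to read the extras from its package metadata. The helper below, declared_extras, is a hypothetical name introduced for illustration; it relies only on the standard library importlib.metadata module and makes no assumption about which extras a given pandas version lists.

```python
from importlib.metadata import PackageNotFoundError, metadata


def declared_extras(dist_name: str) -> list[str]:
    """Return the extras a distribution declares, or [] if none or not installed."""
    try:
        # "Provides-Extra" appears once per optional dependency group.
        return sorted(metadata(dist_name).get_all("Provides-Extra") or [])
    except PackageNotFoundError:
        return []


extras = declared_extras("pandas")
print("output-formatting" in extras)
print(extras)
```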
Contributors#
A total of 20 people contributed patches to this release. People with a “+” by their names contributed a patch for the first time.
- Amanda Bizzinotto
- Artur Barseghyan +
- Hadi Abdi Khojasteh +
- Joris Van den Bossche
- Luke Manley
- Lumberbot (aka Jack)
- Marc Garcia
- Marco Edward Gorelli
- MarcoGorelli
- Mateusz Sokół
- Matthew Roeschke
- Natalia Mokeeva
- Pandas Development Team
- Patrick Hoefler
- Richard Shadrach
- Thomas Li
- Yao Xiao
- dependabot[bot]
- rohanjain101 +
- torext +