Version 0.6.0 (November 25, 2011) — pandas 3.0.0rc0+31.g944c527c0a documentation (original) (raw)
New features#
- Added
meltfunction topandas.core.reshape - Added
levelparameter to group by level in Series and DataFrame descriptive statistics (GH 313) - Added
headandtailmethods to Series, analogous to DataFrame (GH 296) - Added
Series.isinfunction which checks if each value is contained in a passed sequence (GH 289) - Added
float_formatoption toSeries.to_string - Added
skip_footer(GH 291) andconverters(GH 343) options toread_csvandread_table - Added
drop_duplicatesandduplicatedfunctions for removing duplicate DataFrame rows and checking for duplicate rows, respectively (GH 319) - Implemented operators ‘&’, ‘|’, ‘^’, ‘-’ on DataFrame (GH 347)
- Added
Series.mad, mean absolute deviation - Added
QuarterEndDateOffset (GH 321) - Added
dotto DataFrame (GH 65) - Added
orientoption toPanel.from_dict(GH 359, GH 301) - Added
orientoption toDataFrame.from_dict - Added passing list of tuples or list of lists to
DataFrame.from_records(GH 357) - Added multiple levels to groupby (GH 103)
- Allow multiple columns in
byargument ofDataFrame.sort_index(GH 92, GH 362) - Added fast
get_valueandput_valuemethods to DataFrame (GH 360) - Added
covinstance methods to Series and DataFrame (GH 194, GH 362) - Added
kind='bar'option toDataFrame.plot(GH 348) - Added
idxminandidxmaxto Series and DataFrame (GH 286) - Added
read_clipboardfunction to parse DataFrame from clipboard (GH 300) - Added
nuniquefunction to Series for counting unique elements (GH 297) - Made DataFrame constructor use Series name if no columns passed (GH 373)
- Support regular expressions in read_table/read_csv (GH 364)
- Added
DataFrame.to_htmlfor writing DataFrame to HTML (GH 387) - Added support for MaskedArray data in DataFrame, masked values converted to NaN (GH 396)
- Added
DataFrame.boxplotfunction (GH 368) - Can pass extra args, kwds to DataFrame.apply (GH 376)
- Implement
DataFrame.joinwith vectoronargument (GH 312) - Added
legendboolean flag toDataFrame.plot(GH 324) - Can pass multiple levels to
stackandunstack(GH 370) - Can pass multiple values columns to
pivot_table(GH 381) - Use Series name in GroupBy for result index (GH 363)
- Added
rawoption toDataFrame.applyfor performance if only need ndarray (GH 309) - Added proper, tested weighted least squares to standard and panel OLS (GH 303)
Performance enhancements#
- VBENCH Cythonized
cache_readonly, resulting in substantial micro-performance enhancements throughout the code base (GH 361) - VBENCH Special Cython matrix iterator for applying arbitrary reduction operations with 3-5x better performance than
np.apply_along_axis(GH 309) - VBENCH Improved performance of
MultiIndex.from_tuples - VBENCH Special Cython matrix iterator for applying arbitrary reduction operations
- VBENCH + DOCUMENT Add
rawoption toDataFrame.applyfor getting better performance when - VBENCH Faster cythonized count by level in Series and DataFrame (GH 341)
- VBENCH? Significant GroupBy performance enhancement with multiple keys with many “empty” combinations
- VBENCH New Cython vectorized function
map_inferspeeds upSeries.applyandSeries.mapsignificantly when passed elementwise Python function, motivated by (GH 355) - VBENCH Significantly improved performance of
Series.order, which also makes np.unique called on a Series faster (GH 327) - VBENCH Vastly improved performance of GroupBy on axes with a MultiIndex (GH 299)
Contributors#
A total of 8 people contributed patches to this release. People with a “+” by their names contributed a patch for the first time.
- Adam Klein +
- Chang She +
- Dieter Vandenbussche
- Jeff Hammerbacher +
- Nathan Pinger +
- Thomas Kluyver
- Wes McKinney
- Wouter Overmeire +