pandas.DataFrame.select_dtypes — pandas 0.16.2 documentation (original) (raw)
DataFrame.select_dtypes(include=None, exclude=None)¶
Return a subset of a DataFrame including/excluding columns based on their dtype.
Parameters: | include, exclude : list-like A list of dtypes or strings to be included/excluded. You must pass in a non-empty sequence for at least one of these. |
---|---|
Returns: | subset : DataFrame The subset of the frame including the dtypes in include and excluding the dtypes in exclude. |
Raises: | ValueError If both of include and exclude are empty If include and exclude have overlapping elements If any kind of string dtype is passed in. TypeError If either of include or exclude is not a sequence |
Notes
- To select all numeric types use the numpy dtype numpy.number
- To select strings you must use the object dtype, but note that this will return all object dtype columns
- See the numpy dtype hierarchy
- To select Pandas categorical dtypes, use ‘category’
Examples
df = pd.DataFrame({'a': np.random.randn(6).astype('f4'), ... 'b': [True, False] * 3, ... 'c': [1.0, 2.0] * 3}) df a b c 0 0.3962 True 1 1 0.1459 False 2 2 0.2623 True 1 3 0.0764 False 2 4 -0.9703 True 1 5 -1.2094 False 2 df.select_dtypes(include=['float64']) c 0 1 1 2 2 1 3 2 4 1 5 2 df.select_dtypes(exclude=['floating']) b 0 True 1 False 2 True 3 False 4 True 5 False