Index.get_indexer() throws when None and at least one of np.nan, pd.NaT are present in input · Issue #22332 · pandas-dev/pandas (original) (raw)
Code Sample
arr=pd.unique(np.array([pd.NaT, None], dtype=np.object)) index=pd.Index(arr, dtype=np.object).get_indexer([])
Problem description
throws "Reindexing only valid with uniquely valued Index objects".
This is also the case for [np.nan, None]
Expected Output
It should not crash. One would also expect, that the array returned by pd.unique()
is really considered to be consisting of unique elements.
That is not the only problem in , but one blocker for #22305.
Output of pd.show_versions()
INSTALLED VERSIONS
commit: None
python: 3.6.2.final.0
python-bits: 64
OS: Linux
OS-release: 4.4.0-53-generic
machine: x86_64
processor: x86_64
byteorder: little
LC_ALL: None
LANG: en_US.UTF-8
LOCALE: en_US.UTF-8
pandas: 0.23.4
pytest: 3.2.1
pip: 10.0.1
setuptools: 36.5.0.post20170921
Cython: 0.28.3
numpy: 1.13.1
scipy: 1.1.0
pyarrow: None
xarray: None
IPython: 6.1.0
sphinx: 1.6.3
patsy: 0.4.1
dateutil: 2.6.1
pytz: 2017.2
blosc: None
bottleneck: 1.2.1
tables: 3.4.2
numexpr: 2.6.2
feather: None
matplotlib: 2.0.2
openpyxl: 2.4.8
xlrd: 1.1.0
xlwt: 1.3.0
xlsxwriter: 0.9.8
lxml: 3.8.0
bs4: 4.6.0
html5lib: 0.9999999
sqlalchemy: 1.1.13
pymysql: None
psycopg2: None
jinja2: 2.9.6
s3fs: 0.1.3
fastparquet: None
pandas_gbq: None
pandas_datareader: None
[paste the output of pd.show_versions()
here below this line]