BUG: Index.get_indexer will change behaviour for nulls with arrow strings (original) (raw)

Pandas version checks

Reproducible Example

idx = pd.Index(["a", "b", None], dtype="string[pyarrow_numpy]") idx.get_indexer([None])

Issue Description

This returns -1 for the new arrow string dtype because we cast None to np.nan when creating the array. We wanted to be as close to object dtype as possible, but patching this does not seem ideal. Thoughts?

cc @jorisvandenbossche

Expected Behavior

see above

Installed Versions

Details

Replace this line with the output of pd.show_versions()