Categorical[idx].codes != Categorical.codes[idx] · Issue #9469 · pandas-dev/pandas (original) (raw)
I'm unable to isolate this, but maybe this rings a bell for someone. It took me a long time to find the source of error in my code. I don't think this is intentional, and if so, is pretty dangerous.
[~/]
[22]: y[idx].codes[:15]
[22]: array([4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 1, 4], dtype=int8)
[~/]
[23]: y.codes[idx][:15]
[23]: array([4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4], dtype=int8)
[~/]
[24]: idx[:15]
[24]:
array([ 74172, 53885, 128953, 28125, 136548, 136700, 93633, 61147,
56535, 90577, 115719, 58038, 111711, 53399, 77475])
[~/]
[25]: type(y)
[25]: pandas.core.categorical.Categorical
[~/]
[27]: pd.version.version
[27]: '0.15.2'