ENH: Handle categorical dtype to/from R by jseabold · Pull Request #9187 · pandas-dev/pandas (original) (raw)
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jseabold: Sorry, my bad. I see now that that code was added due to
GH #1615. Unfortunately, there appears to be a regression whenfactors_as_strings=False:
from pandas.rpy.common import load_data
prestige = load_data('Prestige', 'car', factors_as_strings=False)
raises ValueError: codes need to be between -1 and len(categories)-1.
And the issue also affects factors_as_strings=True, where
prestige = load_data('Prestige', 'car', factors_as_strings=True)
raises IndexError: index 2147483647 is out of bounds for axis 0 with size 3