BUG: Some sas7bdat files with many columns are not parseable by read_sas by troels · Pull Request #22628 · pandas-dev/pandas (original) (raw)
Hi @gfyoung
So the test case I added will fail in the current version of pandas, with the following error (these files can't be parsed):
pandas/tests/io/sas/test_sas7bdat.py ............F. [100%]
=============================================== FAILURES ===============================================
__________________________________________ test_many_columns ___________________________________________
datapath = <function datapath.<locals>.deco at 0x7f53f5a47f28>
def test_many_columns(datapath):
fname = datapath("io", "sas", "data", "many_columns.sas7bdat")
> df = pd.read_sas(fname, encoding='latin-1')
pandas/tests/io/sas/test_sas7bdat.py:188:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
pandas/io/sas/sasreader.py:68: in read_sas
data = reader.read()
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
self = <pandas.io.sas.sas7bdat.SAS7BDATReader object at 0x7f53f5899860>, nrows = 3
def read(self, nrows=None):
if (nrows is None) and (self.chunksize is not None):
nrows = self.chunksize
elif nrows is None:
nrows = self.row_count
if len(self.column_types) == 0:
self.close()
> raise EmptyDataError("No columns to parse from file")
E pandas.errors.EmptyDataError: No columns to parse from file
pandas/io/sas/sas7bdat.py:604: EmptyDataError
================================= 1 failed, 13 passed in 18.88 seconds =================================