Unicode column misalignment · Issue #2612 · pandas-dev/pandas (original) (raw)

Skip to content

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sign up

Appearance settings

@wesm

Description

@wesm

In [17]: open('/home/wesm/tmp/foo.csv', 'rb').read()
Out[17]: '\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x80,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x89\r\nabc@example.com,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x80\r\ndef@example.com,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xba\x8c\r\nghi@example.com,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x89\r\n'

In [18]: read_csv('/home/wesm/tmp/foo.csv', encoding='utf-8')
Out[18]: 
               測試一  測試三
0  abc@example.com  測試一
1  def@example.com  測試二
2  ghi@example.com  測試三

In [24]: df
Out[24]: 
               測試一  測試三
0  abc@example.com  測試一
1  def@example.com  測試二
2  ghi@example.com  測試三

In [25]: df.columns[0]
Out[25]: u'\u6e2c\u8a66\u4e00'

In [26]: df.columns[1]
Out[26]: u'\u6e2c\u8a66\u4e09'