Unicode column misalignment · Issue #2612 · pandas-dev/pandas (original) (raw)
Navigation Menu
- Explore
- Pricing
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Appearance settings
Description
In [17]: open('/home/wesm/tmp/foo.csv', 'rb').read()
Out[17]: '\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x80,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x89\r\nabc@example.com,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x80\r\ndef@example.com,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xba\x8c\r\nghi@example.com,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x89\r\n'
In [18]: read_csv('/home/wesm/tmp/foo.csv', encoding='utf-8')
Out[18]:
測試一 測試三
0 abc@example.com 測試一
1 def@example.com 測試二
2 ghi@example.com 測試三
In [24]: df
Out[24]:
測試一 測試三
0 abc@example.com 測試一
1 def@example.com 測試二
2 ghi@example.com 測試三
In [25]: df.columns[0]
Out[25]: u'\u6e2c\u8a66\u4e00'
In [26]: df.columns[1]
Out[26]: u'\u6e2c\u8a66\u4e09'