Skip to content

Unicode column misalignment #2612

Closed
Closed
@wesm

Description

@wesm
In [17]: open('/home/wesm/tmp/foo.csv', 'rb').read()
Out[17]: '\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x80,\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x89\r\[email protected],\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x80\r\[email protected],\xe6\xb8\xac\xe8\xa9\xa6\xe4\xba\x8c\r\[email protected],\xe6\xb8\xac\xe8\xa9\xa6\xe4\xb8\x89\r\n'

In [18]: read_csv('/home/wesm/tmp/foo.csv', encoding='utf-8')
Out[18]: 
               測試一  測試三
0  [email protected]  測試一
1  [email protected]  測試二
2  [email protected]  測試三

In [24]: df
Out[24]: 
               測試一  測試三
0  [email protected]  測試一
1  [email protected]  測試二
2  [email protected]  測試三

In [25]: df.columns[0]
Out[25]: u'\u6e2c\u8a66\u4e00'

In [26]: df.columns[1]
Out[26]: u'\u6e2c\u8a66\u4e09'

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions