How to read Unicode file as Unicode string in Python -
i have file encoded in unicode or utf-8 (i don't know which). when read file in python 3.4, resulting string interpreted ascii string. how convert unicode string u"text"
?
the term "unicode" refers standard, not particular encoding. since files in computers binary, there exist different ways of encoding unicode data in binary files. 1 of them "utf-8".
you can consult https://docs.python.org/3/howto/unicode.html
an example taken document (in section "reading , writing unicode data")
with open('unicode.txt', encoding='utf-8') f: line in f: print(repr(line))
in python 3, unlike python2, unicode string constants not written "u".
Comments
Post a Comment