y********o 发帖数: 2565 | 1 Hi.
I am trying to process an xml-tagged chinese document using Python's minidom.
But my code gets stuck when it hits the first Chinese character in the
xml-tagged document. Python complains that the Chinese character is an invalid
token, and thus not well-formed.
I tried using encoding="UTF-8" and encoding="UTF-16" and encoding="GB2312" and
encoding="GBK" in the xml-tagged chinese document. None of them helped.
Would you please give a hint? Thanks. | c***r 发帖数: 4631 | 2
minidom.
invalid
and
try ISO8859-1
【在 y********o 的大作中提到】 : Hi. : I am trying to process an xml-tagged chinese document using Python's minidom. : But my code gets stuck when it hits the first Chinese character in the : xml-tagged document. Python complains that the Chinese character is an invalid : token, and thus not well-formed. : I tried using encoding="UTF-8" and encoding="UTF-16" and encoding="GB2312" and : encoding="GBK" in the xml-tagged chinese document. None of them helped. : Would you please give a hint? Thanks.
|
|