Unicode (UTF-8) reading and writing to files in Python

Original

2021-10-06 09:19

Question:

I'm having some brain failure in understanding reading and writing text to a file (Python 2.4).

# The string, which has an a-acute in it.
ss = u'Capit\xe1n'
ss8 = ss.encode('utf8')
repr(ss), repr(ss8)

("u'Capit\\xe1n'", "'Capit\\xc3\\xa1n'")

print ss, ss8
print >> open('f1','w'), ss8

>>> file('f1').read()
'Capit\xc3\xa1n\n'

So I type in Capit\xc3\xa1n into my favorite editor, in file f2.

Then:

>>> open('f1').read()
'Capit\xc3\xa1n\n'
>>> open('f2').read()
'Capit\\xc3\\xa1n\n'
>>> open('f1').read().decode('utf8')
u'Capit\xe1n\n'
>>> open('f2').read().decode('utf8')
u'Capit\\xc3\\xa1n\n'

What am I not understanding here? Clearly there is some vital bit of magic (or good sense) that I'm missing. What does one type into text files to get proper conversions?

What I'm truly failing to grok here is what the point of the UTF-8 representation is, if you can't actually get Python to recognize it when it comes from outside. Maybe I should just JSON dump the string and use that instead, since that has an asciiable representation! More to the point, is there an ASCII representation of this Unicode object that Python will recognize and decode when coming in from a file? If so, how do I get it?

>>> print simplejson.dumps(ss)
'"Capit\u00e1n"'
>>> print >> file('f3','w'), simplejson.dumps(ss)
>>> simplejson.load(open('f3'))
u'Capit\xe1n'

Solution:
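
The transcript in the question suggests that f1 and f2 simply hold different bytes: f1 contains the two UTF-8 bytes C3 A1, while f2 contains the literal characters backslash, x, c, 3 and so on, which `.decode('utf8')` leaves untouched because they are already plain ASCII. The following is a minimal sketch of that difference, not a definitive answer. It assumes Python 3 (the question uses Python 2.4, where `codecs.open` with an `encoding` argument plays the role of the `encoding=` parameter used here), and the temporary directory and variable names are made up for illustration.

```python
import os
import tempfile

# Hypothetical scratch directory so the example does not touch real files.
workdir = tempfile.mkdtemp()
f1 = os.path.join(workdir, "f1")
f2 = os.path.join(workdir, "f2")

ss = "Capit\xe1n"  # text string containing U+00E1 (a-acute)

# f1: let open() encode the text; the file ends up with the bytes C3 A1.
with open(f1, "w", encoding="utf-8", newline="\n") as fh:
    fh.write(ss + "\n")

# f2: what an editor saves when backslash, x, c, 3, ... are typed literally.
with open(f2, "w", encoding="ascii", newline="\n") as fh:
    fh.write("Capit\\xc3\\xa1n\n")

# Read both files back as raw bytes to see what is really stored.
with open(f1, "rb") as fh:
    raw1 = fh.read()
with open(f2, "rb") as fh:
    raw2 = fh.read()

# Decoding f1 as UTF-8 recovers the original text.
with open(f1, encoding="utf-8") as fh:
    text1 = fh.read()

# f2 needs an extra step: interpret the backslash escapes first (which
# yields one character per escaped byte), map those characters back to
# bytes via latin-1, and only then decode the bytes as UTF-8.
text2 = raw2.decode("unicode_escape").encode("latin-1").decode("utf-8")

print(repr(raw1))
print(repr(raw2))
print(text1 == text2)
```

Under these assumptions, typing the character á itself into an editor that saves as UTF-8 would produce a file like f1, so no escape handling would be needed at all.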

Reference 1: https://en.stackoom.com/question/23yD
Reference 2: https://stackoom.com/question/23yD
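
On the question of an ASCII-only representation that Python can decode again: the JSON round trip in the question is one such form, and the `unicode_escape` codec is another. The sketch below assumes Python 3 and the standard-library `json` module rather than `simplejson`; the variable names are illustrative only.

```python
import json

ss = "Capit\xe1n"  # text string containing U+00E1 (a-acute)

# json.dumps escapes non-ASCII characters by default (ensure_ascii=True),
# so the result can be stored in a pure-ASCII file.
as_json = json.dumps(ss)
from_json = json.loads(as_json)

# The unicode_escape codec gives a similar ASCII-only byte form,
# using Python's own backslash escape syntax.
as_escape = ss.encode("unicode_escape")
from_escape = as_escape.decode("unicode_escape")

print(as_json)
print(as_escape)
print(from_json == ss, from_escape == ss)
```

Both forms survive a trip through an ASCII-only channel, but the reader has to know which convention was used, since neither is detected automatically.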
