python【】詞性標註橫排

原創

2020-02-24 13:46

>>> import re
>>> import jieba.posseg as pseg
>>> f = open('E:/序言.txt','r').read()
>>> words = pseg.cut(f)
>>> l = []
>>> m = []

>>> for w in words:
...   x = w.word
...   y = w.flag
...   l.append((x))
...   m.append((y))
...
Building prefix dict from the default dictionary ...
Loading model from cache C:\Users\oil\AppData\Local\Temp\jieba.cache
Loading model cost 0.893 seconds.
Prefix dict has been built succesfully.
>>> print(l)
['美國版', '序言', '\n', '\n', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '-', '\n', '\n', '\u3000', '\u3000', ' 目前', '已經', '有', '不少', '部', '哲學史', '了', '，', '我', '的', '目的', '並', '不是', '要', '僅僅', '在', '它們', '之中', '再', '加上', '一部', '。', '我', '的', '目的', '是', '要', '揭示', '，', '哲學', '乃是', '社會', '生活', '與', '政治', '生活', '的', '一個', '組成部分', '：', '它', '並', '不是', '卓越', '的', '個人', '所', '做出', '的', '孤立', '的', '思考', '，', '而是', '曾經', '有', '各種', '體系', '盛行', '過', '的', '各種', '社會', '性格', '的', '產物', '與', '成因', '。', '這', '一', '目的', '就', '要求', '我們', '對於', '一般', '歷史',

----------------------------------------------

>>> for line in lines:
... words = pseg.cut(line)
... print(words)
...
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>
<generator object cut at 0x0000019655658048>
<generator object cut at 0x00000196556580A0>

>>>

--------------------------------------------

>>> import jieba.posseg as pseg
>>> import re
>>> l = []
>>> m = []
>>> f = open("E:/序言.txt",'r').read()
>>> c = "。"
>>> lines = f.split(c)
>>> s = open("E:/序言++.txt",'a+')
>>> for line in lines:
...   words = pseg.cut(line)
...   for w in words:
...     x = w.word
...     y = w.flag
...     print(x,y,file = s)
...
Building prefix dict from the default dictionary ...
Dumping model to file cache C:\Users\oil\AppData\Local\Temp\jieba.cache
Loading model cost 1.096 seconds.
Prefix dict has been built succesfully.
>>> s.close()
>>>

難道要把txt分割嘛？，越來月麻煩了 = =，暫時沒有解決，也九先放一下了，這樣的詞性標註就對我來說一點用都沒有了= =暫時

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

python【】詞性標註橫排

如何使用 JS 判斷用戶是否處於活躍狀態

Mono 支持LoongArch架構

lightdb秒級增加列和刪除列（not null帶默認值）

lightdb數據庫超時相關控制參數

通過HPA+CronHPA組合應對業務複雜彈性伸縮場景

❤️‍🔥 Solon Cloud Event 新的事務特性與應用

lightdb mysql 8.0兼容之不可見主鍵

使用 JS 實現在瀏覽器控制檯打印圖片 console.image()

基於Ubuntu-22.04安裝K8s-v1.28.2實驗（四）使用域名訪問網站應用

【微信小程序】數據局部刷新，十個字三句話搞定

【PHP+MYSQL】php網頁要如何向數據庫傳輸數據？

【微信小程序】tab 切換可用頂部導航

【微信小程序】從雲端獲取圖片下載，轉入本地圖片臨時路徑

【微信小程序】雲數據頁面上拉加載數據

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結