Python Example: Word Cloud of a Government Work Report
Problem Analysis
- Get an intuitive overview of a policy document
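Example Walkthrough
- Basic approach
- Step 1: Read the file and segment the text into words
- Step 2: Configure and output the word cloud
- Step 3: Inspect the result, then refine and iterate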
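The idea behind steps 1 and 2 can be sketched with the standard library alone. This is only an illustration, not what the wordcloud library does internally line for line: the token list is a hypothetical stand-in for the output of jieba.lcut, and the counting is a simplified picture of how more frequent words end up drawn larger.

```python
from collections import Counter

# Hypothetical stand-in for the list returned by jieba.lcut(t)
tokens = ["發展", "經濟", "發展", "改革", "經濟", "發展"]

# Step 1: join the segmented words with spaces so they are whitespace-delimited
txt = " ".join(tokens)

# Step 2 (conceptually): count occurrences; frequent words get larger type
counts = Counter(txt.split())
top_words = counts.most_common(2)
print(top_words)
```

Chinese text has no spaces between words, which is why the segmentation step is needed before the text is handed to the word cloud.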
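Code:
#GovRptWordCloudv1.py
import jieba
import wordcloud

# Read the whole report as UTF-8 text
with open("新時代中國特色社會主義.txt", "r", encoding="utf-8") as f:
    t = f.read()

# Segment the Chinese text into words, then join them with spaces
# so that the words are whitespace-delimited
ls = jieba.lcut(t)
txt = " ".join(ls)

# font_path must point to a font with Chinese glyphs (msyh.ttc is Microsoft YaHei)
w = wordcloud.WordCloud(
    font_path="msyh.ttc",
    width=1000, height=700,
    background_color="white",
)
w.generate(txt)
w.to_file("grwordcloud.png")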
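Output (saved as grwordcloud.png):
When creating the WordCloud object in this code, add max_words=15 to limit the cloud to the 15 most frequent words.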
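A minimal sketch of that change follows. The settings are the ones from the v1 script plus max_words=15; the names WORDCLOUD_SETTINGS and render are hypothetical and introduced only for illustration, and render assumes the wordcloud package and the msyh.ttc font are available.

```python
# Settings from the v1 script, plus the max_words cap described above.
# WORDCLOUD_SETTINGS is a hypothetical name used only in this sketch.
WORDCLOUD_SETTINGS = dict(
    font_path="msyh.ttc",
    width=1000,
    height=700,
    background_color="white",
    max_words=15,  # keep at most 15 words in the cloud
)


def render(text, out_path="grwordcloud.png"):
    """Hypothetical helper: draw space-separated text and save it as an image."""
    import wordcloud  # third-party; imported here so the settings can be inspected without it

    w = wordcloud.WordCloud(**WORDCLOUD_SETTINGS)
    w.generate(text)
    w.to_file(out_path)
    return w
```

Calling render(txt) with the txt string built in the v1 script would then write the reduced word cloud to grwordcloud.png.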
Result:
A Word Cloud with a Shape
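Code:
#GovRptWordCloudv2.py
import jieba
import wordcloud
import numpy as np
from PIL import Image

# scipy.misc.imread was removed in SciPy 1.2, so load the mask image with Pillow
mask = np.array(Image.open("chinamap.jpg"))
excludes = {}  # placeholder for words to exclude; not used in this version

with open("新時代中國特色社會主義.txt", "r", encoding="utf-8") as f:
    t = f.read()

ls = jieba.lcut(t)
txt = " ".join(ls)

# With a mask, words are drawn only on its non-white areas,
# and the mask's shape is used instead of width and height
w = wordcloud.WordCloud(
    width=1000, height=700,
    background_color="white",
    font_path="msyh.ttc", mask=mask,
)
w.generate(txt)
w.to_file("grwordcloudm.png")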
Output (saved as grwordcloudm.png):
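The v2 script defines excludes but never uses it. One plausible way to put it to work during step 3 is to filter the segmented words before joining them, as sketched below. The token list, the contents of excludes, the helper name filter_tokens, and the rule of dropping single-character tokens are all assumptions made for illustration; the source does not say how excludes was meant to be used. WordCloud also accepts a stopwords argument, which could serve a similar purpose.

```python
# Hypothetical stand-in for the list returned by jieba.lcut(t)
tokens = ["我們", "的", "發展", ",", "和", "經濟", "發展"]

# Hypothetical exclusion set; the source leaves excludes empty
excludes = {"我們"}


def filter_tokens(tokens, excludes):
    """Keep tokens longer than one character that are not in excludes."""
    return [w for w in tokens if len(w) > 1 and w not in excludes]


# Join the surviving words with spaces, as in the original scripts
txt = " ".join(filter_tokens(tokens, excludes))
print(txt)
```

Dropping single-character tokens removes most punctuation and particles in one pass, at the cost of also discarding any meaningful one-character words.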