使用 Python 制作酷炫多彩的 Jenkins 插件词云图

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Jenkins 插件名称高频关键词有哪些?一切尽在酷炫多彩的词云图"}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"作为最流行的 CI/CD 工具,Jenkins 的优势之一是其生态强大,而这与其插件体系分不开的。 目前 Jenkins 插件 1500+ (截止2020年06月17日,插件数量为1749)。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"近日发现词云比较好玩,于是想着以 Jenkins 插件名称为数据源,形成的词云会是什么样的呢,什么关键字会比较突出呢? 想到就去做,带着问题,带着好奇心,开始了实践之旅~"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"插件基本字段说明"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"以 Jenkins 中文本地化插件为例,在 Jenkins 官网插件详情页面可以看出: 其 ID 为 localization-zh-cn,Name 为 Localization: Chinese (Simplified)。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/1f/1f4bcd81d78de816ea0024a92bb6573c.png","alt":"localization-zh-cn","title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"获取所有 Jenkins 插件的名称"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如何获取所有 Jenkins 插件的名称呢?这里我想到3种方式,或许还有更多方式:"}]},{"type":"bulletedlist","content":[{"type":"listitem","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"link","attrs":{"href":"https://plugins.jenkins.io/","title":null},"content":[{"type":"text","text":"插件官网"}]},{"type":"text","text":"爬虫抓取"}]}]},{"type":"listitem","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"link","attrs":{"href":"https://github.com/jenkins-infra/repository-permissions-updater","title":null},"content":[{"type":"text","text":"插件权限文件"}]},{"type":"text","text":"获取"}]}]},{"type":"listitem","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"link","attrs":{"href":"http://updates.jenkins-ci.org/update-center.json","title":null},"content":[{"type":"text","text":"插件更新中心配置文件"}]},{"type":"text","text":"获取"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对比上面的三种方式,插件权限文件中并没有 Name 字段,插件更新中心配置文件相对从插件官网抓取比较简单。 所以计划从 update-center.json 进行解析,其中插件名称在 json 中对应字段为 title。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/3e/3e4aabcb4b9cedd8a4cbd039bffc5d6b.png","alt":"update-center-json","title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"生成 Jenkins 插件名称文件"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"读取 update-center.json 中 plugin 的 title 字段,按行写入到 jenkins-plugins.txt 文件,代码如下:"}]},{"type":"codeblock","attrs":{"lang":null},"content":[{"type":"text","text":"# -*- coding: UTF-8 -*-\nimport json\n\n\nif __name__ == \"__main__\":\n json_obj = json.load(open(\"update-center.json\", \"r\"))\n plugins_obj = json_obj[\"plugins\"]\n with open(\"jenkins-plugins.txt\", \"w\") as fw:\n for plugin_name in plugins_obj:\n plugin_obj = plugins_obj[plugin_name]\n print plugin_obj[\"title\"]\n fw.write(plugin_obj[\"title\"].encode('utf-8') + \"\\n\")\n"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"jenkins-plugins.txt 文件共有 1749 行(与 Jenkins 1749个插件对应),其内容截图如下:"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/2a/2a15e4e98c8f79753ea6588df23d6e97.png","alt":"jenkins-plugins-txt","title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"生成词云图"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"这里使用 Python 代码生成词云图,词的来源为 jenkins-plugins.txt,代码如下:"}]},{"type":"codeblock","attrs":{"lang":null},"content":[{"type":"text","text":"# -*- coding: UTF-8 -*-\n\nfrom wordcloud import WordCloud\nimport matplotlib.pyplot as plt\nimport numpy as np\nfrom PIL import Image\n\n\ndef generate_word_cloud_image(background_image):\n # mask\n mask = np.array(Image.open(background_image))\n\n # generate word cloud\n wc = WordCloud(mask=mask, scale=1.5, mode='RGBA', background_color=\"white\", max_words=2000).generate(text=text)\n\n # show word cloud\n plt.imshow(wc, interpolation='bilinear')\n plt.axis('off')\n plt.show()\n\n # save to file\n background_image = str(background_image).split(\"/\")[-1].replace(\".png\", \"\")\n wc.to_file(\"word-cloud-img/\" + background_image + '-word-cloud.png')\n\n\nif __name__ == \"__main__\":\n with open('jenkins-plugins.txt') as f:\n text = f.read()\n\n for image_name in [\"background-img/kongfu.png\", \"background-img/jenkins-logo.png\"]:\n generate_word_cloud_image(background_image=image_name)\n"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"生成的词云图各个关键字以不同大小和比例,绘制出一幅多彩的画卷,感觉很美观、很酷炫!"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"以 Jenkins logo 为背景图片,生成的词云图如下:"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/bc/bc1bdbf471c2059a96efb9a7f6afa130.png","alt":null,"title":"","style":[{"key":"width","value":"100%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"以 Jenkins 中文社区的 kongfu 为背景图片,生成的词云图如下:"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/d1/d1c2785f94e4f72e38acdf5e3805c6fc.png","alt":"kongfu-word-cloud","title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"从词云图中可以看出,Pipeline 词频最高,Build、API、Job 次之,紧接着还有 Publisher、Notifier、Trigger、Step、GitHub 等等。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"资源说明"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"所有代码及文件可以在 GitHub 仓库找到,如果你也觉得好玩有趣,顺便点个 star 吧,谢谢~"}]},{"type":"bulletedlist","content":[{"type":"listitem","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"link","attrs":{"href":"https://github.com/donhui/jenkins-plugins-word-cloud","title":null},"content":[{"type":"text","text":"https://github.com/donhui/jenkins-plugins-word-cloud"}]}]}]}]},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/62/62744ccd1ba761f7b41ea2818304bdc1.jpeg","alt":null,"title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章