今天小編就爲大家分享一篇關於安裝elasticsearch-analysis-ik中文分詞器的步驟講解,小編覺得內容挺不錯的,現在分享給大家,具有很好的參考價值,需要的朋友一起跟隨小編來看看吧
1 安裝elasticsearch-analysis-ik中文分詞器
Ik介紹:ik是一款中文的分詞插件,支持自定義詞庫。
1.1 下載ik分詞器
下載地址:https://github.com/medcl/elasticsearch-analysis-ik/releases
下載指定版本的分詞器(zip版本)
1.2 解壓ik分詞器
[es@bigdata1 plugins]$ cd /home/es/ [es@bigdata1 ~]$ ls elasticsearch-6.2.2 jdk1.8.0_161 kibana-6.2.2-linux-x86_64.tar.gz elasticsearch-6.2.2.tar.gz jdk-8u161-linux-x64.tar.gz logstash-6.2.2.tar.gz elasticsearch-analysis-ik-6.2.2.zip kibana-6.2.2-linux-x86_64 [es@bigdata1 ~]$ unzip elasticsearch-analysis-ik-6.2.2.zip Archive: elasticsearch-analysis-ik-6.2.2.zip creating: elasticsearch/ inflating: elasticsearch/elasticsearch-analysis-ik-6.2.2.jar inflating: elasticsearch/httpclient-4.5.2.jar inflating: elasticsearch/httpcore-4.4.4.jar inflating: elasticsearch/commons-logging-1.2.jar inflating: elasticsearch/commons-codec-1.9.jar inflating: elasticsearch/plugin-descriptor.properties creating: elasticsearch/config/ inflating: elasticsearch/config/extra_single_word_full.dic inflating: elasticsearch/config/quantifier.dic inflating: elasticsearch/config/IKAnalyzer.cfg.xml inflating: elasticsearch/config/main.dic inflating: elasticsearch/config/extra_single_word_low_freq.dic inflating: elasticsearch/config/extra_stopword.dic inflating: elasticsearch/config/preposition.dic inflating: elasticsearch/config/extra_main.dic inflating: elasticsearch/config/extra_single_word.dic inflating: elasticsearch/config/suffix.dic inflating: elasticsearch/config/surname.dic inflating: elasticsearch/config/stopword.dic [es@bigdata1 ~]$ ls elasticsearch elasticsearch-analysis-ik-6.2.2.zip kibana-6.2.2-linux-x86_64 elasticsearch-6.2.2 jdk1.8.0_161 kibana-6.2.2-linux-x86_64.tar.gz elasticsearch-6.2.2.tar.gz jdk-8u161-linux-x64.tar.gz logstash-6.2.2.tar.gz [es@bigdata1 ~]$ mv elasticsearch/* ./elasticsearch-6.2.2/plugins/ik/ [es@bigdata1 ~]$ vim ./elasticsearch-6.2.2/config/elasticsearch.yml
1.3 重新啓動Elasticsearch服務
ps -ef | grep elasticsearch
然後kill掉相關的進程。
然後按照18.8的方式啓動elasticsearch進程
1.4 然後執行以下命令
GET _analyze { "analyzer":"ik_smart", "text":"2018年5月全球編程語言排行榜" }
運行結果是:
{
"tokens": [
{
"token": "2018年",
"start_offset": 0,
"end_offset": 5,
"type": "TYPE_CQUAN",
"position": 0
},
{
"token": "5月",
"start_offset": 5,
"end_offset": 7,
"type": "TYPE_CQUAN",
"position": 1
},
{
"token": "全球",
"start_offset": 7,
"end_offset": 9,
"type": "CN_WORD",
"position": 2
},
{
"token": "編程",
"start_offset": 9,
"end_offset": 11,
"type": "CN_WORD",
"position": 3
},
{
"token": "語言",
"start_offset": 11,
"end_offset": 13,
"type": "CN_WORD",
"position": 4
},
{
"token": "排行榜",
"start_offset": 13,
"end_offset": 16,
"type": "CN_WORD",
"position": 5
}
]
}
Kibana的效果圖
總結
以上就是這篇文章的全部內容了,希望本文的內容對大家的學習或者工作具有一定的參考學習價值,謝謝大家對神馬文庫的支持。如果你想了解更多相關內容請查看下面相關鏈接