elasticsearch中的精準文本位置匹配

原創

煎饼皮皮侠

2020-06-16 15:49

在elasticsearch中，將長篇幅的文檔劃分爲樹形結構的段落後，有助於文本的精準位置匹配，

例如：原來的content是這樣的：

content = "一、大標題 \n 1. 一級標題 \n 1> 二級標題"

段落劃分後，是如下這樣：

content = {
    paras: [
        {
            "text": "大標題",
             "sub_paras": [
                     {
                         "text": "一級標題",
                         "sub_paras": [
                              {
                                  "text": "二級標題"
                                }
                          ]
                      }
              ]
        }
    ]
}

如果在查詢時，只想定位到文字所在的段落，可以這樣查詢：

            "query": {
                "bool": {
                    "should": [
                        {"nested": {
                            "path": "content.paras",
                            "query": {
                                "term": {
                                    "content.paras.text": "哈哈"
                                }
                            },
                            "inner_hits": {
                                "name": "inner_hit_p"
                            }
                        }},
                        {"nested": {
                            "path": "content.paras.sub_paras",
                            "query": {
                                "term": {
                                    "content.paras.sub_paras.text": "哈哈"
                                }
                            },
                            "inner_hits": {
                                "name": "inner_hit_sub_p"
                            }
                        }},
                        {"nested": {
                            "path": "content.paras.sub_paras.sub_paras",
                            "query": {
                                "term": {
                                    "content.paras.sub_paras.sub_paras.text": "哈哈"
                                }
                            },
                            "inner_hits": {
                                "name": "inner_hit_sub_sub_p"
                            }
                        }},
                    ]
                }
            }

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

elasticsearch中的精準文本位置匹配

使用skopeo同步鏡像

好玩Spring之編程式配置數據源及事務的使用

好玩Spring之@Resource的工作原理

好玩Spring之事件機制

好玩Spring之BeanFactoryPostProcessor

elasticsearch中的精準文本位置匹配

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結