{"id":149967,"date":"2020-09-02T20:27:59","date_gmt":"2020-09-02T12:27:59","guid":{"rendered":"http:\/\/4563.org\/?p=149967"},"modified":"2020-09-02T20:27:59","modified_gmt":"2020-09-02T12:27:59","slug":"elasticsearch-%e7%b4%a2%e5%bc%95-html-%e6%96%87%e6%a1%a3%e6%9c%89%e4%bb%80%e4%b9%88%e6%af%94%e8%be%83%e5%a5%bd%e7%9a%84%e5%ae%9e%e8%b7%b5%e6%96%b9%e6%a1%88","status":"publish","type":"post","link":"http:\/\/4563.org\/?p=149967","title":{"rendered":"What are good practices for indexing HTML documents in Elasticsearch?"},"content":{"rendered":"<div>\n<div>\n<div>\n<h1> What are good practices for indexing HTML documents in Elasticsearch? <\/h1>\n<p> <\/p>\n<div>\n<div> <span>Senior member : NULL2020 <\/span>  <span><i><\/i> 16<\/span> <\/div>\n<div> <\/div>\n<\/p><\/div>\n<\/p><\/div>\n<\/p><\/div>\n<div isfirst=\"1\"> At index time you can define a custom analyzer and add html_strip so that HTML tags are filtered out and not tokenized, but the document's _source still contains the HTML tags. We need highlighting, and the highlighted text for a hit is taken from _source. If the matched text has HTML tags before or after it, the returned highlight fragment may contain tags too, and a tag may even be cut off partway, so the frontend cannot render the result correctly.<\/p>\n<p>The data is synced from MySQL to ES with a Logstash pipeline. The one approach I have come up with so far is to add a mutate filter to the pipeline that strips all HTML 
tags, so that the frontend only receives plain text. From a quick look at the indexed data, this basically meets the requirement.<br \/>If the tags are not stripped before the data is written to ES, is there a way to strip them when the search results are returned?<\/p>\n<p>mutate only strips the HTML tags. The documents also contain some URLs (possibly with URL parameters), and these seem to get indexed as well. Is there a way to avoid indexing URLs and the URL parameters that follow them?<\/p>\n<p>In short, how do search engines usually strip HTML tags while still returning data that the frontend can display cleanly?      
<\/div>\n<div> <b>Replies<\/b> (<span>1<\/span>)        <\/div>\n<div> <\/div>\n<\/p><\/div>\n<\/p><\/div>\n<ul>\n<li data-pid=\"2974167\" data-uid=\"2\">\n<div>\n<div>\n<div> <span>OP<\/span> <span>Senior member : NULL2020 <\/span>  <\/div>\n<div> <i title=\"Quote\"><\/i>  <span>          <\/span> <\/div>\n<\/p><\/div>\n<div> emmm, is there no expert who can offer some advice? <\/div>\n<\/p><\/div>\n<\/li>\n<li>\n","protected":false},"excerpt":{"rendered":"<p>Elasticsearch indexing &hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[],"tags":[],"_links":{"self":[{"href":"http:\/\/4563.org\/index.php?rest_route=\/wp\/v2\/posts\/149967"}],"collection":[{"href":"http:\/\/4563.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/4563.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/4563.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/4563.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=149967"}],"version-history":[{"count":0,"href":"http:\/\/4563.org\/index.php?rest_route=\/wp\/v2\/posts\/149967\/revisions"}],"wp:attachment":[{"href":"http:\/\/4563.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=149967"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/4563.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=149967"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/4563.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=149967"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}