CDH6.2集成sparkThrift服務 參考 https://blog.csdn.net/qq_34864753/article/details/102729859
公司網絡組爲了安全起見購買了 華爲的防火牆,安裝上後 啓動的sparkThrift服務隔2小時10分鐘後就會斷開,而且是正常斷開
sparkThrift日誌
2020-01-17 13:25:25 INFO HiveSessionImpl:318 - Operation log session directory is created: /var/log/hive/operation_logs/5f686f2b-c5fe-4e4b-813c-5e6596ef0f68
2020-01-17 13:33:56 ERROR YarnClientSchedulerBackend:70 - YARN application has exited unexpectedly with state SUCCEEDED! Check the YARN application logs for more details.
2020-01-17 13:33:56 INFO HiveServer2:112 - Shutting down HiveServer2
2020-01-17 13:33:56 INFO ThriftCLIService:188 - Thrift server has stopped
2020-01-17 13:33:56 INFO AbstractService:125 - Service:ThriftBinaryCLIService is stopped.
2020-01-17 13:33:56 INFO AbstractService:125 - Service:OperationManager is stopped.
2020-01-17 13:33:56 INFO AbstractService:125 - Service:SessionManager is stopped.
2020-01-17 13:33:56 INFO AbstractConnector:318 - Stopped Spark@201aa8c1{HTTP/1.1,[http/1.1]}{0.0.0.0:4041}
2020-01-17 13:33:56 INFO SparkUI:54 - Stopped Spark web UI at http://zmbd-vpc-wk01:4041
2020-01-17 13:33:56 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 2.
2020-01-17 13:33:56 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 3.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 14.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 11.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 15.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 10.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 5.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 13.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 4.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 6.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 12.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 1.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 7.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 8.
2020-01-17 13:33:57 INFO YarnSchedulerBackend$YarnDriverEndpoint:54 - Disabling executor 9.
2020-01-17 13:34:06 INFO AbstractService:125 - Service:CLIService is stopped.
2020-01-17 13:34:06 INFO AbstractService:125 - Service:HiveServer2 is stopped.
2020-01-17 13:34:30 INFO ThriftCLIService:107 - Session disconnected without closing properly, close it now
2020-01-17 13:34:30 ERROR HiveSessionImpl:691 - Failed to cleanup session log dir: SessionHandle [5675bfe5-18e4-45d5-8ce3-e83de8c09530]
java.io.FileNotFoundException: File does not exist: /var/log/hive/operation_logs/5675bfe5-18e4-45d5-8ce3-e83de8c09530
從日誌上看是正常退出的,yarn上的日誌錯誤顯示爲 與sparkThrift的driver端連接斷開
百度,谷歌搜不到原因,本來好好的服務突然就出現了異常,那就看網絡組或者安全組做了什麼變更,公司的網絡組剛安裝了防火牆,只能讓他們幫忙查找問題,最終定位到的問題是防火牆的策略導致的,隔段時間連接就會老化,後面更新了策略服務就正常了
說下總結的經驗吧:大家在服務出現問題排查過程中,可能就不是自己的問題,不要在查找問題上糾結很久,如果查了2、3個小時還找不出問題,那就問一下與服務相關方有沒有動過什麼,網絡,安全都有可能會影響到服務的穩定性,更多有經驗的人蔘與到查找問題,問題就越容易解決