Spark1.5的一个bug

原創

2020-02-20 18:44

>>> 16/10/15 20:07:35 INFO YarnClientSchedulerBackend: Requesting to kill executor(s) 1
16/10/15 20:07:35 INFO ExecutorAllocationManager: Removing executor 1 because it has been idle for 60 seconds (new desired total will be 0)
16/10/15 20:07:36 ERROR YarnScheduler: Lost executor 1 on hadoop05: remote Rpc client disassociated
16/10/15 20:07:36 INFO DAGScheduler: Executor lost: 1 (epoch 0)
16/10/15 20:07:36 INFO BlockManagerMasterEndpoint: Trying to remove executor 1 from BlockManagerMaster.
16/10/15 20:07:36 INFO BlockManagerMasterEndpoint: Removing block manager BlockManagerId(1, hadoop05, 41258)
16/10/15 20:07:36 INFO BlockManagerMaster: Removed 1 successfully in removeExecutor
16/10/15 20:07:36 INFO ExecutorAllocationManager: Existing executor 1 has been removed (new total is 0)

时不时就报ERROR YarnScheduler: Lost executor 1 on hadoop05: remote Rpc client disassociated的错误。

后查证该问题是spark1.5的bug由于启用了动态分配以及回收资源，当正确的回收资源后，会报出这个错误。
这个错误不会影响集群以及计算任务的结果。
Jira地址：https://issues.apache.org/jira/browse/SPARK-4134

最好的办法是将spark升级至1.6

丑大狗

发布了189 篇原创文章 · 获赞 80 · 访问量 41万+

他的留言板关注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Spark1.5的一个bug

10分钟搞定Mysql主从部署配置

如何使用 JS 判断用户是否处于活跃状态

「Pygors跨平台GUI」2：安装MinGW-w64、MSYS2还是WSL2

[转帖]

python列出centos7内存使用前50的进程信息

「Pygors跨平台GUI」1：Pygors跨平台GUI应用研究

一键自动化博客发布工具,用过的人都说好(掘金篇)

lightdb数据库超时相关控制参数

lightdb秒级增加列和删除列（not null带默认值）

Java ThreadPoolShutdown

Linux 維護模式時磁盤爲只讀模式

創建DataGuard爲什麼要開啓force logging

kettle使用sql查詢ORA-00911無效字符

動態顯示impala sql的執行進度

反編譯python的pyc/pyo字節碼文件

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結