Spark PruneDependency: a Partition-Pruning Dependency
- Represents a dependency between the PartitionPruningRDD and its parent. In this case, the child RDD contains a subset of the parent's partitions.
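To make that description concrete, here is a minimal, Spark-free sketch of the idea behind `PruneDependency`: the child keeps only the parent partitions that pass a partition filter function, and each child partition depends on exactly one parent partition. The class and member names below are illustrative; the real class is a `NarrowDependency` in `org.apache.spark.rdd` and operates on `Partition` objects rather than bare indices.

```scala
// Illustrative sketch only; not Spark's actual PruneDependency class.
class PruneDependencySketch(parentPartitions: Array[Int],
                            partitionFilterFunc: Int => Boolean) {
  // The child RDD keeps only the parent partitions passing the filter.
  val partitions: Array[Int] = parentPartitions.filter(partitionFilterFunc)

  // Narrow dependency: each child partition maps to a single parent partition.
  def getParents(childPartitionId: Int): List[Int] =
    List(partitions(childPartitionId))
}

object PruneDemo {
  def main(args: Array[String]): Unit = {
    // Parent has partitions 0..3; keep only those with index < 2.
    val dep = new PruneDependencySketch(Array(0, 1, 2, 3), i => i < 2)
    println(dep.partitions.mkString(","))    // 0,1
    println(dep.getParents(1).mkString(",")) // 1
  }
}
```

Because the mapping from child partition to parent partition is one-to-one, no shuffle is needed: pruning simply skips whole partitions.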
More resources
- Spark source-code analysis tech talks (bilibili video series): https://www.bilibili.com/video/av37442139/
- github: https://github.com/opensourceteams/spark-scala-maven
- CSDN (video index, watch online): https://blog.csdn.net/thinktothings/article/details/84726769
YouTube video demo
- https://youtu.be/5ZCNiEhO_Qg (YouTube)
- https://www.bilibili.com/video/av37442139/?p=3 (bilibili)
Input data
List(("a",2),("d",1),("b",8),("d",3))
Scala program
```scala
package com.opensource.bigdata.spark.local.rdd.operation.dependency.narrow.n_03_pruneDependency.n_03_filterByRange_filter

import com.opensource.bigdata.spark.local.rdd.operation.base.BaseScalaSparkContext

object Run extends BaseScalaSparkContext {

  def main(args: Array[String]): Unit = {
    val sc = pre()
    val rdd1 = sc.parallelize(List(("a", 2), ("d", 1), ("b", 8), ("d", 3)), 2) // ParallelCollectionRDD
    val rdd2 = rdd1.filterByRange("a", "b") // MapPartitionsRDD
    println("rdd \n" + rdd2.collect().mkString("\n"))
    sc.stop()
  }
}
```
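Note that `filterByRange` only builds a `PartitionPruningRDD` (and thus a `PruneDependency`) when the parent RDD is range-partitioned, for example after `sortByKey`; on the unsorted `rdd1` above it falls back to a plain `filter` over every partition, which is why the resulting RDD is a `MapPartitionsRDD`. Below is a hedged, Spark-free sketch of the pruning decision itself, assuming the `RangePartitioner` convention that `rangeBounds` holds the inclusive upper bound of each partition except the last (the names here are illustrative, not Spark's exact API):

```scala
// Illustrative sketch of which partitions filterByRange could keep when the
// parent is range-partitioned. Not Spark's actual implementation.
object RangePruneSketch {

  // rangeBounds(i) is the (inclusive) upper bound of partition i; the last
  // partition (index rangeBounds.length) is unbounded above.
  def prunedPartitions(rangeBounds: Seq[String],
                       lower: String, upper: String): Seq[Int] = {
    val numPartitions = rangeBounds.length + 1

    // First partition whose upper bound is >= key; last partition otherwise.
    def getPartition(key: String): Int = {
      val i = rangeBounds.indexWhere(bound => key <= bound)
      if (i < 0) numPartitions - 1 else i
    }

    // Only partitions whose key range overlaps [lower, upper] can contain hits.
    (getPartition(lower) to getPartition(upper)).toSeq
  }

  def main(args: Array[String]): Unit = {
    // Two partitions split at "b": keys <= "b" land in partition 0.
    println(prunedPartitions(Seq("b"), "a", "b")) // keeps only partition 0
  }
}
```

In the sorted case, pruning means partitions that cannot contain keys in `["a", "b"]` are never read at all, whereas the unsorted fallback must scan every partition.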