import java.util.Properties

import org.apache.flink.api.common.serialization.SimpleStringSchema
import org.apache.flink.streaming.api.functions.source.SourceFunction
import org.apache.flink.streaming.api.scala._
import org.apache.flink.streaming.connectors.kafka.FlinkKafkaConsumer011

import scala.util.Random

object SourceDemo {

  def main(args: Array[String]): Unit = {
    val env = StreamExecutionEnvironment.getExecutionEnvironment
    // Data sources
    // 1. Read from a file
    val inpath = "D:\\programs\\sparkPrograms\\FlinkProgarm\\src\\main\\resources\\hello.txt"
    val stream1 = env.readTextFile(inpath)
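    // readTextFile is a bounded source: with the default PROCESS_ONCE mode the
    // stream ends once the file has been read. Uncomment to inspect the lines:
    // stream1.print("stream1")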
    // 2. Read from a socket stream
    val stream2 = env.socketTextStream("hadoop01", 7777)
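    // To feed this source while testing, open a netcat listener on the
    // (assumed) host first, e.g. run `nc -lk 7777` on hadoop01.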
    // 3. Read from Kafka
    val properties = new Properties()
    properties.setProperty("bootstrap.servers", "hadoop01:9092")
    properties.setProperty("group.id", "consumer-group")
    properties.setProperty("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer")
    properties.setProperty("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer")
    properties.setProperty("auto.offset.reset", "latest")
    val stream3 = env.addSource(
      new FlinkKafkaConsumer011[String]("sensor", new SimpleStringSchema(), properties))
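    // Note: FlinkKafkaConsumer011 needs the flink-connector-kafka-0.11 artifact on
    // the classpath (see the build sketch at the end of this file). The key/value
    // deserializer properties above are effectively ignored: Flink decodes record
    // values with the supplied DeserializationSchema (SimpleStringSchema here).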
    // stream3.print("stream3")

    // 4. Custom source
    val stream4: DataStream[SensorReading] = env.addSource(new MySensorSource())
    stream4.print("Stream4")
    env.execute()
  }
  // A single sensor reading: sensor id, event timestamp (ms), temperature
  case class SensorReading(id: String, timestamp: Long, temperature: Double)
  class MySensorSource extends SourceFunction[SensorReading] {

    // Flag indicating whether the source is still running. cancel() is invoked
    // from a different thread than run(), so the flag is marked @volatile.
    @volatile var running: Boolean = true
    override def run(sourceContext: SourceFunction.SourceContext[SensorReading]): Unit = {
      // Initialize a random number generator
      val rand = new Random()
      // Initial temperatures: nextGaussian() draws from a standard normal
      // distribution, so roughly 95% of values fall within +-2 sigma of the mean
      var curTemp = (1 to 10).map(
        i => ("sensor_" + i, 65 + rand.nextGaussian() * 20)
      )
      while (running) {
        // Update the temperatures with a small random walk
        curTemp = curTemp.map(
          t => (t._1, t._2 + rand.nextGaussian())
        )
        // Get the current timestamp
        val curTime = System.currentTimeMillis()
        // Emit the readings one by one via collect()
        curTemp.foreach(
          t => sourceContext.collect(SensorReading(t._1, curTime, t._2))
        )
        // Emit a new batch roughly every 100 ms
        Thread.sleep(100)
      }
    }

    override def cancel(): Unit = {
      running = false
    }
  }
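
  // Note: a source implementing only the plain SourceFunction interface, like
  // MySensorSource, always runs with parallelism 1. For a parallel custom source,
  // extend ParallelSourceFunction or RichParallelSourceFunction instead.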
}
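
// A minimal build.sbt sketch for this demo (artifact names from the Flink 1.7.x
// line; the version number is an assumption, match it to your cluster):
//
//   libraryDependencies ++= Seq(
//     "org.apache.flink" %% "flink-scala"                % "1.7.2",
//     "org.apache.flink" %% "flink-streaming-scala"      % "1.7.2",
//     "org.apache.flink" %% "flink-connector-kafka-0.11" % "1.7.2"
//   )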