df.write.format("csv").save("hdfs://s206/user/hive/warehouse/mobike.db/logs/year=2019/month=7/day=5/hour=7/minute=4")
df.show()
錯誤代碼
scala> users.select("name","favorite_color").write.format("json").save("file:///home/hadoop/tmp/parquet2json/")
org.apache.spark.sql.AnalysisException: path file:/home/hadoop/tmp/parquet2json already exists.;
at org.apache.spark.sql.execution.datasources.InsertIntoHadoopFsRelationCommand.run(InsertIntoHadoopFsRelationCommand.scala:109)
at org.apache.spark.sql.execution.command.DataWritingCommandExec.sideEffectResult$lzycompute(commands.scala:104)
........................................................
解決:
可以通過設置savemode來解決這個問題
df.write.format("csv").mode("overwrite").save("hdfs://s206/user/hive/warehouse/mobike.db/logs/year=2019/month=7/day=5/hour=7/minute=4")
df.show()