Calling a Server from a Hive UDF in Practice: Garbled Characters

Use case

Data is pulled in bulk from the Hive data warehouse and sent to a server through HttpURLConnection inside a UDF.

Problem

Some of the Chinese text received by the server is garbled.

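Investigation

  • 1. Queried the MySQL database: the source data is not garbled and is encoded as UTF-8.
  • 2. Queried the Hive data warehouse: the data is not garbled and is encoded as UTF-8.
  • 3. Preliminary conclusion: the garbling happens during the HttpURLConnection call.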
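
One way to see why a charset mismatch on the sending side would produce this symptom is to encode the same JSON body with two different charsets and decode both as UTF-8, the way a UTF-8 server would. The sketch below is only an illustration: the class name CharsetMismatchDemo, the helper roundTrip, and the sample body are hypothetical, and ISO-8859-1 merely stands in for whatever non-UTF-8 default charset a JVM might be running with.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of the original UDF.
class CharsetMismatchDemo {

    /** Encodes text with senderCharset, then decodes the bytes as UTF-8 (the server's view). */
    static String roundTrip(String text, Charset senderCharset) {
        byte[] wire = text.getBytes(senderCharset);
        return new String(wire, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Unicode escapes for two Chinese characters keep this file independent of source encoding.
        String body = "{\"name\":\"\u4e2d\u6587\"}";

        // Sender and receiver agree on UTF-8: the text survives.
        System.out.println(roundTrip(body, StandardCharsets.UTF_8));

        // Sender uses a charset that cannot represent Chinese: the characters are lost.
        System.out.println(roundTrip(body, StandardCharsets.ISO_8859_1));

        // The charset a writer picks up when none is specified explicitly.
        System.out.println("default charset: " + Charset.defaultCharset());
    }
}
```

Printing Charset.defaultCharset() inside the UDF on the nodes that run it is a cheap way to check whether this is what is happening in a given environment.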

Solution

Code before the change (excerpt):

URL url = new URL(requestUrl);
httpURLConnection = (HttpURLConnection) url.openConnection();
// No charset is declared in the header, so the server has to guess the body encoding.
httpURLConnection.setRequestProperty("Content-type", "application/json");
httpURLConnection.setDoOutput(true);
httpURLConnection.setDoInput(true);
outputStream = httpURLConnection.getOutputStream();
// PrintWriter(OutputStream) encodes with the JVM default charset, which is not guaranteed to be UTF-8.
printWriter = new PrintWriter(outputStream);
printWriter.print(body);
printWriter.flush();
printWriter.close();

Code after the change (excerpt):

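URL url = new URL(requestUrl);
httpURLConnection = (HttpURLConnection) url.openConnection();
// Declare the body encoding explicitly so the server decodes it as UTF-8.
httpURLConnection.setRequestProperty("Content-type", "application/json;charset=UTF-8");
httpURLConnection.setDoOutput(true);
httpURLConnection.setDoInput(true);
dataOutputStream = new DataOutputStream(httpURLConnection.getOutputStream());
// Encode the body as UTF-8 bytes instead of relying on the JVM default charset.
dataOutputStream.write(body.getBytes("UTF-8"));
dataOutputStream.flush();
// Close the stream that was actually opened (the excerpt previously closed printWriter here).
dataOutputStream.close();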
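
For reference, the same fix can be written as a self-contained helper. This is a minimal sketch rather than the original UDF code: the class name Utf8JsonPoster and the methods encodeBody and postJson are hypothetical, and the explicit setRequestMethod("POST"), setFixedLengthStreamingMode, and try-with-resources cleanup are additions that do not appear in the excerpt above.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of the original UDF.
class Utf8JsonPoster {

    /** Encodes the JSON body as UTF-8 regardless of the JVM default charset. */
    static byte[] encodeBody(String body) {
        return body.getBytes(StandardCharsets.UTF_8);
    }

    /** POSTs body to requestUrl as UTF-8 JSON and returns the HTTP status code. */
    static int postJson(String requestUrl, String body) throws IOException {
        byte[] payload = encodeBody(body);
        HttpURLConnection conn = (HttpURLConnection) new URL(requestUrl).openConnection();
        try {
            conn.setRequestMethod("POST");
            // Tell the server which charset the body bytes use.
            conn.setRequestProperty("Content-Type", "application/json;charset=UTF-8");
            conn.setDoOutput(true);
            conn.setFixedLengthStreamingMode(payload.length);
            // try-with-resources closes the stream even if write() throws.
            try (OutputStream out = conn.getOutputStream()) {
                out.write(payload);
            }
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }
}
```

Using StandardCharsets.UTF_8 instead of the string "UTF-8" avoids the checked UnsupportedEncodingException, and writing raw bytes to the connection's OutputStream keeps any Writer with an implicit charset out of the path.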