應用場景
從Hive數據倉庫批量撈取數據通過UDF中HttpURLConnection調用至服務端;
問題
服務端拿到的中文數據部分存在亂碼;
排查
- 1、查詢MySql數據庫,發現源數據非亂碼且編碼格式爲UTF-8;
- 2、查詢Hive數據倉庫,發現數據非亂碼且編碼格式爲UTF-8;
- 3、初步判斷亂碼發生在HttpURLConnection調用過程中;
解決
修改前部分代碼:
URL url = new URL(requestUrl);
httpURLConnection = (HttpURLConnection) url.openConnection();
httpURLConnection.setRequestProperty("Content-type", "application/json");
httpURLConnection.setDoOutput(true);
httpURLConnection.setDoInput(true);
outputStream = httpURLConnection.getOutputStream();
printWriter = new PrintWriter(outputStream);
printWriter.print(body);
printWriter.flush();
printWriter.close();
修改後部分代碼:
URL url = new URL(requestUrl);
httpURLConnection = (HttpURLConnection) url.openConnection();
httpURLConnection.setRequestProperty("Content-type", "application/json;charset=UTF-8");
httpURLConnection.setDoOutput(true);
httpURLConnection.setDoInput(true);
dataOutputStream = new DataOutputStream(httpURLConnection.getOutputStream());
dataOutputStream.write(body.getBytes("UTF-8"));
dataOutputStream.flush();
printWriter.close();