使用C++11 Locale轉換文本編碼

原創

2018-09-11 08:53

C++11標準增加了一些新的類模板用於改進對國際化和本地化的支持。其中，std::wstring_convert、std::codecvt_utf8等類的出現解決了以往C++難以實現在Unicode到UTF-8以及CJK等本地多字節編碼之間轉換文本的問題，現在終於不用再去勞煩第三方庫和MultiByteToWideChar/WideCharToMultiByte等繁瑣的WindowsAPI了。

下面的代碼演示瞭如何利用這些新類，結合原有的codecvt機制，將UTF-8編碼的原始字符串轉換爲Unicode，然後再轉換爲中文GBK編碼。

#include<tchar.h>
#include<locale>
#include<codecvt>
#include<iostream>

int_tmain(intargc,_TCHAR*argv[])
{
std::stringmystring("\xe4\xb8\xad\xe6\x96\x87");//UTF-8編碼的“中文”字符串
std::wstring_convert<std::codecvt_utf8<wchar_t>>cvt_utf8;//UTF-8<->Unicode轉換器
std::wstring_convert<std::codecvt<wchar_t,char,std::mbstate_t>>cvt_ansi(newstd::codecvt<wchar_t,char,std::mbstate_t>("CHS"));//GBK<->Unicode轉換器
std::wstringws=cvt_utf8.from_bytes(mystring);//UTF-8轉換爲Unicode
std::stringmyansistr=cvt_ansi.to_bytes(ws);//Unicode轉換爲GBK
std::cout<<myansistr<<std::endl;
return0;
}

注：以上代碼在VisualStudio2010SP1/2012環境下編譯通過。

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

使用C++11 Locale轉換文本編碼

PDManer [元數建模]-v4.9.0 發佈：一款簡單好用的數據庫建模平臺

使用neovim打造go ide(支持代碼跳轉, 代碼補全, 實時語法檢查)

sql求連續值問題

cs01 CSS Syntax

挑戰程序設計競賽 2.3章習題 poj 3046 Ant Counting

[MASM拾遺]Offset僞指令

h30 HTML Layout Elements

瞭解顯卡

一款基於C#開發的通訊調試工具（支持Modbus RTU、MQTT調試）

Linux/Golang/glibC系統調用

使用Visual Studio 2010編譯Firebird 2.5.2源代碼

我的友情鏈接

Visual C++ 2012編譯器更新(預覽版)發佈

使用C++11 Locale轉換文本編碼

Windows下跨VC版本編譯.pyd擴展(extension)模塊

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結