[python] 批量替換文件夾下的文件編碼格式

原創

2020-06-26 05:50


import chardet
import os

filename = 'H:\\python\\source' #文件夾路徑
Aim_Format = 'utf-8' #目標編碼格式
code_ifo = 'xxx'

#保存文件
def write_file(content, file):
    with open(file, 'wb') as f:
        f.write(content)

#遍歷文件夾下所有文件
def get_filelist(dir, file):
    newDir = dir
    if os.path.isfile(dir):
        file.append(dir)
        Encoding_Format_Trans(dir, Aim_Format)#修改編碼格式
    elif os.path.isdir(dir):
        for s in os.listdir(dir):
            newDir = os.path.join(dir, s)
            get_filelist(newDir, file)
    return file

#獲取單個文件的編碼信息
def get_file_info(file):
    f = open(file, 'rb')
    data = f.read()
    return chardet.detect(data)['encoding'].strip()  #空文本會報錯

#編碼格式轉換
def Encoding_Format_Trans (path_name_, _Aim_Format):
    code_ifo = get_file_info(path_name_)
    print('before ', code_ifo)
    if code_ifo != _Aim_Format:
        if code_ifo == 'GB2312': #gbk的字符集更全，能解一些2312爲亂碼的文字
            code_ifo = 'gbk'
        f = open(path_name_, 'rb')
        file_decode = f.read().decode(code_ifo, 'ignore') #編碼的字符---> unicode
        file_encode = file_decode.encode(_Aim_Format) #unicode-->目標編碼格式
        write_file(file_encode, path_name_)
        code_test = get_file_info(path_name_)
        print('after ', code_test)



if __name__ == "__main__":
    list = get_filelist(filename, [])
    print('over')

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

[python] 批量替換文件夾下的文件編碼格式

vue項目獲取富文本編輯器wangEditor內容導出爲word（html轉word格式並下載）

dotnet C# 創建 X11 應用時設置窗口背景顏色

Navicat安裝與激活教程

TDengine docker安裝方法

vue3組件通信與props

sapui5

Alpine Linux apk add DNS lookup error

部分JDK版本的發佈時間

工作中用到的腳本合集

合併代碼時Beyond Compare設置

爲何有的單片機的晶振要選擇11.0592M？

TCP建立連接爲什麼是三次，斷開連接爲什麼是四次？

(轉載)Python常見字符編碼間的轉換

Python調用DLL庫

【Linux】信號量

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結