宜配屋

Detecting and Changing a File's Encoding with Python's chardet Library

yipeiwu_com · 6 years ago (2020-03-06) · Python Basics

First, install the chardet library. There are many ways to do this; I used the rather crude one: sudo pip install chardet
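To confirm the install worked, it can help to call chardet.detect on a few bytes before running the full script. The sketch below assumes Python 3 and relies on chardet.detect returning a dict with 'encoding' and 'confidence' keys. The helper name detect_bytes is hypothetical and not part of the original script, and the import is guarded so that the snippet still runs (returning (None, 0.0)) when chardet is missing.

```python
try:
    import chardet
except ImportError:
    chardet = None  # the helper below then reports "unknown"


def detect_bytes(data):
    """Return (encoding, confidence) for a bytes object.

    Hypothetical helper: returns (None, 0.0) when chardet is not
    installed, the input is empty, or no encoding could be guessed.
    """
    if chardet is None or not data:
        return None, 0.0
    result = chardet.detect(data)
    return result.get("encoding"), result.get("confidence") or 0.0


if __name__ == "__main__":
    # GBK-encoded sample text; longer inputs generally give better guesses.
    sample = "编码检测的示例文本，内容越长结果越可靠。".encode("gbk")
    print(detect_bytes(sample))
```

Detection is a statistical guess, so very short inputs may be reported with low confidence or as a different encoding than the one actually used.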

#!/usr/bin/env python
# coding: UTF-8
import sys
import os
import chardet

 
def print_usage():
    print('''usage:
  change_charset [file|directory] [charset] [output file]

for example:
  change_charset 1.txt utf-8 n1.txt
  change_charset 1.txt utf-8
  change_charset . utf-8
  change_charset 1.txt
''')


def get_charset(s):
    # chardet.detect returns a dict; 'encoding' is None when detection fails.
    return chardet.detect(s)['encoding']

 
def remove(file_name):
    os.remove(file_name)

 
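def change_file_charset(file_name, output_file_name, charset):
    # Read raw bytes so chardet sees the file exactly as stored on disk.
    with open(file_name, 'rb') as f:
        s = f.read()

    # Detect and convert before touching the original, so that a failed
    # detection or decode cannot destroy the input file.
    old_charset = get_charset(s)
    if old_charset is None:
        print('skipped %s: encoding could not be detected' % file_name)
        return
    u = s.decode(old_charset)
    s = u.encode(charset)

    # An empty output name means "convert in place".
    if file_name == output_file_name or output_file_name == "":
        remove(file_name)
        output_file_name = file_name

    with open(output_file_name, 'wb') as f:
        f.write(s)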

 
def do(file_name, output_file_name, charset):
    if os.path.isdir(file_name):
        # Directories are converted in place, recursing into subdirectories.
        for item in os.listdir(file_name):
            path = os.path.join(file_name, item)
            try:
                if os.path.isdir(path):
                    do(path, "", charset)
                else:
                    change_file_charset(path, "", charset)
            except (OSError, UnicodeError) as e:
                # Report the failure and continue with the next entry.
                print(e)
    else:
        change_file_charset(file_name, output_file_name, charset)

 
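if __name__ == '__main__':
    length = len(sys.argv)

    if length == 1:
        print_usage()
    elif length == 2:
        # Only a path given: convert to UTF-8 in place.
        do(sys.argv[1], "", "utf-8")
    elif length == 3:
        # Path and target charset: convert in place.
        do(sys.argv[1], "", sys.argv[2])
    elif length == 4:
        # Path, target charset, and output file.
        do(sys.argv[1], sys.argv[3], sys.argv[2])
    else:
        print_usage()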
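The script rewrites files in place, so a failure partway through a write could leave a truncated file. Below is a minimal sketch of a more defensive variant, assuming Python 3 and that the source encoding is already known (for example, from chardet.detect). The function name transcode_file and its parameters are hypothetical and not part of the original script. It writes to a temporary file in the same directory and swaps it in with os.replace.

```python
import os
import shutil
import tempfile


def transcode_file(path, src_encoding, dst_encoding="utf-8"):
    """Re-encode the file at `path` from src_encoding to dst_encoding in place."""
    with open(path, "rb") as f:
        raw = f.read()

    # Decode and encode first: if either step raises, the file is untouched.
    converted = raw.decode(src_encoding).encode(dst_encoding)

    # Write to a temporary file next to the original, then swap it in.
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(converted)
        # mkstemp creates a private file; carry over the original's mode.
        shutil.copymode(path, tmp_path)
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the temporary file if the write or the swap failed.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

For example, transcode_file("1.txt", "gbk") would rewrite 1.txt as UTF-8, and a wrong src_encoding raises UnicodeDecodeError before anything on disk changes.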



© YiPeiWu.com 【宜配屋】粤ICP备17031333号

Powered By Z-BlogPHP. Theme by TOYEAN.
