|
发表于 2020-6-14 20:18:34
|
显示全部楼层
用第三方库,要不就自己写复杂的代码判断
- 检测文件编码
- import cchardet as chardet
- # 先检测出文件编码
- with open("test.txt", "rb") as f:
- msg = f.read()
- enc = chardet.detect(msg) # 返回的是个字典 编码和准确度。如:{'encoding': 'UTF-8', 'confidence': 0.9900000095367432}
- print(enc)
- enc = enc['encoding']
- # 然后以指定编码打开文件
- with open("test.txt", "r", encoding=enc) as f:
- print(f.read())
- 网页编码判断
- import requests
- import cchardet
- res = requests.get('http://www.baidu.com/')
- rawdata = res.content
- enc = cchardet.detect(rawdata)
- enc = enc['encoding']
- print(enc)
复制代码 |
|