[已解决]为什么两个编码都不行？

杨龙2 · 发表于 2020-2-14 08:01:17

马上注册，结交更多好友，享用更多功能^_^

您需要登录才可以下载或查看，没有账号？立即注册

x

Traceback (most recent call last):
  File "F:/python作业/034/with.py", line 15, in <module>
differ = file_compare(file1,file2)
  File "F:/python作业/034/with.py", line 6, in file_compare
each_line2 = f2.readline()
UnicodeDecodeError: 'gbk' codec can't decode byte 0xbc in position 8: illegal multibyte sequence
>>>
====================== RESTART: F:/python作业/034/with.py ======================
请输入需要比较的头一个文件名:E:\A\1.txt
请输入输入需要比较的另一个文件名:E:\A\record.txt
Traceback (most recent call last):
  File "F:/python作业/034/with.py", line 15, in <module>
differ = file_compare(file1,file2)
  File "F:/python作业/034/with.py", line 6, in file_compare
each_line2 = f2.readline()
  File "D:\python3.7.3\lib\codecs.py", line 322, in decode
(result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbf in position 2: invalid start byte
>>>

最佳答案

月排行榜 / 总排行榜

hrp

2020-2-14 12:01:12

你的文本文件编码不一定是GBK编码和UTF-8编码，可以用记事本打开，另存为,选UTF8编码保存，你的代码再指定用UTF8打开文件。

或者用判断文件编码的模块来判断，再用正确编码打开：

import chardet
with open('xxx', 'rb') as f:
m = f.read()
ec = chardet.detect(m)
with open('xxx', 'rt', encoding=ec['encoding']) as f:
r = f.read()
print(r)

复制代码

跳转到最佳答案楼层

kirte · 发表于 2020-2-14 08:03:53

代码呢

qiuyouzhi · 发表于 2020-2-14 08:11:49

你那个文件应该是中英文都有吧

qiuyouzhi · 发表于 2020-2-14 08:13:17

你把两个文件都改成英文或者中文试试

杨龙2 · 发表于 2020-2-14 08:58:25

qiuyouzhi 发表于 2020-2-14 08:11
你那个文件应该是中英文都有吧

上面有访问路径，名字都是英文名

qiuyouzhi · 发表于 2020-2-14 10:06:48

杨龙2 发表于 2020-2-14 08:58
上面有访问路径，名字都是英文名

我说内容

yexing · 发表于 2020-2-14 10:23:13

本帖最后由 yexing 于 2020-2-14 10:38 编辑

编码问题，你试试记事本打开文件，然后文件-另存为，编码选择 utf-8 。
然后这样打开文件：

f = open("文件路径", encoding="utf-8")

复制代码

杨龙2 · 发表于 2020-2-14 11:48:49

qiuyouzhi 发表于 2020-2-14 10:06
我说内容

def file_compare(file1,file2):
count = 0
differ = []
with open(file1,encoding='utf-8') as f1, open(file2,encoding='utf-8') as f2:#打开多个文件的with格式：with open() as f1,open() as f2,..... 或者看成迭代打开文件
      for each_line1 in f1:
         each_line2 = f2.readline()
         count += 1
         if each_line1 != each_line2:
            differ.append(count)
return differ

file1 = input('请输入需要比较的头一个文件名:')
file2 = input('请输入输入需要比较的另一个文件名:')

differ = file_compare(file1,file2)

length = len(differ)
if length == 0:
print('两个文件完全相同！')
else:
print('两个文件共有【%d】处不同：'%length)
for each in differ:
      print('第%d行不一样'%each)

杨龙2 · 发表于 2020-2-14 11:49:38

yexing 发表于 2020-2-14 10:23
编码问题，你试试记事本打开文件，然后文件-另存为，编码选择 utf-8 。
然后这样打开文件：

用记事本可以打开文件，def file_compare(file1,file2):
count = 0
differ = []
with open(file1,encoding='utf-8') as f1, open(file2,encoding='utf-8') as f2:#打开多个文件的with格式：with open() as f1,open() as f2,..... 或者看成迭代打开文件
      for each_line1 in f1:
         each_line2 = f2.readline()
         count += 1
         if each_line1 != each_line2:
            differ.append(count)
return differ

file1 = input('请输入需要比较的头一个文件名:')
file2 = input('请输入输入需要比较的另一个文件名:')

differ = file_compare(file1,file2)

length = len(differ)
if length == 0:
print('两个文件完全相同！')
else:
print('两个文件共有【%d】处不同：'%length)
for each in differ:
      print('第%d行不一样'%each)
这个还是运行错误！是代码的问题吗

qiuyouzhi · 发表于 2020-2-14 12:00:33

杨龙2 发表于 2020-2-14 11:49
用记事本可以打开文件，def file_compare(file1,file2):
count = 0
differ = []

我说文件内容

就是你要比较的那两个文件！

hrp · 发表于 2020-2-14 12:01:12

你的文本文件编码不一定是GBK编码和UTF-8编码，可以用记事本打开，另存为,选UTF8编码保存，你的代码再指定用UTF8打开文件。

或者用判断文件编码的模块来判断，再用正确编码打开：

import chardet
with open('xxx', 'rb') as f:
m = f.read()
ec = chardet.detect(m)
with open('xxx', 'rt', encoding=ec['encoding']) as f:
r = f.read()
print(r)

复制代码

账号		自动登录	找回密码
密码			立即注册

[已解决]为什么两个编码都不行？

马上注册，结交更多好友，享用更多功能^_^

浏览过的版块