[已解决]写一个程序，检测指定 URL 的编码

butterX · 发表于 2019-7-12 22:03:00

马上注册，结交更多好友，享用更多功能^_^

您需要登录才可以下载或查看，没有账号？立即注册

x

代码不知道哪里出错了，运行没有报错，但是跑起来总是将指定网址直接打开了，而不返回我需要的编码类型

代码如下：

1 import urllib.request
2 import chardet

5 def main():
6    url = input('请输入网址：')
7    response = urllib.request.urlopen(url)
8    html = response.read()

10    encode = chardet.detect(html)['encoding']
11    if encode == 'GB2312':
12          encode = 'GBK'

14    print('该网页的编码是： %s' % encode)

15  if __name__ == '__main__':
16       main()

返回：
C:\Users\11749\Documents\python\venv\Scripts\python.exe C:/Users/11749/Documents/python/Py_110.py
请输入网址：https://www.liaoxuefeng.com/wiki ... 00/1183255880134144
Process finished with exit code -1

（自动打开网页之后迟迟没有后续反应，所以我自己手动结束了程序）

最佳答案

月排行榜 / 总排行榜

chxchxkkk

2019-7-13 18:05:20

两种方法：
import chardet, requests, urllib
from urllib.request import urlopen
url = 'https://www.liaoxuefeng.com/wiki ... 00/1183255880134144'
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
'(KHTML, like Gecko) Chrome/63.0.3239.26 Safari/537.36 '
'Core/1.63.6823.400 QQBrowser/10.3.3117.400'}
res = requests.get(url, headers = headers)
s = res.encoding
print(s)

resp = urlopen('http://www.baidu.com')
html = resp.read()
chardet1 = chardet.detect(html)
m = chardet1['encoding']

print(m)
结果：
UTF-8
utf-8

Process finished with exit code 0

第二种方法用你原来的网址，报错503

跳转到最佳答案楼层

新手·ing · 发表于 2019-7-13 06:50:21

你换个网址，这个代码没问题

butterX · 发表于 2019-7-13 10:49:33

新手·ing 发表于 2019-7-13 06:50
你换个网址，这个代码没问题

我这个代码里边有关于打开网页的命令吗

我试了几个都没有输出，但是网页到是一个不少都给我打开了

新手·ing · 发表于 2019-7-13 17:34:42

。。我的意思是你输入别的网址

chxchxkkk · 发表于 2019-7-13 18:05:20

这个最佳答案由 chxchxkkk 给出，感谢 chxchxkkk 的回答。

单击隐藏图章

两种方法：
import chardet, requests, urllib
from urllib.request import urlopen
url = 'https://www.liaoxuefeng.com/wiki ... 00/1183255880134144'
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
'(KHTML, like Gecko) Chrome/63.0.3239.26 Safari/537.36 '
'Core/1.63.6823.400 QQBrowser/10.3.3117.400'}
res = requests.get(url, headers = headers)
s = res.encoding
print(s)

resp = urlopen('http://www.baidu.com')
html = resp.read()
chardet1 = chardet.detect(html)
m = chardet1['encoding']

print(m)
结果：
UTF-8
utf-8

Process finished with exit code 0

第二种方法用你原来的网址，报错503

账号		自动登录	找回密码
密码			立即注册

[已解决]写一个程序，检测指定 URL 的编码

马上注册，结交更多好友，享用更多功能^_^

浏览过的版块