木习衣 发表于 2020-3-22 17:01:23

看视频码的代码,不知道为啥运行时会显示下面的错误,求助大佬

import urllib.request
import re

def open_url(url):
    req=urllib.request.Request(url)
    req.add_header('User-Agent','Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.26 Safari/537.36 Core/1.63.6788.400 QQBrowser/10.3.2843.400')
    page=urllib.request.urlopen(req)
    html=page.read().decode('utf-8')

   
    return html


def get_img(html):
    p=r'<img src="([^"]+\.jpeg)"'
    imglist=re.findall(p,html)
    for each in imglist:
      print(each)





if __name__=='__main__':
    url="http://pic.sogou.com/d?query=%CD%F5%D2%BB%B2%A9%B1%DA%D6%BD&st=255&mode=13&dm=4&cwidth=1280&cheight=720&did=18#did17"
    get_img(open_url(url))



Traceback (most recent call last):
File "C:\Users\lenovo\Desktop\tieba.py", line 26, in <module>
    get_img(open_url(url))
File "C:\Users\lenovo\Desktop\tieba.py", line 8, in open_url
    html=page.read().decode('utf-8')
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbb in position 880: invalid start byte

wp231957 发表于 2020-3-22 18:26:59

url="http://pic.sogou.com/d?query=%CD%F5%D2%BB%B2%A9%B1%DA%D6%BD&st=255&mode=13&dm=4&cwidth=1280&cheight=720&did=18#did17"

这个网站用的是GBK你用utf-8解码 肯定是不行
页: [1]
查看完整版本: 看视频码的代码,不知道为啥运行时会显示下面的错误,求助大佬