Posted on 2018-10-10 00:07:11
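Your code runs fine for me, but the downloaded images are saved without a file extension, so they won't open.
You can fix that by changing filename in the get_img function.
Also, the reason it looks like nothing happened is that your code doesn't print any output.
If you want to check whether the downloads succeeded, you can likewise modify get_img and add a print("success") at the end of it.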
import re
import urllib.request


def open_url(url):
    # Send a browser-like User-Agent with the request
    req = urllib.request.Request(url)
    req.add_header('User-Agent',
                   'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 UBrowser/6.2.4094.1 Safari/537.36')
    page = urllib.request.urlopen(req)
    html = page.read().decode('utf-8')
    return html


def get_img(html):
    # Capture the src of every post image whose URL ends in .jpg
    p = r'<img class="BDE_Image" src="([^"]+\.jpg)"'
    imglist = re.findall(p, html)
    for each in imglist:
        # Last URL segment, with ".jpg" swapped for ".jpeg" so the file has an extension
        filename = each.split('/')[-1][:-4] + '.jpeg'
        urllib.request.urlretrieve(each, filename, None)
        # Visible confirmation after each download
        print("success")


if __name__ == '__main__':
    url = "https://tieba.baidu.com/p/3563409202"
    get_img(open_url(url))
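If you want to check the filename logic without hitting the network, here is a rough sketch you can run offline. The helper names extract_image_urls and filename_from_url, the default_ext parameter, and the example.com URL are all made up for illustration; only the regex comes from the code in this post. It keeps whatever extension the URL already has and falls back to .jpg when there is none, which is one possible way to avoid extensionless files.

```python
import os
import re
from urllib.parse import urlparse

# Same pattern as in get_img: capture the src of post images ending in .jpg
IMG_PATTERN = r'<img class="BDE_Image" src="([^"]+\.jpg)"'


def extract_image_urls(html):
    """Return every image URL in html that matches IMG_PATTERN (hypothetical helper)."""
    return re.findall(IMG_PATTERN, html)


def filename_from_url(img_url, default_ext=".jpg"):
    """Build a local filename that always has an extension (hypothetical helper)."""
    # Use only the path part, so a query string such as ?x=1 is ignored
    name = os.path.basename(urlparse(img_url).path)
    root, ext = os.path.splitext(name)
    if not root:
        root = "image"      # assumed fallback when the URL has no last segment
    if not ext:
        ext = default_ext   # assumed fallback when the URL has no extension
    return root + ext


if __name__ == "__main__":
    # Made-up HTML snippet standing in for a downloaded page
    sample = '<img class="BDE_Image" src="https://example.com/forum/pic/item/abc123.jpg" width="560">'
    for img_url in extract_image_urls(sample):
        print(img_url, "->", filename_from_url(img_url))
```

Inside get_img you could then write filename = filename_from_url(each) in place of the slicing, if that suits you better.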