|
发表于 2018-1-16 15:32:10
|
显示全部楼层
本楼为最佳答案
- import urllib.request
- import re
- from bs4 import BeautifulSoup
- url = 'http://jandan.net/ooxx/page-472#comments'
- req = urllib.request.Request(url)
- req.add_header('User-Agent',"Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.221 Safari/537.36 SE 2.X MetaSr 1.0")
- responce = urllib.request.urlopen(req)
-
- html = responce.read().decode('utf-8')
- soup = BeautifulSoup(html, "lxml")
- #_r = r'<a href="(.*?#comments)"'
- #_result = re.findall(_r, html)
- print(soup.img)
复制代码
|
|