|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
import bs4
import requests
#获得电影的名字
headers={'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36 Core/1.70.3741.400 QQBrowser/10.5.3863.400'
}
res = requests.get('https://movie.douban.com/chart',headers=headers)
soup = bs4.BeautifulSoup(res.text,'html.parser')
targets = soup.find_all("div",class_="p12")
for each in targets:
print(each.a.span.text)
本帖最后由 qiuyouzhi 于 2020-4-9 15:13 编辑
分步骤分析问题:
1,get确定抓取到网页了吗?
2,soup提取的对吗?class_的内容没有拼错?
3,确定是each.a.span.text?
根据上面的信息,可以发现,是class_拼写错误,
是pl2而不是p12。
代码:
- import bs4
- import requests
- #获得电影的名字
- headers={'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36 Core/1.70.3741.400 QQBrowser/10.5.3863.400'
- }
- res = requests.get('https://movie.douban.com/chart',headers=headers)
- print(res.text)
- soup = bs4.BeautifulSoup(res.text,'html.parser')
- targets = soup.find_all("div",class_="pl2")
- for each in targets:
- print(each.a.span.text)
复制代码
|
|