rompin posted on 2020-4-9 15:08:50

Unsolved (why does print output nothing?)

import bs4
import requests
# Get the movie titles
headers={'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36 Core/1.70.3741.400 QQBrowser/10.5.3863.400'
    }
res = requests.get('https://movie.douban.com/chart',headers=headers)
soup = bs4.BeautifulSoup(res.text,'html.parser')
targets = soup.find_all("div",class_="p12")
for each in targets:
    print(each.a.span.text)

qiuyouzhi posted on 2020-4-9 15:10:41

Last edited by qiuyouzhi on 2020-4-9 15:13

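Break the problem down step by step:
1. Did get actually fetch the page?
2. Is soup extracting the right elements? Is the class_ value spelled correctly?
3. Is each.a.span.text really the right path?
Checking these in order shows that the class_ value is misspelled:
it should be pl2 (lowercase letter l), not p12 (digit 1). With the wrong class, find_all returns an empty list, so the for loop never runs and nothing is printed.
Code: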
import bs4
import requests
# Get the movie titles
headers={'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36 Core/1.70.3741.400 QQBrowser/10.5.3863.400'
    }
res = requests.get('https://movie.douban.com/chart',headers=headers)
print(res.text)

soup = bs4.BeautifulSoup(res.text,'html.parser')
targets = soup.find_all("div",class_="pl2")
for each in targets:
    print(each.a.span.text)
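
For step 2, one way to catch this kind of typo is to list the class names that actually occur on the page and compare them with what you typed. Below is a minimal sketch of that idea. It uses the standard library's html.parser instead of bs4 so it runs without the network or extra packages. DivClassCounter, div_classes and SAMPLE_HTML are made-up names for illustration, and SAMPLE_HTML is a hand-written stand-in for res.text, not the real Douban markup.

```python
from collections import Counter
from html.parser import HTMLParser


class DivClassCounter(HTMLParser):
    """Tally every class name that appears on a <div> tag."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        for name, value in attrs:
            # A class attribute may hold several space-separated names.
            if name == "class" and value:
                self.counts.update(value.split())


def div_classes(html):
    """Return a Counter of the class names found on <div> tags in html."""
    parser = DivClassCounter()
    parser.feed(html)
    parser.close()
    return parser.counts


# Hypothetical stand-in for res.text (assumption: not the real page source).
SAMPLE_HTML = """
<div class="pl2"><a href="#"><span>Movie A</span></a></div>
<div class="pl2"><a href="#"><span>Movie B</span></a></div>
<div class="star clearfix"></div>
"""

counts = div_classes(SAMPLE_HTML)
print(counts.most_common())               # which classes exist, and how often
print("p12" in counts, "pl2" in counts)   # is the class you typed really there?
```

On the real page you would pass res.text to div_classes instead of SAMPLE_HTML. If the class you are searching for is missing from the tally, the problem is the selector, not print.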