[已解决]爬虫

kygschp · 发表于 2021-7-17 08:32:21

马上注册，结交更多好友，享用更多功能^_^

您需要登录才可以下载或查看，没有账号？立即注册

x

求救各位大神，下面的代码运行后，啥东西都没打印出来

import requests
import bs4

response = requests.get('https://movie.douban.com/typerank?type_name=%E5%89%A7%E6%83%85&type=11&interval_id=100:90&action=')
soup = bs4.BeautifulSoup(response.text, 'html.parser')
targets = soup.find_all('div', class_='movie-name')

for each in targets:
print(each.span.a.text)

最佳答案

月排行榜 / 总排行榜

suchocolate

2021-7-17 14:13:32

加header，不加默认是python-requests，会被反扒的。

import requests
import bs4
headers = {'user-agent': 'Mozilla'}
response = requests.get('https://movie.douban.com/typerank?type_name=%E5%89%A7%E6%83%85&type=11&interval_id=100:90&action=', headers=headers)
soup = bs4.BeautifulSoup(response.text, 'html.parser')
targets = soup.find_all('div', class_='movie-name')
for each in targets:
print(each.span.a.text)

复制代码

跳转到最佳答案楼层

suchocolate · 发表于 2021-7-17 14:13:32

这个最佳答案由 suchocolate 给出，感谢 suchocolate 的回答。

单击隐藏图章

加header，不加默认是python-requests，会被反扒的。

import requests
import bs4
headers = {'user-agent': 'Mozilla'}
response = requests.get('https://movie.douban.com/typerank?type_name=%E5%89%A7%E6%83%85&type=11&interval_id=100:90&action=', headers=headers)
soup = bs4.BeautifulSoup(response.text, 'html.parser')
targets = soup.find_all('div', class_='movie-name')
for each in targets:
print(each.span.a.text)

复制代码

账号		自动登录	找回密码
密码			立即注册

[已解决]爬虫

马上注册，结交更多好友，享用更多功能^_^

浏览过的版块