This post was last edited by basketmn on 2021-8-9 14:06
- import requests
- import re
- from lxml import etree
- url='https://www.qiushibaike.com/video/'
- headers={'User-Agent':
-     'Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:90.0) Gecko/20100101 Firefox/90.0'
- }
- response=requests.get(url=url,headers=headers)
- result=etree.HTML(response.text)
- #tupian=re.findall(r'<div class="thumb">.*?<img src="(.*?)" alt=.*?</div>',response.text,re.S)
- tupian=result.xpath('//video[@controls="controls"]/source/@src')
- print(tupian)
- for img_tupian in tupian:
-     video_url='https:'+img_tupian
-     shipin=requests.get(url=video_url,headers=headers)
-     print(shipin)
-     with open('.\','wb') as f:
-         f.write(shipin.content)
Hi everyone, I think this is being blocked by anti-scraping measures: it returns Response [200]. How do I fix it?
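You're not being blocked. Response [200] means the request succeeded; 200 is the HTTP status code. The only problem in your code is the open call: the path you pass has no file name in it, so there is no file to write the video into. You could change it to - with open('./'+img_tupian.split('/')[-1],'wb') as f: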
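To make the fix above concrete, here is a minimal sketch of the saving step, assuming the `src` values look like the protocol-relative URLs in the original post. The helper names `filename_from_src` and `save_video` are hypothetical, and `response` only needs to behave like the `requests` response object from the question (it must have `.status_code` and `.content`).

```python
import os
from urllib.parse import urlsplit


def filename_from_src(src):
    """Return the last path segment of a video URL, ignoring any query string."""
    # urlsplit also handles protocol-relative URLs such as '//host/a/b.mp4'.
    name = os.path.basename(urlsplit(src).path)
    if not name:
        raise ValueError(f"no file name in URL: {src!r}")
    return name


def save_video(response, src, out_dir="."):
    """Write response.content into out_dir and return the path written.

    Hypothetical helper: `response` is assumed to look like a
    requests.Response (it needs .status_code and .content).
    """
    # 200 means the request succeeded; anything else is treated as a failure.
    if response.status_code != 200:
        raise RuntimeError(f"unexpected status {response.status_code} for {src}")
    target = os.path.join(out_dir, filename_from_src(src))
    with open(target, "wb") as f:
        f.write(response.content)
    return target
```

Inside the loop from the question, the open call could then be replaced by `save_video(shipin, img_tupian)`, which would write each video to the current directory under its own file name.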