import re
import requests
import os
from bs4 import BeautifulSoup as bs

header = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3314.0 Safari/537.36 SE 2.X MetaSr 1.0',
          'Referer': 'https://www.mzitu.com/japan/'}

# Fetch the gallery index page
a = 'http://info.xitek.com/galleries/'
b = requests.get(a, headers=header).content.decode('utf-8')
print(b)

# Collect links to the individual gallery pages from May 2020
d = re.findall(r'//info.xitek.com/.+?/202005/\d{2}-\d{6}\.html', b)
i = 5
for each in d:
    h = "http:" + each
    m = requests.get(h, headers=header).content.decode('utf-8')
    print(m)
    # Collect the image paths on this gallery page
    n = re.findall(r'/uploads/allimg/\d{6}/\d{2,4}-.{8,12}\.jpg', m)
    print(n)
    i += 1   # gallery counter, used in the file name
    z = 1    # image counter within the current gallery
    for x in n:
        q = 'http://info.xitek.com' + x
        print(q)
        y = requests.get(q, headers=header).content
        # Save as e:\<gallery>-<image>.jpg (the original 'e:\ ' put a stray space in the name)
        with open(os.path.join('e:\\', str(i) + '-' + str(z) + '.jpg'), 'wb') as f:
            f.write(y)
        z += 1
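This is a piece of code I wrote while practicing image scraping on the xitek (色影无忌) site. It runs without errors and does download files, but the downloaded files will not open. I checked the image URLs with Xunlei and they are fine; the copies Xunlei downloads can be viewed. Could an expert take a look and tell me what the problem is?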
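Change: 'Referer':'https://www.mzitu.com/japan/'
to: 'Referer':'http://info.xitek.com/galleries/'
The site presumably checks the Referer header as hotlink protection, so with a Referer from another site the response is likely not the real image data.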
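One way to catch this kind of problem early is to check the downloaded bytes before writing them to disk. The sketch below is only an illustration: the helper names make_headers, looks_like_jpeg and save_if_jpeg are made up for this example and are not part of the original script, and it assumes the images are ordinary JPEG files (which start with the bytes FF D8 FF).

```python
# Hypothetical helpers, not part of the original script.

JPEG_MAGIC = b"\xff\xd8\xff"  # JPEG files begin with these three bytes


def make_headers(referer, user_agent="Mozilla/5.0"):
    """Build request headers with a Referer that matches the site being scraped."""
    return {"User-Agent": user_agent, "Referer": referer}


def looks_like_jpeg(data):
    """Return True if the bytes start with the JPEG signature."""
    return data[:3] == JPEG_MAGIC


def save_if_jpeg(data, path):
    """Write data to path only if it looks like a JPEG; return whether it was saved."""
    if not looks_like_jpeg(data):
        return False
    with open(path, "wb") as f:
        f.write(data)
    return True


# Possible use inside the download loop (requires the requests package, not run here):
#   headers = make_headers('http://info.xitek.com/galleries/')
#   y = requests.get(q, headers=headers).content
#   if not save_if_jpeg(y, 'e:\\6-1.jpg'):
#       print('not an image:', q)
```

If save_if_jpeg returns False, the server most likely sent an error page or a placeholder instead of the picture, which would explain files that download but cannot be opened.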