|
|
发表于 2019-6-19 18:43:22
|
显示全部楼层
你检查一下你的headers,
少了一个逗号,少了一个冒号。
- import requests
- from lxml import etree
- headers = {
- "User-Agent":"Mozilla/5.0 (Windows NT 5.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.90 Safari/537.36 2345Explorer/9.3.2.17331",
- "Referer":"https://www.mzitu.com/tag/ugirls/"
- }
- response = requests.get("https://www.mzitu.com/tag/ugirls/",headers=headers)
- print(response)
- html = etree.HTML(response.text)
- src_list=html.xpath('//img[@class="lazy"]/@data-original')
- alt_list=html.xpath('//img[@class="lazy"]/@alt')
- for src,alt in zip(src_list, alt_list):
- #print(src,alt)
- response = requests.get(src,headers=headers)
- print(src)
- print(response)
- fileName = alt+".jpg"
- print("正在保:"+fileName)
- # with open(fileName,"wb") as f:
- # f.write(response.content)
复制代码 |
|