|
15鱼币
本帖最后由 v.ki 于 2020-4-25 15:01 编辑
设置里加入了头,robots协议已经设置为fasle了
求助大佬为什么爬到了空气。。。应该怎么改
- import json
- import scrapy
- from ..items import PinduoduoItem
- class PinduoduoSpider(scrapy.Spider):
- name='pinduoduo'
- allowed_domains=['yangkeduo.com']
- page=1
- start_url=['http://mobile.yangkeduo.com/proxy/api/api/alexa/goods/hub?pdduid=0&list_update_time=true&hs_version=2&wrt_type=1&'+'page='+str(page)+'&size=20'
- ]
- print(start_url)
- def parse(self,response):
-
- goods_list_json=json.loads(response.body)
- goods_list=goods_list_json['goods_list']
- if not goods_list:
- return
- for i in goods_list:
- item=PinduoduoItem()
- item['goods_name']=i['goods_name']
- item['goods_id']=i['goods_id']
- yield scrapy.Request(url='http://mobile.yangkeduo.com/proxy/api/reviews/'+str(item['goods_id'])+'/list?page=1&size=10',\
- callback=self.get_comments,meta={'item':item})
-
- self.page+=1
-
- yield scrapy.Requests(url='http://mobile.yangkeduo.com/proxy/api/api/alexa/goods/hub?pdduid=0&list_update_time=true&hs_version=2&wrt_type=1&'+'page='+str(self.page)+'&size=20',\
- callback=self.parse)
-
- def get_comments(self,response):
- print('哈哈哈')
复制代码
|
|