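As the title says.
I've already managed to scrape the content, and testing with the first page works fine.
But once I started on pagination, the second page's data overwrites the first page's.
Could someone please help? The first working answer gets marked as best.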
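- # This is probably the part of the code that is missing something
- # (datalist holds the rows scraped from the current page)
- wb = openpyxl.Workbook()
- wb.guess_types = True
- ws = wb.active
- # Header: complaint ID, brand, series, model, problem summary, complaint time, status
- ws.append(['投诉编号','投诉品牌','投诉车系','投诉车型','问题简述','投诉时间','投诉状态'])
- for each in datalist:
-     print(each)
-     ws.append(each)
- time.sleep(0.5)
- wb.save("车辆问题.xlsx")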
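A minimal sketch of why the overwrite happens and one way around it, assuming the snippet above runs once per page: every run starts from a fresh `openpyxl.Workbook()` and saves to the same filename, so only the last page survives. The helper name `append_pages` and the sample rows are made up for illustration. `ws` can be anything with an `append` method, so a plain list stands in for the worksheet here.

```python
HEADER = ['投诉编号', '投诉品牌', '投诉车系', '投诉车型', '问题简述', '投诉时间', '投诉状态']


def append_pages(ws, pages, header=HEADER):
    """Write the header once, then add the rows of every page below it.

    ws    -- any object with an append(row) method (hypothetical stand-in
             for an openpyxl worksheet; a plain list works for a dry run)
    pages -- iterable of pages, each page being a list of rows
    Returns the number of data rows written.
    """
    ws.append(list(header))
    count = 0
    for datalist in pages:
        for each in datalist:
            ws.append(list(each))
            count += 1
    return count


# Dry run with made-up rows: both pages end up in one sheet, in order.
sheet = []
pages = [
    [['1', 'A', 'a1', 'm1', 'p1', '2020-01-01', 's1']],
    [['2', 'B', 'b1', 'm2', 'p2', '2020-01-02', 's2']],
]
written = append_pages(sheet, pages)
print(written, len(sheet))  # 2 data rows, 3 rows in total with the header
```

With openpyxl the same shape would be: create the workbook and take `wb.active` once before the page loop, pass that worksheet to the helper, and call `wb.save(...)` once after all pages are in.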
I wrote one for you:
- import requests
- import parsel
- # from lxml import etree
- from openpyxl import load_workbook
- def main():
-     # test.xlsx must already exist; load_workbook opens it, it does not create it
-     wb = load_workbook('test.xlsx')
-     wb.guess_types = True
-     ws = wb.active
-     # Header: complaint ID, brand, series, model, problem summary, complaint time, status
-     ws.append(['投诉编号', '投诉品牌', '投诉车系', '投诉车型', '问题简述', '投诉时间', '投诉状态'])
-     headers = {'User-Agent': 'firefox'}
-     for page in range(1, 10):
-         target = f"http://www.12365auto.com/zlts/273-0-0-0-0-0_0-0-0-0-0-0-0-{page}.shtml"
-         print(target)
-         r = requests.get(target, headers=headers)
-         # html = etree.HTML(r.text)
-         html = parsel.Selector(r.text)
-         # parsel's xpath() returns selectors; getall() extracts the text strings
-         # Complaint ID
-         num = html.xpath('//tr/td[1]/text()').getall()
-         # Brand
-         name = html.xpath('//tr/td[2]/text()').getall()
-         # Series
-         serial = html.xpath('//tr/td[3]/text()').getall()
-         # Model (renamed from `type` to avoid shadowing the built-in)
-         model = html.xpath('//tr/td[4]/text()').getall()
-         # Problem summary
-         description = html.xpath('//tr/td/a/text()').getall()
-         # Complaint time
-         stime = html.xpath('//tr/td[7]/text()').getall()
-         # Complaint status
-         status = html.xpath('//tr/td/em/text()').getall()
-         # Start below the last used row so each page lands under the previous one
-         offset = ws.max_row + 1
-         for n, _ in enumerate(num):
-             row = str(n + offset)
-             ws['A' + row] = num[n]
-             ws['B' + row] = name[n]
-             ws['C' + row] = serial[n]
-             ws['D' + row] = model[n]
-             ws['E' + row] = description[n]
-             ws['F' + row] = stime[n]
-             ws['G' + row] = status[n]
-     # Save once, after all pages have been written
-     wb.save("test.xlsx")
- if __name__ == '__main__':
-     main()
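One thing to watch in the reply's approach: each column is pulled with its own XPath and then indexed by the position in `num`. If one column comes back shorter (say, a row whose summary cell has no link), the loop either raises `IndexError` or shifts values into the wrong rows. A small sketch of a guard, where the helper name `columns_to_rows` and the sample values are invented for illustration:

```python
def columns_to_rows(*columns):
    """Turn parallel column lists into row tuples, refusing mismatched lengths."""
    lengths = {len(col) for col in columns}
    if len(lengths) > 1:
        raise ValueError(f"column lengths differ: {[len(c) for c in columns]}")
    return list(zip(*columns))


# Made-up values standing in for three of the scraped columns.
num = ['1001', '1002']
name = ['BrandA', 'BrandB']
status = ['open', 'closed']
rows = columns_to_rows(num, name, status)
print(rows)
```

Each tuple can then be passed to `ws.append(row)`, which adds it at the bottom of the sheet, so no manual row offset is needed.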