|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
请问这个代码保存成csv文件,想要第一行显示年龄,身高,学历,住址该怎么修改
- import requests
- import pandas as pd
- import re
- from lxml import html
- etree = html.etree
- #主页
- url = 'https://search.baihe.com/search/noLogin?&jsonCallBack=jQuery183026832692080622156_1683878942706'
- info_url = 'https://profile1.baihe.com/?oppID='
- header = {
- "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/108.0.0.0 Safari/537.36"
- }
- params = {
- "jsonCallBack": "jQuery183026832692080622156_1683878942706"
- }
- age_list = []
- height_list = []
- degree_list = []
- city_list = []
- response = requests.post(url=url,headers=header,params=params).text
- ex = '"userID":(\d+)'
- matches = re.findall(ex,response,re.S)
- for i in matches:
- ifo_url = info_url + i
- print(ifo_url)
- page = requests.get(url = ifo_url,headers=header).text
- tree = etree.HTML(page)
- # age = tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[1]')[1]
- # height = tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[2]')[1]
- # degree = tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[3]')[1]
- # city = tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[4]')[0]
- #print(age,height,degree,city)
- age_list.append(tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[1]')[1])
- height_list.append(tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[2]')[1])
- degree_list.append(tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[3]')[1])
- city_list.append(tree.xpath('//*[@id="BAIHE"]/div[1]/div//p/text()[4]')[0])
- print('爬取成功')
- df = pd.DataFrame()
- df['年龄'] = age_list
- df['身高'] = height_list
- df['学历'] = degree_list
- df['城市'] = city_list
- df.to_csv('baihe.csv',mode='a+',header=None,index=None,encoding='gbk')
复制代码
就是这个图片里面的内容变成下一个图片的内容
将df.to_csv(..)中的header参数设置为想要的列名即可,例如:
- df.to_csv('baihe.csv', mode='a+', header=['年龄', '身高', '学历', '城市'], index=None, encoding='gbk')
复制代码
这样就会把年龄、身高、学历、城市作为第一行写入到csv文件中了。
|
-
-
|