|
|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
import requests
from lxml import etree
import csv
fp = open('C:\\learning\\python学习\\douban3.csv','wt',newline='',encoding='utf-8')
writer = csv.writer(fp)
writer.writerow(('time','data'))
headers = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64)'
' AppleWebKit/537.36 (KHTML, like Gecko) Chrome/64.0.3282.119 Safari/537.36'}
urls = 'http://fund.eastmoney.com/f10/jjjz_161725.html'
res = requests.get(urls, headers=headers)
selector = etree.HTML(res.text)
url_infos = selector.xpath('//tr')
for url_info in url_infos:
time = url_info.xpath('td/text()')
writer.writerow((time))
fp.close()
以上是代码,为了爬取目标网站的表格内的数据,我直接定位到tr标签对td进行遍历,为何没有任何输出结果,csv文件只有抬头行。
网站URL:http://fund.eastmoney.com/f10/jjjz_161725.html
很期待你的回复!!
|
-
目标文件
-
网站截图
|