The result is an empty list. I tried both the XPath copied from the browser and one I wrote myself, and both return an empty list.
import requests
from lxml import etree
url = 'https://zhuanlan.zhihu.com/pypcfx'
headers = {'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36'}
html = requests.get(url, headers=headers)
data = etree.HTML(html.text)
title = data.xpath('//div[@class="Card css-ny4o71"]/div[@class="css-8txec3"]//h2/a/text()')
print(title)
That content is loaded via AJAX, so it is not in the HTML that requests gets back. Try this:
import requests
# The column's items come from this JSON API, not from the page HTML
url = 'https://www.zhihu.com/api/v4/columns/pypcfx/pinned-items'
headers = {'user-agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36'}
# Take the "content" field of the first pinned item
html = requests.get(url, headers=headers).json()["data"][0]["content"]
print(html)
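If the column has no pinned items, or the API responds with an error, the one-liner above would raise an IndexError or KeyError. Here is a slightly more defensive sketch of the same idea. The function names `fetch_pinned_items` and `first_pinned_content` are made up for illustration, and the only thing assumed about the response is the `data` list with a `content` field, as used in the reply above. The endpoint may also have changed or may require cookies by now, so treat this as a starting point rather than a guaranteed fix.

```python
HEADERS = {
    'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
                  '(KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36'
}


def fetch_pinned_items(column_slug, timeout=10):
    """Fetch the pinned-items JSON for a column (hypothetical helper)."""
    # Imported here so the parsing helper below works without requests installed
    import requests

    url = f'https://www.zhihu.com/api/v4/columns/{column_slug}/pinned-items'
    resp = requests.get(url, headers=HEADERS, timeout=timeout)
    resp.raise_for_status()  # fail loudly on 4xx/5xx instead of parsing an error page
    return resp.json()


def first_pinned_content(payload):
    """Return the "content" of the first item in payload["data"], or None."""
    items = payload.get('data') or []
    if not items:
        return None
    return items[0].get('content')
```

Usage would be something like `print(first_pinned_content(fetch_pinned_items('pypcfx')))`, which prints `None` instead of crashing when nothing is pinned.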