python3中xpath怎么使用

石头怪 · 发表于 2017-6-12 11:23:31

马上注册，结交更多好友，享用更多功能^_^

您需要登录才可以下载或查看，没有账号？立即注册

x

import urllib.request

url = 'http://tieba.baidu.com/p/5148091325'

response = urllib.request.urlopen(url)

html = response.xpath('//*[@id="post_content_108012954576"]/text()')

print(html)

报错信息：Traceback (most recent call last):
File "C:\Users\memedai\Desktop\xpath.py", line 7, in <module>
html = response.xpath('//*[@id="post_content_108012954576"]/text()')
AttributeError: 'HTTPResponse' object has no attribute 'xpath'

SixPy · 发表于 2017-6-12 11:57:52

lxml

gopythoner · 发表于 2017-6-12 14:05:33

bs4 的BeautifulSoup才有xpath
又不是Python自带的，你直接肯定用不了

yongxi · 发表于 2017-6-12 15:59:07

楼上正解

dori233 · 发表于 2017-6-13 20:38:36

你得先装上lxml,安装过程可能有障碍,具体可以百度或者知乎.
给个网址,我一开始是按照这个说明安装成功的
https://www.zhihu.com/question/49470061/answer/116163941

后来其实有更简单的方法,假如你有使用pycharm的情况下.
settings→ Project → Interpretre ,看到右边绿色那个加号了没有?
对,点进去然后寻找把,各种库都在这里了.
并且装完还自动把语法提示都有了.

然后 xpath方法在lxml里面.

楼上也说了,BeautifulSoup,它也需要lxml解析

或许你学了BeautifulSoup就不想用lxml了.
但是lxml对于单一网页还是有奇效的,至少解析速度快一点点.

账号		自动登录	找回密码
密码			立即注册