Skipping the small talk, here is the code, from page 173 of the book:
import urllib.request
import urllib.parse
import re
from bs4 import BeautifulSoup
def demo():
    keyword = input("enter a keyword:")
    keyword = urllib.parse.urlencode({"word":keyword})
    response =urllib.request.urlopen("http://baike.baidu.com/search/word?%s"%keyword)
    html = response.read()
    soup = BeautifulSoup(html,"html.parse")

    for each in soup.find_all(href = re.compile("view")):
        content = "".join([each.text])
        url2 = "".join(["http://baike.baidu.com",each["href"]])
        response2 = urllib.request.urlopen(url2)
        html2 = response2.read()
        soup2 = BeautifulSoup(html2,"html.parse")
        if soup2.h2:
            content = "".join([content,soup2.h2.text])
        content = "".join([content,"->",url2])
        print(content)

if __name__=="__main__":
    demo()
Here is the error:
enter a keyword:111
Traceback (most recent call last):
  File "E:\systemtemp\eclipsework\PydevDemo\src\xiaojiayu\BeautifuSoup4Demo.py", line 68, in <module>
    demo()
  File "E:\systemtemp\eclipsework\PydevDemo\src\xiaojiayu\BeautifuSoup4Demo.py", line 54, in demo
    soup = BeautifulSoup(html,"html.parse")
  File "D:\application\Python\Python35\lib\site-packages\bs4\__init__.py", line 165, in __init__
    % ",".join(features))
bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: html.parse. Do you need to install a parser library?
I'm a beginner. How do I fix this?
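Because you copied it wrong...
    soup2 = BeautifulSoup(html2,"html.parse")
should be
    soup2 = BeautifulSoup(html2,"html.parser")
The earlier call, soup = BeautifulSoup(html,"html.parse"), has the same typo and needs the same fix. That is the line the traceback actually points at.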
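If you want to see the difference between the two parser names without hitting the network, here is a minimal sketch. SAMPLE_HTML and parse_page are names made up for this illustration, not something from the book; the only library pieces used are BeautifulSoup and the FeatureNotFound exception that shows up in the traceback above.

```python
from bs4 import BeautifulSoup, FeatureNotFound

# Hypothetical stand-in for a downloaded page, so no network access is needed.
SAMPLE_HTML = (
    '<html><body><h2>Demo</h2>'
    '<a href="/view/1.htm">First entry</a></body></html>'
)


def parse_page(html, parser="html.parser"):
    """Parse html with the named parser; return None if bs4 cannot find it."""
    try:
        return BeautifulSoup(html, parser)
    except FeatureNotFound:
        # Raised when the parser name is misspelled or the parser is not installed.
        return None


good = parse_page(SAMPLE_HTML)               # "html.parser" ships with Python
bad = parse_page(SAMPLE_HTML, "html.parse")  # the typo from the question

print(good.h2.text)  # Demo
print(bad)           # None
```

In the real script you would probably not swallow the exception like this; letting it propagate is what told you the parser name was wrong in the first place.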