The code is as follows:
import scrapy

class DmozSpider(scrapy.Spider):
    name = "dmoz"
    allowed_domains = ['domztools.net']
    start_urls = [
        'http://www.dmoztools.net/Computers/Programming/Languages/Python/Books/',
        'http://www.dmoztools.net/Computers/Programming/Languages/Python/Resources/'
    ]

    def parse(self,response):
        sel = xpath.selector.Selector(response)
        sites = sel.xpath('//div[@class="title-and-desc"]/a')
        for site in sites:
            title = site.xpath('div/text()').extract()
            link = site.xpath('div/@href').extract()
            desc = site.xpath('div[@class="title-and-desc"]/text()').extract()
            print(title,link.desc)
However, running it from cmd produces the following error:
Traceback (most recent call last):
  File "e:\python36\lib\site-packages\twisted\internet\defer.py", line 653, in _runCallbacks
    current.result = callback(current.result, *args, **kw)
  File "C:\Users\dwc\Desktop\scr1\scr1\spiders\dmoz_spider.py", line 12, in parse
    sel = xpath.selector.Selector(response)
NameError: name 'xpath' is not defined
2017-12-11 20:10:01 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.dmoztools.net/Computers/Programming/Languages/Python/Resources/> (referer: None)
2017-12-11 20:10:01 [scrapy.core.scraper] ERROR: Spider error processing <GET http://www.dmoztools.net/Computers/Programming/Languages/Python/Resources/> (referer: None)
Traceback (most recent call last):
  File "e:\python36\lib\site-packages\twisted\internet\defer.py", line 653, in _runCallbacks
    current.result = callback(current.result, *args, **kw)
  File "C:\Users\dwc\Desktop\scr1\scr1\spiders\dmoz_spider.py", line 12, in parse
    sel = xpath.selector.Selector(response)
NameError: name 'xpath' is not defined
Could someone more experienced help me figure this out? Thanks.