Crawler returns an empty list, Python Discussion, Programming Languages, 鱼C论坛

a870950142 posted on 2022-11-14 20:54:46

Crawler returns an empty list

import requests
from lxml import etree
url='https://www.pearvideo.com/'
respones=requests.get(url)
#print(respones.status_code)
#print(respones.text)
resp=etree.HTML(respones.text)
lujing=resp.xpath('/html/body/div/div/div/div/ul/li/div/text()')
print(lujing)

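Could someone explain why this returns an empty list? Is the site blocking my crawler, or is there something wrong with the XPath?
I'm still learning, and my previous attempt also returned an empty list.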

wp231957 posted on 2022-11-14 21:02:41

Nine times out of ten, the page is not static.
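
A quick way to test that guess is to check whether the target text is already present in the raw HTML the server returns. If it is, the page is static enough for requests plus lxml and the problem lies in the XPath; if it is not, the content is probably filled in later by JavaScript. The sketch below is only an illustration: the helper name appears_in_raw_html and both sample HTML strings are made up for this example and do not come from the thread.

```python
def appears_in_raw_html(html: str, marker: str) -> bool:
    """Return True if `marker` occurs in the HTML text the server sent."""
    return marker in html


# Made-up snippets standing in for two kinds of responses (assumptions, not real pages)
static_page = '<ul><li><a href="video_1">clip</a></li></ul>'
dynamic_page = '<div id="app"></div><script src="bundle.js"></script>'

# The marker is part of the tag the scraper is looking for
marker = 'href="video_'

static_result = appears_in_raw_html(static_page, marker)
dynamic_result = appears_in_raw_html(dynamic_page, marker)
print(static_result)   # True: the link is already in the HTML
print(dynamic_result)  # False: the link would have to be added by a script
```

With the script from the first post, the same check would be appears_in_raw_html(respones.text, 'href="video_').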

suchocolate posted on 2022-11-15 09:47:41

What are you trying to scrape from this page?

a870950142 posted on 2022-11-15 11:48:50

suchocolate posted on 2022-11-15 09:47
What are you trying to scrape from this page?

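I want to extract the href attribute value of the a tag for any video on this page: <a href="video_1596592"

I'm just using this page for practice. I'd like to know where I went wrong and why the result is an empty list.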

自学小白菜 posted on 2022-11-15 21:45:18

import requests
from lxml import etree
url='https://www.pearvideo.com/'
respones=requests.get(url)
resp=etree.HTML(respones.text)
lujing=resp.xpath('//div[@class="vervideo-tlist-big"]//a/@href')
print(lujing)

This should work. I tried it and got ['video_1724597'].
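
A likely reason the original XPath came back empty is that an absolute path such as /html/body/div/div/div/div/ul/li/div/text() only matches when every step is a direct child at exactly that depth, and it asks for the text of a div rather than the href of an a tag. The offline sketch below compares the two styles. The HTML string is invented for this illustration (only the class name vervideo-tlist-big is taken from the reply above), so it says nothing certain about the real page's structure.

```python
from lxml import etree

# Made-up markup: two nested divs above the list, not four as the absolute path expects
sample = """
<html><body>
  <div class="wrapper">
    <div class="vervideo-tlist-big">
      <ul>
        <li><div><a href="video_1">Clip one</a></div></li>
        <li><div><a href="video_2">Clip two</a></div></li>
      </ul>
    </div>
  </div>
</body></html>
"""

tree = etree.HTML(sample)

# Absolute path from the first post: the nesting depth does not match, so nothing is selected
absolute = tree.xpath('/html/body/div/div/div/div/ul/li/div/text()')

# Relative path from the reply: find the container by class, then any a tag beneath it
relative = tree.xpath('//div[@class="vervideo-tlist-big"]//a/@href')

print(absolute)  # []
print(relative)  # ['video_1', 'video_2']
```

Anchoring on a class or id with // keeps the query working when wrapper elements are added or removed, whereas an absolute path breaks as soon as the depth changes by one level.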
