设为首页收藏本站

切换到窄版

鱼C论坛»论坛 › 编程语言专区 › Python交流 › 使用beautifulsoup的问题

发新帖

查看: 846|回复: 1

[已解决]使用beautifulsoup的问题

发表于 2018-2-4 18:51:45 | 显示全部楼层 |阅读模式

马上注册，结交更多好友，享用更多功能^_^

您需要登录才可以下载或查看，没有账号？立即注册

x

>>> html_doc = """<html><head><title>睡鼠的故事</title></head>
<body>
<p class="title"><b>睡鼠的故事</b></p>
<p class="story">从前有三位小姐姐，她们的名字是：
<a href="http://example.com/elsie" class="sister" id="link1">埃尔西</a>，
<a href="http://example.com/lacie" class="sister" id="link2">莱斯</a>和
<a href="http://example.com/tillie" class="sister" id="link3">蒂尔莉</a>；
她们住在一个井底下面。</p>
<p class="story">...</p>
"""
>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup(html_doc, 'html.parser')

复制代码

看了小甲鱼老师的帮助文档，知道使用

>>> soup.html.body.p
<p class="title"><b>睡鼠的故事</b></p>

复制代码

得到body标签下第一个p标签的内容

请问有没有办法使用类似这样的代码
soup.html.body.p（class='story'）
直接得到body标签下另一个p标签的内容？即返回

<p class="story">从前有三位小姐姐，她们的名字是：
<a href="http://example.com/elsie" class="sister" id="link1">埃尔西</a>，
<a href="http://example.com/lacie" class="sister" id="link2">莱斯</a>和
<a href="http://example.com/tillie" class="sister" id="link3">蒂尔莉</a>；
她们住在一个井底下面。</p>

复制代码

最佳答案

月排行榜 / 总排行榜

月亮下的么么哒

2018-2-6 18:06:40

soup = BeautifulSoup(html_doc, "lxml")
all_link = soup.find_all("p", attrs={"class": "story"})

复制代码

跳转到最佳答案楼层

小甲鱼最新课程 -> https://ilovefishc.com

回复

使用道具举报

月亮下的么么哒

发表于 2018-2-6 18:06:40 | 显示全部楼层本楼为最佳答案

这个最佳答案由月亮下的么么哒给出，感谢月亮下的么么哒的回答。

单击隐藏图章

soup = BeautifulSoup(html_doc, "lxml")
all_link = soup.find_all("p", attrs={"class": "story"})

复制代码

小甲鱼最新课程 -> https://ilovefishc.com

回复支持反对

使用道具举报

发新帖

小黑屋|手机版|Archiver|鱼C工作室 ( 粤ICP备18085999号-1 | 粤公网安备 44051102000585号)

GMT+8, 2026-3-7 06:07

Powered by Discuz! X3.4

© 2001-2023 Discuz! Team.

快速回复 返回顶部 返回列表