Posted on 2019-8-20 17:17:28
Your description isn't very clear. Do you want to extract all of the a tags?
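- # soup is assumed to be a BeautifulSoup object built from the page you already fetched
- divs = soup.find_all('div', class_='polysemantList-header-title')
- for div in divs:
-     # collect every a tag nested inside this div
-     a_list = div.find_all('a')
-     for a in a_list:
-         print(a)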
This prints every a tag:
- <a href="/item/%E7%99%BE%E5%BA%A6%E7%99%BE%E7%A7%91%EF%BC%9A%E5%A4%9A%E4%B9%89%E8%AF%8D" target="_blank">多义词</a>
- <a href="/item/%E4%B9%89%E9%A1%B9" target="_blank">义项</a>
- <a href="/item/%E7%8C%AA%E5%85%AB%E6%88%92?force=1" target="_blank">共10个义项</a>
- <a class="polysemant-button polysemant-button--add J-polysemant-button--add" data-href="/createsub/%E7%8C%AA%E5%85%AB%E6%88%92" href="javascript:;">
- <span class="polysemant-button__text J-polysemant-button__text">添加义项</span>
- <em class="cmn-icon wiki-lemma-icons wiki-lemma-icons_add polysemant-button__icon J-polysemant-button__icon"></em>
- </a>
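If what you actually need is the link text and href rather than the whole tag, something like the sketch below might do it. The SAMPLE_HTML string is a made-up, trimmed stand-in for the page (not the real markup), and extract_links is just a helper name I picked for illustration.

```python
from bs4 import BeautifulSoup

# Hypothetical, trimmed-down stand-in for the page markup (not the real page).
SAMPLE_HTML = """
<div class="polysemantList-header-title">
  <a href="/item/%E4%B9%89%E9%A1%B9" target="_blank">义项</a>
  <a class="polysemant-button" data-href="/createsub/%E7%8C%AA%E5%85%AB%E6%88%92" href="javascript:;">
    <span class="polysemant-button__text">添加义项</span>
  </a>
</div>
"""


def extract_links(html):
    """Return (text, href) pairs for every a tag inside the header div."""
    soup = BeautifulSoup(html, "html.parser")
    links = []
    for div in soup.find_all("div", class_="polysemantList-header-title"):
        for a in div.find_all("a"):
            # get_text(strip=True) also picks up text from nested tags such as <span>
            links.append((a.get_text(strip=True), a.get("href")))
    return links


links = extract_links(SAMPLE_HTML)
for text, href in links:
    print(text, href)
```

Note that the "add" button has href="javascript:;", so you may want to skip entries like that, or read data-href instead, depending on what you are after.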