python爬虫,Python交流,编程语言专区,鱼C论坛

cjy520 发表于 2021-6-15 23:21:02

python爬虫

求大佬：python怎么爬取这个标签里的超链接

smog 发表于 2021-6-15 23:59:10

本帖最后由 smog 于 2021-6-17 12:22 编辑

re正则：
re.findall('<a rpos cpos="title" href="(https://news.ycwb.com/202)">', txt)(txt 是响应的字符串)
xpath:
response.xpath('//a[@cpos="title"]/@href').extract_first()
beautifulsoup:
soup = BeautifulSoup(txt, 'lxml')
soup.find('a', {'cpos="title"'})['href']
...

suchocolate 发表于 2021-6-16 03:20:46

re: href="(.*?)"
xpath: '/a/@href'

页: [1]

鱼C论坛's Archiver

python爬虫