- import requests
- from bs4 import BeautifulSoup
- url = "https://www.itotii.net/584.html"
- headres = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.87 Safari/537.36 SE 2.X MetaSr 1.0'}
- res=requests.get(url,headers=headres)
- res.encoding = 'utf-8'
- print(res.status_code)
- soup = BeautifulSoup(res.text,'html.parser')
- item = soup.find_all(class_="article-content")
- print(item)
- for src in item:
-     herf = item.find_all(data-tag="bdshare")
-     print(herf['src'])
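The `img` tags in the page source have no `data-tag` attribute. Take the first `article-content` element, find its `img` tags, and read `src` from each one: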
- import requests
- from bs4 import BeautifulSoup
- url = "https://www.itotii.net/584.html"
- headres = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.87 Safari/537.36 SE 2.X MetaSr 1.0'}
- res = requests.get(url, headers=headres)
- soup = BeautifulSoup(res.text, 'html.parser')
- # find_all returns a list-like ResultSet, so take the first match
- item = soup.find_all(class_="article-content")[0]
- imgs = item.find_all("img")
- for img in imgs:
-     # each img is a Tag; index it like a dict to read an attribute
-     herf = img["src"]
-     print(herf)
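If you want to check for yourself which attributes the `img` tags actually carry, or filter on a `data-*` attribute on a page that does have one, something like the sketch below should work. The HTML here is made up so it runs without hitting the site; the `example.com` URLs and the `data-tag="bdshare"` on the second image are assumptions for illustration, not what the real page contains. Note that a hyphenated name like `data-tag` can't be passed as a keyword argument in Python, so it goes through the `attrs` dict instead.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for the real page; all values are made up.
html = """
<div class="article-content">
  <img src="https://example.com/a.jpg" alt="first">
  <img src="https://example.com/b.jpg" data-tag="bdshare">
  <img alt="no src here">
</div>
"""

soup = BeautifulSoup(html, "html.parser")
content = soup.find(class_="article-content")

# tag.attrs is a dict of every attribute the tag carries,
# handy for checking whether data-tag exists at all.
for img in content.find_all("img"):
    print(img.attrs)

# Hyphenated attribute names go in the attrs dict, not as keyword arguments.
tagged = content.find_all("img", attrs={"data-tag": "bdshare"})
tagged_srcs = [img["src"] for img in tagged]

# img.get("src") returns None instead of raising KeyError when src is missing.
all_srcs = [img.get("src") for img in content.find_all("img") if img.get("src")]

print(tagged_srcs)
print(all_srcs)
```

Using `img.get("src")` instead of `img["src"]` is a small safety net: some pages have lazy-loaded images where `src` is absent, and indexing would raise `KeyError` there.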