|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
<div class="el">
<p class="t1 ">
<em class="check" name="delivery_em" onclick="checkboxClick(this)"></em>
<input class="checkbox" type="checkbox" name="delivery_jobid" value="117950378" jt="0" style="display:none" />
<span>
<a target="_blank" title="人事专员(薪酬)" href="https://jobs.51job.com/shenzhen-nsq/117950378.html?s=01&t=0" onmousedown="">
人事专员(薪酬) </a>
</span>
</p>
<span class="t2"><a target="_blank" title="前海云链科技(深圳)有限公司" href="https://jobs.51job.com/all/co5579986.html">前海云链科技(深圳)有限公司</a></span>
<span class="t3">深圳-南山区</span>
<span class="t4">6-8千/月</span>
<span class="t5">05-05</span>
</div>
这个是前程无忧的职位信息,其他的公司、薪酬都爬下来了,但是不知道该怎么爬取<a>标签里的职位名称,恳求大佬们指教。
你这样试试:
- import requests
- from bs4 import BeautifulSoup
- url = 'https://search.51job.com/list/020000%252C040000%252C030200%252C080200%252C010000,000000,0608,00,9,99,%25E8%2596%25AA%25E9%2585%25AC,2,1.html?lang=c&postchannel=0000&workyear=99&cotype=99°reefrom=99&jobterm=99&companysize=99&ord_field=0&dibiaoid=0&line=&welfare='
- kv = {'user-agent' : 'Chrome/10'}
- r = requests.get(url, headers = kv)
- r.encoding = r.apparent_encoding
- soup = BeautifulSoup(r.text, 'html.parser')
- target = soup("p", class_="t1")
- for each in target:
- print(each.span.a['title'])
复制代码
我这里岗位都打印出来了
|
|