- import re, requests
- url = "https://movie.douban.com/top250"
- header = {
-     "user-agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko)"
- }
- resp = requests.get(url, headers=header)
- page_content = resp.text
- # print(result)
- obj = re.compile(r'<li>.*?<span class="title">(?P<name>.*?)</span>'
-                  r'<span class="playable">(?P<watch>.*?)</span>'
-                  r'<span class="rating_num" property="v:average">(?P<score>.*?)</span>', re.S)
- result = obj.finditer(page_content)
- for i in result:
-     print(i.group("name"), i.group("score"), i.group("watch"))
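This program is supposed to scrape the title, rating, and playable status of every movie on the Douban Top 250 list,
but it prints nothing at all. Why is that?
Could someone take a look?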
- import re, requests
- url = "https://movie.douban.com/top250"
- header = {
-     "user-agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko)"
- }
- resp = requests.get(url, headers=header)
- page_content = resp.text
- # print(page_content)  # uncomment to inspect the raw HTML
- # The three spans are not directly adjacent in the page source, so a lazy
- # .*? is needed between them to skip the tags and whitespace in between.
- obj = re.compile(r'<li>.*?<span class="title">(?P<name>.*?)</span>.*?'
-                  r'<span class="playable">(?P<watch>.*?)</span>.*?'
-                  r'<span class="rating_num" property="v:average">(?P<score>.*?)</span>', re.S)
- result = obj.finditer(page_content)
- for i in result:
-     print(i.group("name"), i.group("score"), i.group("watch"))
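To see the difference between the two patterns without hitting the site, here is a minimal offline sketch. SAMPLE_HTML is a hypothetical, simplified fragment written for illustration (the real page markup may differ), and the names strict and loose are mine, not from the posts above. It only shows that a pattern with no gap between the spans finds nothing once other tags or whitespace sit between them, while the version with .*? does match.

```python
import re

# Hypothetical, simplified stand-in for one list entry; not copied from the site.
SAMPLE_HTML = """
<li>
  <div class="hd">
    <a href="#">
      <span class="title">Movie A</span>
      <span class="other">&nbsp;/&nbsp;Alias</span>
    </a>
    <span class="playable">[playable]</span>
  </div>
  <div class="star">
    <span class="rating_num" property="v:average">9.7</span>
  </div>
</li>
"""

# Pattern from the question: each span must be followed immediately by the next.
strict = re.compile(
    r'<li>.*?<span class="title">(?P<name>.*?)</span>'
    r'<span class="playable">(?P<watch>.*?)</span>'
    r'<span class="rating_num" property="v:average">(?P<score>.*?)</span>',
    re.S,
)

# Pattern with gaps: a lazy .*? absorbs whatever sits between the spans.
loose = re.compile(
    r'<li>.*?<span class="title">(?P<name>.*?)</span>.*?'
    r'<span class="playable">(?P<watch>.*?)</span>.*?'
    r'<span class="rating_num" property="v:average">(?P<score>.*?)</span>',
    re.S,
)

strict_hits = [m.groupdict() for m in strict.finditer(SAMPLE_HTML)]
loose_hits = [m.groupdict() for m in loose.finditer(SAMPLE_HTML)]

print("strict:", strict_hits)
print("loose:", loose_hits)
```

One caveat with the gap version: if an entry has no playable span, the lazy .*? will keep scanning into the next entry, so the title and rating printed together may come from different movies. Making that part of the pattern optional, or parsing the HTML with a proper parser, would be more robust.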