《Python第二版》中爬虫文件的问题

哈哈怪 · 发表于 2019-7-27 16:43:43

马上注册，结交更多好友，享用更多功能^_^

您需要登录才可以下载或查看，没有账号？立即注册

x

这一段代码是小甲鱼老师改的爬百度贴吧上图片的这里代码的问题看不懂

import urllib.request
import re
import os
import requests
def open_url(url):
req = urllib.request.Request(url)
req.add_header('User-Agent','Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.100 Safari/537.36')
page = urllib.request.urlopen(req)
html = page.read().decode('utf-8')
return html
def get_img(html):
p = r'<img class="BDE_Image".*?src="[^"]*\.jpg)".*?>'
imglist = re.findall(p,html)
try:
os.mkdir('NewPics')
except FileExistsError:
pass
os.chdir('NewPics')
for each in imglist:
filename = each.split("/")[-1]
urllib.request.urlretrieve(each,filename,None)
if __name__=='__main__':
url = 'http://tieba.baidu.com/p/3823765471'
get_img(open_url(url))

复制代码

但是最后报错的内容我看不懂

哈哈怪 · 发表于 2019-7-27 16:44:27

File "F:\python\lib\re.py", line 223, in findall
return _compile(pattern, flags).findall(string)
  File "F:\python\lib\re.py", line 286, in _compile
p = sre_compile.compile(pattern, flags)
  File "F:\python\lib\sre_compile.py", line 764, in compile
p = sre_parse.parse(p, flags)
  File "F:\python\lib\sre_parse.py", line 944, in parse
raise source.error("unbalanced parenthesis")
re.error: unbalanced parenthesis at position 40

没见过这一段报错请问各位大佬这个怎么改

账号		自动登录	找回密码
密码			立即注册

《Python第二版》中爬虫文件的问题

马上注册，结交更多好友，享用更多功能^_^

浏览过的版块