Last edited by 什么鬼… on 2017-2-23 11:28
import urllib.request
import urllib.parse
import http.cookiejar
import random
email = input('Enter email: ')
password = input('Enter password: ')
url = 'https://accounts.pixiv.net/login?lang=zh&source=pc&view_type=page&ref=wwwtop_accounts_index'
cookie = http.cookiejar.CookieJar()
cjhdr = urllib.request.HTTPCookieProcessor(cookie)
agent_list = [
    'Mozilla/5.0 (Windows; U; Windows NT 5.2) AppleWebKit/525.13 (KHTML, like Gecko) Chrome/0.2.149.27 Safari/525.13',
    'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.12) Gecko/20080219 Firefox/2.0.0.12 Navigator/9.0.0.6',
    'Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1) ; QIHU 360EE)',
]
opener = urllib.request.build_opener(cjhdr)
urllib.request.install_opener(opener)
agent = random.choice(agent_list)
# Form fields for the login POST
data = {
    'password': password,
    'pixiv_id': email,
    'post_key': '42ff0b9f520223dfbc9e6106ca4254ba',
    'ref': 'wwwtop_accounts_index',
    # Raw URL here: urlencode() below percent-encodes it, so a pre-encoded value would be encoded twice
    'return_to': 'http://www.pixiv.net/',
    'source': 'pc',
}
data = urllib.parse.urlencode(data).encode()
req = urllib.request.Request(url, data, origin_req_host='https://accounts.pixiv.net/api/login?lang=zh')
req.add_header('User-Agent', agent)
# data is already attached to the Request, so it is not passed again here
response = urllib.request.urlopen(req)
content = response.read()
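The post_key above is a fixed string pasted into the script. If it is actually a per-session token that the login page embeds in a hidden form field (an assumption on my part, not something I have confirmed), it would have to be read from the page before each login. A minimal sketch of that idea follows. The sample HTML, the regular expression, and the helper names `extract_post_key` and `build_login_body` are all hypothetical, and the real page's markup may differ.

```python
import re
import urllib.parse

# Hypothetical sample of a login page; the real page's markup may differ.
SAMPLE_LOGIN_HTML = '''
<form method="POST">
  <input type="hidden" name="post_key" value="0123456789abcdef0123456789abcdef">
  <input type="hidden" name="source" value="pc">
</form>
'''

# Assumes name="..." appears before value="..." inside the same <input> tag.
POST_KEY_RE = re.compile(r'name="post_key"[^>]*?value="([^"]+)"')


def extract_post_key(html):
    """Return the post_key value found in html, or None if it is absent."""
    match = POST_KEY_RE.search(html)
    return match.group(1) if match else None


def build_login_body(pixiv_id, password, post_key):
    """URL-encode the login form fields into bytes usable as a POST body."""
    fields = {
        'pixiv_id': pixiv_id,
        'password': password,
        'post_key': post_key,
        'ref': 'wwwtop_accounts_index',
        'return_to': 'http://www.pixiv.net/',  # raw URL; urlencode() escapes it once
        'source': 'pc',
    }
    return urllib.parse.urlencode(fields).encode()


key = extract_post_key(SAMPLE_LOGIN_HTML)
body = build_login_body('user@example.com', 'secret', key)
print(key)
print(body.decode())
```

In a real run the HTML would come from a GET on the login page through the same cookie-aware opener, so that the token and the session cookie belong together.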
Python version: Python 3
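I've recently been trying to write a crawler to download images from Pixiv, but I've run into a lot of problems along the way. At first I didn't know regular expressions, so I went off and learned them. When I came back it still didn't work, and it took me a long time to realize that the site requires a login. So I switched to sending the login form with a POST request, but now I've hit another problem: running this code raises urllib.error.HTTPError: HTTP Error 400: Bad Request
I really don't know what to do now. Could someone more experienced please help?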
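The traceback only shows the status line, but an HTTPError also carries the response body, which may say more about why the request was rejected. Below is a minimal sketch of catching the error and printing that body. The helper names `describe_http_error` and `open_or_describe` are made up for illustration, and the demonstration uses a hand-built error with a hypothetical body so that nothing is sent to a server.

```python
import io
import urllib.error
import urllib.request
from email.message import Message


def describe_http_error(err):
    """Summarise an HTTPError: status code, reason, and the start of the body."""
    body = err.read().decode('utf-8', errors='replace')
    return 'HTTP {} {}: {}'.format(err.code, err.reason, body[:200])


def open_or_describe(req):
    """Send req; return (body_bytes, None) on success or (None, summary) on an HTTP error."""
    try:
        with urllib.request.urlopen(req) as response:
            return response.read(), None
    except urllib.error.HTTPError as err:
        return None, describe_http_error(err)


# Offline demonstration with a hand-built error (hypothetical URL and body).
fake = urllib.error.HTTPError(
    'https://example.invalid/login', 400, 'Bad Request', Message(),
    io.BytesIO(b'{"error": "hypothetical message"}'))
print(describe_http_error(fake))
```

With the script above, calling `open_or_describe(req)` in place of the bare `urlopen` call would return the summary text without raising.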