|
15鱼币
本帖最后由 什么鬼… 于 2017-2-23 11:28 编辑
- import urllib.request
- import urllib.parse
- import http.cookiejar
- import random
- email = input('输入邮箱:')
- password=input('输入密码:')
- url = 'https://accounts.pixiv.net/login?lang=zh&source=pc&view_type=page&ref=wwwtop_accounts_index'
- cookie = http.cookiejar.CookieJar()
- cjhdr = urllib.request.HTTPCookieProcessor(cookie)
- agent_list = ['Mozilla/5.0 (Windows; U; Windows NT 5.2) AppleWebKit/525.13 (KHTML, like Gecko) Chrome/0.2.149.27 Safari/525.13 '
- ,'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.12) Gecko/20080219 Firefox/2.0.0.12 Navigator/9.0.0.6'
- ,'Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1) ; QIHU 360EE)'
- ]
- opener =urllib.request.build_opener(cjhdr)
- urllib.request.install_opener(opener)
- agent = random.choice(agent_list)
- data = {}
- data['password']=password
- data['pixiv_id']=email
- data['post_key']= '42ff0b9f520223dfbc9e6106ca4254ba'
- data['ref']='wwwtop_accounts_index'
- data['return_to'] = 'http%3A%2F%2Fwww.pixiv.net%2F'
- data['source']='pc'
- data = urllib.parse.urlencode(data).encode()
- req = urllib.request.Request(url,data,origin_req_host='https://accounts.pixiv.net/api/login?lang=zh')
- req.add_header('User-Agent',agent)
- response = urllib.request.urlopen(req,data)
- content = response.read()
复制代码
python版本:python3
最近想写一个爬虫来爬p站的图片,然而途中遇到了很多的问题,一开始是正则表达式不会然后跑去学正则表达式,学完了之后再回去发现不行,看了好久才发现要登录,于是改用post来发送表单,然而现在又遇到了问题,这段代码执行之后就是urllib.error.HTTPError: HTTP Error 400: Bad Request
现在是真的不知道怎么办了,大佬们求助啊 |
|