|
发表于 2020-6-24 14:14:59
|
显示全部楼层
代理和网站都有问题。
新人先练习基础的,httpbin.org这个网站可以练习http爬虫。
1.get
- from urllib import request
- from urllib import parse
- headers = {'User-Agent': 'Firefox'}
- req = request.Request('http://httpbin.org/get', headers=headers)
- r = request.urlopen(req)
- print(r.read().decode('utf-8'))
复制代码
2.post
- from urllib import request
- from urllib import parse
- headers = {'User-Agent': 'Firefox'}
- data = {'name': 'haha','time': '20200624'}
- b_data = bytes(parse.urlencode(data), encoding='utf-8')
- req = request.Request('http://httpbin.org/post', data=b_data, headers=headers, method='POST')
- r = request.urlopen(req)
- print(r.read().decode('utf-8'))
复制代码
|
|