Last edited by 神奇的yxq on 2017-10-10 20:17
The code is as follows:
import urllib.request
import pickle
import random
import re

url = "http://myip.kkcha.com/"
# Match dotted-quad IPv4 addresses (1 to 3 digits per octet).
ip_pat = re.compile(r"\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}")

# First request: no proxy.
print("connecting...")
response = urllib.request.urlopen(url)
print("reading...")
content = response.read()
print("decoding...")
html = content.decode("utf-8")
print("no proxy:", ip_pat.findall(html))

# Load the scraped proxy list; each entry is a (scheme, "host:port") tuple.
with open("proxy_ip_list.pickle", "rb") as f:
    ip_list = pickle.load(f)
print("proxy ip had loaded! nums=", len(ip_list))

# Second request: through a randomly chosen proxy.
ip = random.choice(ip_list)
proxy = urllib.request.ProxyHandler({ip[0]: ip[1]})
opener = urllib.request.build_opener(proxy)
opener.addheaders = [("Connection", "keep-alive"), ("User-Agent", "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36 QIHU 360EE")]
urllib.request.install_opener(opener)
req = urllib.request.Request(url)
print("connecting...")
response2 = urllib.request.urlopen(req)
print("reading...")
content2 = response2.read()
print("decoding...")
html2 = content2.decode("utf-8")
print("with proxy:", ip_pat.findall(html2))

print("proxy IP list")
for each in ip_list:
    print(each)
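Question:
The imported proxy_ip_list is the latest IP list, which I scraped with a crawler from Xici (西刺): http://www.xicidaili.com/.
With the proxy set up as in the code above, the request does not appear to go through the proxy at all.
The output is as follows: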
---------------------------
connecting...
reading...
decoding...
no proxy: ['117.151.103.208']
proxy ip had loaded! nums= 100
connecting...
reading...
decoding...
with proxy: ['117.151.103.208']
proxy IP list
('HTTP', '117.78.37.198:8000')
('HTTPS', '110.73.43.36:8123')
... (rest of the IP list omitted)
----------------------------
I can't figure out where it goes wrong. If the proxy server itself were the problem, there should have been no response at all.
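One possible cause, offered as a guess rather than a confirmed diagnosis: urllib picks a proxy by the scheme of the requested URL, which is lowercase `http` here. A proxy registered under the key `'HTTPS'` is not used for an `http://` URL, and depending on the Python version an uppercase `'HTTP'` key may not be matched either, so the request silently goes out directly. The sketch below lowercases the scheme and only picks proxies that match the target URL. The helper names `proxies_for` and `make_opener` are hypothetical, and the sample entries are the two shown in the output above.

```python
import random
import urllib.request
from urllib.parse import urlsplit


def proxies_for(url, ip_list):
    """Return (scheme, address) entries whose scheme matches the URL's scheme.

    Schemes are lowercased so that 'HTTP' in the scraped list matches 'http'.
    """
    scheme = urlsplit(url).scheme.lower()
    return [(s.lower(), addr) for s, addr in ip_list if s.lower() == scheme]


def make_opener(url, ip_list):
    """Build an opener that uses a randomly chosen proxy matching the URL."""
    candidates = proxies_for(url, ip_list)
    if not candidates:
        raise ValueError("no proxy in the list matches the URL scheme")
    scheme, addr = random.choice(candidates)
    # The key is the lowercase protocol name, e.g. {'http': '117.78.37.198:8000'}.
    handler = urllib.request.ProxyHandler({scheme: addr})
    return urllib.request.build_opener(handler)


# Sample data copied from the output in this post (not verified to be live).
sample = [("HTTP", "117.78.37.198:8000"), ("HTTPS", "110.73.43.36:8123")]
print(proxies_for("http://myip.kkcha.com/", sample))
```

In the script above, something like `opener = make_opener(url, ip_list)` followed by `opener.open(url)` could stand in for the `ProxyHandler` and `install_opener` steps. Free proxies are often dead, so a timeout or connection error at that point would at least indicate that the proxy is really being tried.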