This post was last edited by mozzielx on 2016-7-28 14:52
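1. When I use a proxy IP, either the connection fails or the request still goes out from my host's own IP. Why?
2. I want to build ip_dict = dict(zip(keys, values)), but the result contains only two elements. Why?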
- # -*- coding: utf-8 -*-
- """
- Created on Thu Jul 28 13:15:02 2016
- @author: mozzielx
- """
- import urllib.request
- import random
- from urllib.request import urlopen,Request
- from bs4 import BeautifulSoup
- ip_dict={}
- keys=[]
- values=[]
- #with open('代理ip.txt' , 'w') as f:
- for page in range(1, 3):  # fetches pages 1 and 2 (use range(1, 10) for the first 9 pages)
-     url = 'http://www.xici.net.co/nn/' + str(page)
-     req = Request(url)
-     req.add_header('User-Agent', 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/51.0.2704.84 Safari/537.36')
-     html_doc = urlopen(req).read()
-
-     soup = BeautifulSoup(html_doc, 'lxml')
-     trs = soup.find('table', id='ip_list').find_all('tr')
-     for tr in trs[1:]:  # skip the table's header row
-         tds = tr.find_all('td')
-         ip = tds[1].text.strip()
-         port = tds[2].text.strip()
-         protocol = tds[5].text.strip()
-         keys.append(protocol)  # protocol of this row
-         values.append(ip + ':' + port)  # matching "ip:port" string
- ip_dict = dict(zip(keys, values))
- # use the proxy
- ip_num = random.randrange(len(keys))  # valid indexes are 0 .. len(keys) - 1
- proxy_support = urllib.request.ProxyHandler({keys[ip_num]:values[ip_num]})
- print(keys[ip_num], values[ip_num])  # print the chosen proxy
- opener = urllib.request.build_opener(proxy_support)
- opener.addheaders = [('User-Agent','Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/51.0.2704.84 Safari/537.36')]
- urllib.request.install_opener(opener)
- # check which IP is actually used and compare it with the one printed above
- url = 'http://www.whatismyip.com.tw'  # site that reports the caller's IP address
- temp = urlopen(url)
- html = temp.read().decode('utf-8')
- print(html)
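On question 1, one possible cause (an assumption, not something confirmed in this thread) is the key handed to ProxyHandler: the scraped protocol column is uppercase ('HTTP', 'HTTPS'), while urllib looks proxies up by lowercase URL scheme, and some Python 3 releases register the key exactly as given. Lowercasing it is harmless either way. The sketch below also sets the User-Agent through `addheaders`. The helper name `build_proxy_opener` and the sample address are made up for illustration.

```python
import random
import urllib.request


def build_proxy_opener(proxies, user_agent):
    """Build an opener that routes requests through one randomly chosen proxy.

    proxies: list of (protocol, 'ip:port') pairs, e.g. [('HTTP', '1.2.3.4:8080')]
    (hypothetical sample data, not a working proxy).
    """
    protocol, address = random.choice(proxies)
    # Lowercase the scheme so the handler is registered as 'http' / 'https'.
    handler = urllib.request.ProxyHandler({protocol.lower(): address})
    opener = urllib.request.build_opener(handler)
    # Default headers sent with every request made through this opener.
    opener.addheaders = [('User-Agent', user_agent)]
    return opener, handler
```

The returned opener can be used directly, for example `opener.open(url, timeout=10)`, instead of being installed globally. Free proxies from a public list are often dead, so a failed connection may simply mean the chosen proxy is down; wrapping the call in a try/except and picking another entry would be a reasonable next step.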
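On question 2, the likely explanation is that dictionary keys must be unique. Here keys holds the protocol of each row, which presumably takes only two distinct values on that page, so every later row overwrites the earlier one stored under the same key. A minimal sketch with made-up addresses, grouping the proxies by protocol instead (the helper name `group_by_protocol` is hypothetical):

```python
from collections import defaultdict


def group_by_protocol(keys, values):
    """Collect every 'ip:port' string under its protocol instead of overwriting."""
    grouped = defaultdict(list)
    for protocol, address in zip(keys, values):
        grouped[protocol].append(address)
    return dict(grouped)


# Made-up sample rows standing in for the scraped table.
keys = ['HTTP', 'HTTPS', 'HTTP', 'HTTPS']
values = ['10.0.0.1:80', '10.0.0.2:443', '10.0.0.3:8080', '10.0.0.4:8443']

# Only two entries survive: one per distinct protocol, holding the last row seen.
collapsed = dict(zip(keys, values))

# Every address is kept, grouped under its protocol.
grouped = group_by_protocol(keys, values)
```

If a flat collection is enough, keeping the pairs as a list with `list(zip(keys, values))` avoids the problem as well and works directly with `random.choice`.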