ÓãCÂÛ̳

 ÕÒ»ØÃÜÂë
 Á¢¼´×¢²á
²é¿´: 740|»Ø¸´: 16

ΪʲôͼƬûÏÂÔØÏÂÀ´£¬Ò²Ã»±¨´í

[¸´ÖÆÁ´½Ó]
·¢±íÓÚ 2018-10-19 10:03:53 | ÏÔʾȫ²¿Â¥²ã |ÔĶÁģʽ

ÂíÉÏ×¢²á£¬½á½»¸ü¶àºÃÓÑ£¬ÏíÓøü¶à¹¦ÄÜ^_^

ÄúÐèÒª µÇ¼ ²Å¿ÉÒÔÏÂÔØ»ò²é¿´£¬Ã»ÓÐÕ˺ţ¿Á¢¼´×¢²á

x
import urllib.request
import os

def get_url(url):

    headers = {
        'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.100 Safari/537.36'
    }
    req = urllib.request.Request(url,headers=headers)
    response = urllib.request.urlopen(req)
    html = response.read()
    return html


def url_img(url):
    img_addrs = []
    html = get_url(url).decode('utf-8')
    a = html.find('img src=')
    while a != -1:
        b = html.find('.jpg', a, a + 255)
        if b != -1:
            img_addrs.append(html[a:b] + '.jpg')
        else:
            print('ÕÒ²»µ½Í¼Æ¬µØÖ·')
        a = html.find('img src',b)
    return img_addrs
    #print(url_img(url))                              #»ñÈ¡ÁбíÄÚÒ³ÃæµÚÒ»Ò³ËùÓÐͼƬµØÖ·
#print(len(url_img(url)))                         #ÁбíÄÚÓжàÉÙͼƬ

def save_imgs(folder,url_img):

    for each in url_img:
        img_url = each.split('"')[1]
        with open(img_url,'wb') as f:
            img = get_url(each)
            f.write(img)

def download_mm(folder='katong',pages=10):
    os.mkdir(folder)
    os.chdir(folder)

    url = 'http://sc.chinaz.com/tupian/katongtupian_2.html'

    for i in range(1,pages):
        i += 1
        get_img = 'http://sc.chinaz.com/tupian/katongtupian' + '_' + str(i) + '.html'
        img_addrs = url_img(get_img)
        save_imgs(folder,img_addrs)


if __name__ == '__main__':
    download_mm()
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:04:40 | ÏÔʾȫ²¿Â¥²ã
µÚÒ»´ÎдÅÀ³æ£¬²»ÖªµÀΪɶûÅÀµ½ÎļþÀÇó´óÀÐÖ»ÕÐ
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
·¢±íÓÚ 2018-10-19 10:12:51 | ÏÔʾȫ²¿Â¥²ã
ÄãÉó²éÔªËØÁËô£¬ÀïÃæ²»ÊÇimg src=ÕâÖÖģʽ£¬²»ÄÜÖ±½ÓÓÃС¼×ÓãÅÀÃÃ×ÓͼµÄ´úÂë
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:15:23 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:12
ÄãÉó²éÔªËØÁËô£¬ÀïÃæ²»ÊÇimg src=ÕâÖÖģʽ£¬²»ÄÜÖ±½ÓÓÃС¼×ÓãÅÀÃÃ×ÓͼµÄ´úÂë

ÊÇÄÇÖÖģʽ
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:15:55 | ÏÔʾȫ²¿Â¥²ã
http://sc.chinaz.com/tupian/katongtupian_2.html
ÎÒÅ¿µÄÊÇÕâ¸öÍøÒ³
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
·¢±íÓÚ 2018-10-19 10:18:10 | ÏÔʾȫ²¿Â¥²ã
¾ÍÊÇÄãÕâ¸öÍøÒ³
1.png
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:19:38 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:18
¾ÍÊÇÄãÕâ¸öÍøÒ³

ÔõôÄãµÄ¸úÎҵIJ»Ò»Ñù°¡ £¬ÎÒµÄÊÇÕý³£ÍøÖ·
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:20:28 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:18
¾ÍÊÇÄãÕâ¸öÍøÒ³

<img alt="ÀøÖ¾µÄ½äÑÌͼƬ" src="http://pic.sc.chinaz.com/Files/pic/pic9/201806/zzpic12352_s.jpg">
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:21:07 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:18
¾ÍÊÇÄãÕâ¸öÍøÒ³

ѽ£¬¿´´íÁË£¬²»ºÃÒâ˼
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:22:30 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:18
¾ÍÊÇÄãÕâ¸öÍøÒ³

¶¼ÊÇÒ»ÑùµÄÈö£¬Ê²Ã´ÎÊÌâ´óÀС£ÊDz»ÊÇÎÒû½âÎöÍøÒ³£¿
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
·¢±íÓÚ 2018-10-19 10:27:18 | ÏÔʾȫ²¿Â¥²ã
ʵ¼ÊÉÏÓÃrequests´òÓ¡£¬´úÂëºÍÍøÒ³»¹²»Ò»Ñù£¬ÊÇsrc2=
½¨ÒéÄã»»¸öÍøÕ¾£¬»òÕßÍùºóѧѧ
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:32:40 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:27
ʵ¼ÊÉÏÓÃrequests´òÓ¡£¬´úÂëºÍÍøÒ³»¹²»Ò»Ñù£¬ÊÇsrc2=
½¨ÒéÄã»»¸öÍøÕ¾£¬»òÕßÍùºóѧѧ

ÎÒÓÃdecode('utf-8')½âÂëÁË£¬ÔÚpacharmÉÏûËѲ»µ½img src=
²»½âÂë¾ÍÄÜËѵ½ÁË
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
·¢±íÓÚ 2018-10-19 10:51:09 | ÏÔʾȫ²¿Â¥²ã
import requests
import os
from bs4 import BeautifulSoup as bs

def get_url(url):

    headers = {
        'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.100 Safari/537.36'
    }
    res=requests.get(url,headers=headers)

    return res


def url_img(url):
    img_addrs = []
    soup=bs(get_url(url).text,'html.parser')
    for each in soup.find_all(name='img'):
        img_addrs.append(each['src2'])
    return img_addrs

def save_imgs(folder,url_img):

    for each in url_img:
        img=each.split('/')[-1]
        try:
            with open(img,'wb') as f:
                im = get_url(each).content
                f.write(im)
        except:
            print('Ò»ÕżÙͼ')

def download_mm(folder='katong',pages=5):
    os.mkdir(folder)
    os.chdir(folder)
    for i in range(1,pages):
        i += 1
        get_img = 'http://sc.chinaz.com/tupian/katongtupian' + '_' + str(i) + '.html'
        img_addrs = url_img(get_img) 
        save_imgs(folder,img_addrs)


if __name__ == '__main__':
    download_mm()
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 10:56:53 | ÏÔʾȫ²¿Â¥²ã

¿ÉÒÔµÄÐֵܣ¬µ«ÊÇ¿´²»¶®Õâ¸öfrom bs4 import BeautifulSoup as bsÄ£¿éÕ¦ÓõÄ
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
·¢±íÓÚ 2018-10-19 10:58:09 | ÏÔʾȫ²¿Â¥²ã
Äãѧµ½¼«¿ÍϵÁоÍÖªµÀÁË£¬
¸Õѧrequest½¨ÒéÅÀµã°Ù¶ÈÌù°ÉʲôµÄÁ·Á·ÊÖ
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
 Â¥Ö÷| ·¢±íÓÚ 2018-10-19 11:02:01 | ÏÔʾȫ²¿Â¥²ã
ËþÀû°à ·¢±íÓÚ 2018-10-19 10:58
Äãѧµ½¼«¿ÍϵÁоÍÖªµÀÁË£¬
¸Õѧrequest½¨ÒéÅÀµã°Ù¶ÈÌù°ÉʲôµÄÁ·Á·ÊÖ

°Ù¶ÈÌù°É£¿ÅÀÎÄ×Ö£¿
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
·¢±íÓÚ 2018-10-19 11:07:21 | ÏÔʾȫ²¿Â¥²ã
923204485 ·¢±íÓÚ 2018-10-19 11:02
°Ù¶ÈÌù°É£¿ÅÀÎÄ×Ö£¿

ÄãÅÀͼƬÎÄ×Ö¶¼ÐУ¬²»¹ýÒ²²»Ò»¶¨¶¼ÊÇ¿ÉÒÔÅÀµÄ£¬ÏÈÊÔ¼¸¸ö£¬ÄÜÅÀÁË£¬½¨Òé¾Í¿ÉÒÔÖ±½ÓÌø¹ýtkinterºÍpygameÏÈÈ¥°ÑС¼×Ó㼫¿ÍµÄÅÀ³æ¿´ÁË£¬»ØÍ·ÔÙ¿´tkinterºÍpygame
ÏëÖªµÀС¼×Óã×î½üÔÚ×öɶ£¿Çë·ÃÎÊ -> ilovefishc.com
ÄúÐèÒªµÇ¼ºó²Å¿ÉÒÔ»ØÌû µÇ¼ | Á¢¼´×¢²á

±¾°æ»ý·Ö¹æÔò

СºÚÎÝ|ÊÖ»ú°æ|Archiver|ÓãC¹¤×÷ÊÒ ( ÔÁICP±¸18085999ºÅ-1 | ÔÁ¹«Íø°²±¸ 44051102000585ºÅ)

GMT+8, 2024-10-7 01:29

Powered by Discuz! X3.4

© 2001-2023 Discuz! Team.

¿ìËٻظ´ ·µ»Ø¶¥²¿ ·µ»ØÁбí