Scraping the Douban Top 250 (regular expressions), Python Discussion, Programming Languages Section, FishC Forum

Pythonnewers posted on 2020-5-6 10:07:13

Scraping the Douban Top 250 (regular expressions)

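import re

import requests

headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.14 Safari/537.36 Edg/83.0.478.13"
}
rank = 1
# Write as UTF-8 so the Chinese titles are saved correctly on any platform.
with open("豆瓣前250电影.txt", "w", encoding="utf-8") as writer:
    # The list spans 10 pages of 25 films each; "start" is the offset of the
    # first film on a page, so it must step by 25 (0, 25, ..., 225).
    for page in range(10):
        html = requests.get("https://movie.douban.com/top250?start=" +
                            str(page * 25) + "&filter=", headers=headers).text
        titles = re.findall('<span class="title">(.+?)</span>', html, re.S)
        for title in titles:
            # Alternate titles begin with "&nbsp;/&nbsp;"; keep only the primary title.
            if not title.startswith("&nb"):
                writer.write("No. " + str(rank) + ": " + title + "\n")
                rank += 1
                print(title)
print("Done")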
It also writes the results to a txt file.
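
For anyone who wants to check the regex and the paging without hitting the site, here is a minimal sketch. It assumes the list shows 25 films per page, so "start" takes the offsets 0, 25, ..., 225. The names page_urls and extract_primary_titles are hypothetical helpers, and the sample markup is made up for illustration rather than copied from the live page.

```python
import re

# Hypothetical helper names; the URL and the pattern come from the post above.
BASE_URL = "https://movie.douban.com/top250?start={}&filter="
TITLE_RE = re.compile(r'<span class="title">(.+?)</span>', re.S)


def page_urls(pages=10, per_page=25):
    """Return one URL per page; "start" is the offset of the first film."""
    return [BASE_URL.format(page * per_page) for page in range(pages)]


def extract_primary_titles(html):
    """Return primary titles, skipping alternates that start with "&nbsp;"."""
    return [t for t in TITLE_RE.findall(html) if not t.startswith("&nbsp;")]


# Sample markup written for this sketch, not taken from the real site.
sample = (
    '<span class="title">肖申克的救赎</span>'
    '<span class="title">&nbsp;/&nbsp;The Shawshank Redemption</span>'
    '<span class="title">霸王别姬</span>'
)
print(extract_primary_titles(sample))
print(page_urls()[:2])
```

Keeping the parsing in its own function means the filter can be tested on a saved page before running the full crawl.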

Mike_python小 posted on 2020-5-27 08:15:08

@liuzhengyuan @heidern0612 come take a look

oooipussy posted on 2020-6-20 17:41:06

Learned something~

java2python posted on 2020-6-20 18:52:26

Studying this, thanks.
