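Scraping Taobao with Python and analyzing word frequency, Newbie Exchange, Newbie Training Camp, 鱼C论坛 (FishC Forum)

小人 posted on 2017-9-23 15:38:10

Scraping Taobao with Python and analyzing word frequency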

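from urllib import parse, request
import re
from jieba import analyse

# URL-encode the search keyword so it can go into the query string
search = parse.quote('哲♂学')

# Append the item titles from the first 10 result pages to 1.txt
# (mode 'a' means repeated runs keep adding to the same file)
with open('1.txt', 'a', encoding='utf-8') as f:
    for i in range(10):
        print('Reading page ' + str(i + 1) + '...')
        # The s parameter is the result offset; it advances by 44 per page
        url = 'https://s.taobao.com/search?q=' + search + '&s=' + str(i * 44)
        response = request.urlopen(url).read().decode('utf-8')
        # Titles are embedded in the page source as "raw_title":"..." fields
        title = re.findall(r'"raw_title":"([^"]+)"', response)
        for each in title:
            f.write(each + '\n')

# Read back everything collected and extract the top 100 keywords
with open('1.txt', encoding='utf-8') as f:
    content = f.read()

tags = analyse.extract_tags(content, topK=100, withWeight=False)
print(tags)
text = " ".join(tags)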
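
One thing worth noting: `analyse.extract_tags` ranks keywords by TF-IDF weight, so the list above is not a raw frequency count. If plain counts are what you want, something like the following might work. It is only a rough sketch: the function names `extract_titles` and `word_counts`, the `tokenize` hook, and the sample page fragment are all made up for illustration, and the default tokenizer just splits on whitespace so that the snippet runs without jieba. For real Chinese titles you would probably pass `jieba.lcut` as the tokenizer.

```python
import re
from collections import Counter

# Same pattern as the post: titles appear as "raw_title":"..." in the page source
TITLE_RE = re.compile(r'"raw_title":"([^"]+)"')


def extract_titles(page_source):
    """Return every raw_title value found in one page of search results."""
    return TITLE_RE.findall(page_source)


def word_counts(titles, tokenize=str.split, top=10):
    """Count how often each token occurs across all titles.

    tokenize is a hypothetical hook: str.split only handles
    space-separated text; for Chinese titles pass jieba.lcut instead.
    """
    counter = Counter()
    for title in titles:
        # Skip tokens that are empty or pure whitespace
        counter.update(tok for tok in tokenize(title) if tok.strip())
    return counter.most_common(top)


# Made-up page fragment for illustration, not real Taobao output
sample = '{"raw_title":"red cotton shirt"},{"raw_title":"blue cotton shirt"}'
titles = extract_titles(sample)
print(word_counts(titles, top=2))
```

With the scraper above you could call `word_counts(open('1.txt', encoding='utf-8').read().splitlines(), tokenize=jieba.lcut)` after `import jieba`, assuming the file was written as UTF-8.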

BlackMan posted on 2017-11-5 20:47:09

Nice

草莓棉花糖之恋 posted on 2019-11-24 18:14:40

Pretty nice
