|
|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
新手一枚,正在努力学习中,碰到困难了,求大神求助
拿爬取淘宝评论练练手,可是不怎么会写代码
附上代码
import re
import requests as rq
import pandas as pd
#爬取淘宝评论前9页
list1 = range(1,10)
for i in list1:
list=[]
url = 'https://rate.tmall.com/list_detail_rate.htm?itemId=42105400616&spuId=305417465&sellerId=2267266991&order=3¤tPage=(i)'
myweb = rq.get(url)
myjson = re.findall(r'"rateList":(.*?),"tags"',myweb.text)[0].rstrip(',"searchinfo":""')
mytable = pd.read_json(myjson)
list.append(mytable)
print(list[0])
附上错误:
Traceback (most recent call last):
File "C:/Users/11055/PycharmProjects/urll.py", line 11, in <module>
myjson = re.findall(r'"rateList":(.*?),"tags"',myweb.text)[0].rstrip(',"searchinfo":""')
IndexError: list index out of range
请大神指点指点~ |
|