whowho posted on 2021-1-16 17:22:46

Why does this crawler code

import urllib.request
import urllib.parse
import json

content = input("Enter the text to translate:")

url = 'http://fanyi.youdao.com/translate_o?smartresult=dict&smartresult=rule'
data = {}
data['type'] = 'AUTO'
data['i'] = content
data['doctype'] = 'json'
data['xmlVersion'] = '1.6'
data['keyfrom'] = 'fanyi.web'
data['ue'] = 'UTF-8'
data['typoResult'] = 'true'
data = urllib.parse.urlencode(data).encode('utf-8')

response = urllib.request.urlopen(url,data)
html = response.read().decode('utf-8')

target = json.loads(html)
print("Translation result:%s" % (target['translateResult']['tgt']))

Enter the text to translate:分数
Traceback (most recent call last):
  File "F:\python练习\translation.py", line 22, in <module>
    print("Translation result:%s" % (target['translateResult']['tgt']))
KeyError: 'translateResult'
>>>


suchocolate posted on 2021-1-17 10:35:05

For reference:
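from urllib import request, parse
import json

wd = input('Enter the text to translate:')
# Note: the endpoint is "translate", not "translate_o"
trans = 'http://fanyi.youdao.com/translate?smartresult=dict&smartresult=rule'
# A browser-like User-Agent makes the request less likely to be rejected
headers = {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:72.0) Gecko/20100101 Firefox/72.0"}
data = {"doctype": "json", 'i': wd}
# POST bodies must be bytes, so URL-encode the form and then encode it as UTF-8
b_data = bytes(parse.urlencode(data), encoding='utf-8')
q = request.Request(url=trans, data=b_data, headers=headers, method='POST')
r = request.urlopen(q)
result = json.loads(r.read().decode('utf-8'))
# translateResult is a nested list, so index into it before reading 'tgt'
print(result['translateResult'][0][0]['tgt'])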

whowho posted on 2021-1-17 15:29:29

suchocolate posted on 2021-1-17 10:35
For reference:

So what is wrong with my code?

suchocolate posted on 2021-1-17 15:53:49

whowho posted on 2021-1-17 15:29
So what is wrong with my code?

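1. The URL is wrong: remove the "_o", so the path is /translate rather than /translate_o.
2. The request carries no headers, so it is easily blocked by anti-scraping checks.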
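
To make the two points above concrete, here is a rough sketch that only builds the request and checks the decoded reply, without sending anything. The helper names build_request and extract_translation are made up for illustration. The reply shape (a nested list under 'translateResult') is an assumption about what this endpoint returns, not something confirmed in this thread.

```python
import json
from urllib import parse, request

# Endpoint without the "_o" suffix (point 1).
TRANSLATE_URL = "http://fanyi.youdao.com/translate?smartresult=dict&smartresult=rule"
# A browser-like User-Agent (point 2); the string is the one used earlier in this thread.
HEADERS = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:72.0) "
                  "Gecko/20100101 Firefox/72.0"
}


def build_request(text):
    """Build (but do not send) the POST request for `text`. Hypothetical helper."""
    body = parse.urlencode({"doctype": "json", "i": text}).encode("utf-8")
    return request.Request(TRANSLATE_URL, data=body, headers=HEADERS, method="POST")


def extract_translation(payload):
    """Return the translated text from a decoded JSON reply. Hypothetical helper.

    Assumption: a successful reply looks like
    {"translateResult": [[{"src": ..., "tgt": ...}]]}.
    """
    if "translateResult" not in payload:
        # This is the situation behind the KeyError in the first post:
        # the server answered, but not with a translation.
        raise ValueError("no translateResult in response: %s" % json.dumps(payload))
    return payload["translateResult"][0][0]["tgt"]
```

To complete the round trip, pass build_request(wd) to request.urlopen, decode the body with json.loads, and hand the result to extract_translation. Printing the raw payload when the key is missing shows what the server actually returned instead of a bare KeyError.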
