|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
按小甲鱼写的代码:
- import requests
- import re
- import json
- def open_url(keyword):
- keyword = "零基础入门学习Python"
- url = "https://s.taobao.com/search?q={}&sort=sale-desc".format(keyword)
- headers = {"user-agent":"Mozilla/5.0 (Windows NT 10; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.90 Safari/537.36"}
- res = requests.get(url, headers = headers)
- return res
- def main():
- keyword = input("请输入要查询的商品:")
- res = open_url(keyword)
- with open("items.txt", "w", encoding = "utf-8") as f:
- f.write(res.text)
- if __name__ == "__main__":
- main()
复制代码
得到的文件:
- <!DOCTYPE html>
- <html>
- <head>
- <meta name="data-spm" content="a2107" />
- <title>
- 淘宝网 - 淘!我喜欢
- </title>
- <meta charset="gbk" />
- <meta http-equiv="X-UA-Compatible" content="IE=edge" />
- <meta name="viewport"
- content="width=device-width, initial-scale=1, maximum-scale=1" />
- <meta name="description"
- content="淘宝网(Taobao.com)作为专业的购物网站拥有全球时尚前沿的消费者购物集市,100%认证网上商城及超值二手商品区,同时购物安全,产品丰富,应有尽有,任你选购,让你尽享网上在线购物乐趣!" />
- <meta name="keywords"
- content="淘宝,掏宝,网上购物,C2C,在线交易,交易市场,网上交易,交易市场,网上买,网上卖,购物网站,团购,网上贸易,安全购物,电子商务,放心买,供应,买卖信息,网店,一口价,拍卖,网上开店,网络购物,打折,免费开店,网购,频道,店铺" />
- <script>
- window._lgst_ = new Date().getTime();
- </script>
- <script>
- //全登陆不允许iframe嵌入
- if (window.top !== window.self) {
- window.top.location = window.location;
- }
- </script>
- <link rel="shortcut icon" href="//img.alicdn.com/favicon.ico" type="image/x-icon" />
- <link type="text/css" href="//g.alicdn.com/sd/ncpc/nc.css?t=2018112615" rel="stylesheet" />
- <link rel="stylesheet"
- />
- <link rel="stylesheet"
- />
- <script src="https://g.alicdn.com/kissy/k/1.4.4/seed-min.js"></script>
- </head>
- <body class="chl-reg" data-spm="1"><script>
- with(document)with(body)with(insertBefore(createElement("script"),firstChild))setAttribute("exparams","category=&userid=&aplus&yunid=&&trid=0b1569e515432189800218480e1daa&asid=AQAAAAAkp/tbgX0jagAAAAAmt954U4Ge8g==",id="tb-beacon-aplus",src=(location>"https"?"//g":"//g")+".alicdn.com/alilog/mlog/aplus_v2.js")
- </script>
- <div id="page" class="">
- <div id="header" class="clearfix" data-spm="1000340">
- <div class="logo">
- <h1><a href="//www.taobao.com" title="淘宝网"><i class="iconfont tb-logo-cn"></i><i
- class="iconfont tb-logo-url"></i></a></h1>
- <div class="login-msg error" style="text-align: center;width: 880px; margin: 0 auto;">
- <p class="error" style="float: none;width: auto;">
- 为确保您账户的安全及正常使用,依《网络安全法》相关要求,6月1日起会员账户需绑定手机。如您还未绑定,请尽快完成,感谢您的理解及支持!
- </p>
- </div>
- <a class="link">
- <i class="iconfont"></i> "登录页面"改进建议
- </a>
- </div>
- </div>
- <div id="content"
- <div class="login-newbg" style="background-image: url(https://gtms04.alicdn.com/tps/i4/TB1VeAIrkzoK1RjSZFlSuui4VXa.jpg);height:600px;">
- <input type="hidden" id="J_adUrl" name="adUrl" value="">
- <input type="hidden" id="J_adImage" name="adImage" value="">
- <input type="hidden" id="J_adText" name="adText" value="">
- <input type="hidden" id="J_viewFd4PC" name="viewFd4PC" value="">
- <input type="hidden" id="J_viewFd4Mobile" name="viewFd4Mobile" value="">
- </div>
- <div class="login-adlink">
- <a href="https://pages.tmall.com/wow/act/20716/ppkc?wh_biz=tm&wh_weex=true" target="_blank" onclick="javascript:goldlog.record('/member.11.2','','','H46777405')"></a>
- </div>
- <div class="content-layout">
- <div class="login-box-warp">
- <div class="login-box no-longlogin "
- id="J_LoginBox">
- <script type="text/javascript">
- (function(x) {
- x && (x.className += " loading")
- })(document.getElementById("J_LoginBox"))
- </script>
- <!--login box begin-->
-
- <div class="hd">
-
- <div class="login-switch" onselectstart="return false;">
- <i class="iconfont quick" id="J_Static2Quick" onclick="javascript:goldlog.record('/member.13.1','','','H46777383')"></i>
- <i class="iconfont static" id="J_Quick2Static"></i>
- </div>
- <div class="login-tip">
- <div class="poptip">
- <div class="poptip-arrow">
- <em></em>
- <span></span>
- </div>
- <div class="poptip-content">
- <i class="iconfont"></i>扫码登录更安全
- </div>
- </div>
- </div>
-
- </div>
- <div class="bd">
- <!--登录的错误信息结束-->
-
- <div id="J_QuickLogin" class="ww-login hidden">
- <form action="" class="ww-form">
- <div class="login-title">
- 选择其中一个已登录的账户
- </div>
- <div class="ww-userlist">
- </div>
- <div class="trigger">
- </div>
- <div class="submit">
- <button type="submit" class="J_Submit" id="J_SubmitQuick">登 录</button>
- </div>
- <div class="other-login">
- <a href="" class="light-link" id="J_Sso2Static">使用其他账户登录</a>
- </div>
- </form>
- </div>
- <div id="J_MiserLogin" class="ww-login hidden">
- <form action="" class="ww-form">
- <input type="hidden" id="x_token" value="">
- <div class="login-title">
- 选择其中一个已登录的账户
- </div>
- <div class="ww-userlist">
- </div>
- <div class="trigger">
- </div>
- <div class="submit">
- <p><input type="checkbox" id="J_Mdelete"> 不再记住密码</p>
- <button type="submit" tabindex="5"
- id="J_SubmitMiser">登 录</button>
- </div>
- <div class="other-login">
- <a href="" title="使用其他账户登录" class="light-link" id="J_Miser2Static">使用其他账户登录</a>
- </div>
- </form>
- </div>
-
- <!--标准登录框-->
- <div class="static-form " id="J_StaticForm">
- <div class="login-title">
-
-
- 密码登录
-
- </div>
-
- <form action="/member/login.jhtml?redirectURL=http%3A%2F%2Fs.taobao.com%2Fsearch%3Fq%3D%E9%9B%B6%E5%9F%BA%E7%A1%3F%E5%85%A5%E9%97%A8%E5%AD%A6%E4%B9%A0Python" method="post" id="J_Form">
- <div id="J_Message"
- style="display:none;"
- class="login-msg error">
- <i class="iconfont"></i>
-
- <p class="error"></p>
-
- </div>
- <!-- 手机号登录 -->
-
- <div class="field ph-hide username-field ">
- <label for="TPL_username_1"> <i class="iconfont" title="会员名"></i></label>
- <span class="ph-label">会员名/邮箱/手机号</span> <input type="text"
- name="TPL_username" id="TPL_username_1"
- class="login-text J_UserName"
- value=""
- maxlength="32" tabindex="1" aria-label="会员名/邮箱/手机号"/>
- </div>
- <div class="field pwd-field">
- <label id="password-label" for="TPL_password_1"><i
- class="icon iconfont" title="登录密码"></i></label>
- <span id="J_StandardPwd">
- <input type="password" name="TPL_password"
- id="TPL_password_1" class="login-text" maxlength="40" tabindex="2"
- autocomplete="off" aria-label="登录密码"/>
- </span>
- </div>
-
- <!-- use new slide checkcode -->
- <div id="nocaptcha" class="nc-container tb-login"></div>
-
-
- <div class="submit">
-
- <input id="J_NcoSig" name="ncoSig" type="hidden" />
- <input id="J_NcoSessionid" name="ncoSessionid" type="hidden" />
- <input id="J_NcoToken" type="hidden" name="ncoToken"
- value="673728b297214cbdff38da0432e905228c05650c" />
- <input id="J_NcoShow" type="hidden" name="slideCodeShow" value="false" />
-
- <input type="hidden" name="useMobile" value="false"/>
- <input type="hidden" id="J_lang" name="lang" value="zh_CN"/>
- <input type="hidden" name="loginsite" value="0"
- id="J_loginsite" /> <input type="hidden" name="newlogin" value="" />
-
- <input type="hidden" id="J_TPL_redirect_url" name="TPL_redirect_url"
- value="http://s.taobao.com/search?q=闆跺熀纭?鍏ラ棬瀛︿範Python" />
- <input type="hidden" id="J_From" name="from"
- value="tb" />
- <input type="hidden" name="fc"
- value="default" />
- <input type="hidden" id="J_CssStyle" name="style"
- value="default" />
- <input type="hidden" id="J_CssStyle2" name="css_style"
- value="" />
- <input type="hidden" name="keyLogin" value="false" />
- <input type="hidden" name="qrLogin" value="true" />
- <input type="hidden" name="newMini" value="false" />
- <input type="hidden" name="newMini2" value="false" />
-
- <input type="hidden" name="tid" />
- <input type="hidden" name="loginType" value="3" />
- <input type="hidden" name="minititle"
- value="" />
- <input type="hidden" name="minipara"
- value="" />
- <input type="hidden" name="pstrong" value="" />
-
- <input type="hidden" id="J_sign" name="sign"
- value="" />
- <input type="hidden" id="J_need_sign" name="need_sign"
- value="" />
-
- <input type="hidden" id="J_isIgnore" name="isIgnore"
- value="" />
- <input type="hidden" id="J_full_redirect" name="full_redirect"
- value="" />
- <!-- 子账号跳转方式 -->
- <input type="hidden" name="sub_jump"
- value="" />
-
- <input type="hidden" name="popid"
- value="" />
- <input type="hidden" name="callback" value="" />
-
- <input type="hidden" id="J_guf" name="guf"
- value="" />
- <input type="hidden" id="J_not_duplite_str" name="not_duplite_str"
- value="" />
- <input type="hidden" name="need_user_id"
- value="" />
-
- <input type="hidden" name="poy" />
-
- <input type="hidden" id="gvfdc" name="gvfdcname" value="" />
- <input type="hidden" name="gvfdcre"
- value="" />
-
- <input type="hidden" id="J_from_encoding" name="from_encoding"
- value="" />
-
- <input type="hidden" id="J_sub" name="sub"
- value="" />
-
- <input type="hidden" name="TPL_password_2" id="TPL_password_2" />
- <input type="hidden" id="J_PBK" value="9a39c3fefeadf3d194850ef3a1d707dfa7bec0609a60bfcc7fe4ce2c615908b9599c8911e800aff684f804413324dc6d9f982f437e95ad60327d221a00a2575324263477e4f6a15e3b56a315e0434266e092b2dd5a496d109cb15875256c73a2f0237c5332de28388693c643c8764f137e28e8220437f05b7659f58c4df94685" />
- <input type="hidden" name="loginASR" value="1" />
- <input type="hidden" name="loginASRSuc" value="0" />
-
- <input type="hidden" id="J_allp" name="allp"
- value="" />
- <input type="hidden" name="oslanguage" />
- <input type="hidden" name="sr" />
- <input type="hidden" name="osVer" />
- <input type="hidden" name="naviVer" />
- <input type="hidden" name="osACN" />
- <input type="hidden" name="osAV" />
- <input type="hidden" name="osPF" />
- <input type="hidden" name="miserHardInfo" id="M_hard_info"/>
- <input type="hidden" id="J_Appkey" name="appkey" value="00000000" />
- <input type="hidden" name="nickLoginLink" value="" />
- <input type="hidden" name="mobileLoginLink" value="https://login.taobao.com/member/login.jhtml?redirectURL=http://s.taobao.com/search?q=零基础入门学习Python&sort=sale-desc&useMobile=true" />
- <input type="hidden" name="showAssistantLink" value="" />
-
- <input id="J_UnitClose " value="true" type="hidden" />
- <button type="submit" class="J_Submit" tabindex="3" id="J_SubmitStatic" data-ing="正在登录...">登 录</button>
- </div>
-
- <ul class="entries">
- <li id="J_OtherLogin" class="other-login">
-
-
- <a tabindex="4" class="weibo-login"><i class="iconfont"></i>微博登录</a>
-
- <a tabindex="5" class="alipay-login"><i class="iconfont"></i>支付宝登录</a>
-
-
- </li>
- </ul>
-
-
- <div class="login-links">
-
-
- <a tabindex="6" class="forget-pwd" target="_blank">
- 忘记密码</a>
-
- <a target="_blank">忘记会员名</a>
-
- <a class="register"
- target="_blank" tabindex="7">免费注册</a>
-
- </div>
- </form>
- </div>
-
- <!--快速登录开始-->
- <div class="quick-form">
- <!--扫码登录开始-->
- <div class="qrcode-login" id="J_QRCodeLogin">
- <div class="login-tip" style="display:block;">
- <div class="poptip">
- <div class="poptip-arrow">
- <em></em>
- <span></span>
- </div>
- <div class="poptip-content">
- <i class="iconfont"></i>密码登录在这里</div>
- </div>
- </div>
- <div class="login-title">手机扫码,安全登录</div>
-
- <div class="qrcode-mod">
- <div class="qrcode-main">
- <div class="qrcode-img" id="J_QRCodeImg" style="opacity: 1;"></div>
- <div class="qrcode-help"></div>
- <div class="msg-err">
- <h6>二维码已失效</h6>
- <a href="javascript:;" class="refresh J_QRCodeRefresh">请点击刷新</a>
- </div>
- </div>
- <div class="qrcode-desc" data-spm="25847036">
- <i class="iconfont"></i>
-
- <p><font class="ft-gray">打开 </font><a target=_blank class="light-link">手机淘宝</a> | <a target=_blank class="light-link">手机天猫</a>
- <span class="ft-gray">扫一扫登录</span></p>
-
- </div>
-
- </div>
- <div class="qrcode-msg">
- <div class="msg-ok">
- <div class="msg-icon">
- <i class="iconfont icon-ok"></i>
- <i class="iconfont icon-phone"></i>
- </div>
- <h6>扫描成功!</h6>
- <p>请在手机上确认登录</p>
- <div class="link"><a href="#" class="light-link J_QRCodeRefresh">返回二维码登录</a></div>
- </div>
- </div>
- <div class="login-links">
- <a href="" class="forget-pwd J_Quick2Static" target="_blank">密码登录</a>
- <a class="register" target="_blank" >免费注册</a>
- </div>
- </div>
- <!--扫码登录结束-->
- <!--一键登录开始-->
- <!--一键登录结束-->
- </div>
- <!--快速登录结束-->
- </div>
- </div>
- <!--标准登录框结束-->
- </div>
- </div>
- </div>
- <!--登录脚本-->
- <script>
- window.loginConfig = {
- //验证码
- codeURL: "",
- codeHandle: "#J_StandardCode",
- codeImg: "#J_StandardCode_m",
- enableQRCode: true, // 是否允许二维码登录
- shownQRCode: true, // 是否自动显示二维码登录
- getQRCodeURL: "https://qrlogin.taobao.com/qrcodelogin/generateQRCode4Login.do",
- checkQRCodeURL: "https://qrlogin.taobao.com/qrcodelogin/qrcodeLoginCheck.do",
- intervalQRCode: 2000,
- enableLoginByKey:false,
- startLoginByKeyURL:"https://qrlogin.taobao.com/qrcodelogin/startLoginByKey.do",
- checkLoginByKeyURL:"https://qrlogin.taobao.com/qrcodelogin/loginByKey.do",
- enableMiserLogin:false,
- audioCodeURL: "",
- checkUserNameURL:"/member/request_nick_check.do?_input_charset=utf-8",
- disableQuickLogin:false,
- defaultView: "static"
- };
- </script>
- <!--login box end-->
- </div>
- <!-- .login-box END -->
-
- </div>
-
- <div class="footer" data-spm="1997523009">
-
-
- <div class="footer-hd">
- <p>
- <a >阿里巴巴集团</a>
- <b>|</b>
- <a >阿里巴巴国际站</a>
- <b>|</b>
- <a >阿里巴巴中国站</a>
- <b>|</b>
- <a >全球速卖通</a>
- <b>|</b>
- <a >淘宝网</a>
- <b>|</b>
- <a >天猫</a>
- <b>|</b>
- <a >聚划算</a>
- <b>|</b>
- <a >一淘</a>
- <b>|</b>
- <a >阿里妈妈</a>
- <b>|</b>
- <a >飞猪</a>
- <b>|</b>
- <a >虾米</a>
- <b>|</b>
- <a >阿里云计算</a>
- <b>|</b>
- <a >云OS</a>
- <b>|</b>
- <a >万网</a>
- <b>|</b>
- <a >支付宝</a>
- <b>|</b>
- <a >来往</a>
- </p>
- </div>
- <div class="footer-bd">
- <p>
- <a >关于淘宝</a>
- <a >合作伙伴</a>
- <a >营销中心</a>
- <a href="mailto:lianzheng@taobao.com" title="lianzheng@taobao.com" target="_self">廉正举报</a>
- <a >联系客服</a>
- <a >开放平台</a>
- <a >诚征英才</a>
- <a >联系我们</a>
- <a >网站地图</a>
- <a >法律声明</a>
- <a >隐私权政策</a>
- <em>© 2018 Taobao.com 版权所有</em>
- </p>
- <p>
- <span>网络文化经营许可证:浙网文[2016]0132-032号</span>
- <b>|</b>
- <span>增值电信业务经营许可证:浙B2-20080224-1</span>
- <b>|</b>
- <span>信息网络传播视听节目许可证:1109364号</span>
- </p>
- </div>
- <div class="footer-ft">
- <a >
- <img width="36" height="36" src="https://assets.alicdn.com/apps/login/static/css/images/wj.gif">
- </a>
- <a class="mod mod-a"></a>
- <a class="mod mod-b"></a>
- <a class="mod mod-c"></a>
- <a class="mod mod-d"></a>
- <a class="mod mod-e"></a>
- </div>
- <!-- #footer END -->
- <div id="server-num">login011021081124.center.na62</div>
-
- </div>
-
- </div>
- </div>
- </div>
-
- <script type="text/javascript">
- TRLang = {
- ERROR_NICK_BLANK : '请填写账户名',
- ERROR_PASSWORD_BLANK : '请输入密码',
- ERROR_NICK_PASSWORD_BLANK : '请输入账户名和密码',
- ERROR_CHECKCODE_BLANK : '请输入验证码',
- TIPS_REMEMBER_PASSWORD : '记住密码',
- TIPS_NOT_REMEMBER_PSW_ON_PUBLIC : '不要在公共计算机记住密码,防止账户被盗'
- };
- </script>
-
- <script type="text/javascript" charset="utf-8" src="//g.alicdn.com/sd/ncpc/nc.js?t=2018112615"></script>
- <script>
- var UA_Opt = {};
- UA_Opt.ExTarget = ['TPL_password_1','TPL_password_2','J_Pwd1','J_PwdV'];
- UA_Opt.FormId = "J_Form";
-
- function initNC() {
- var nc = new noCaptcha();
- var opt = {
- renderTo : "nocaptcha",
- appkey : "CF_APP_TBLogin_PC",
- token : "673728b297214cbdff38da0432e905228c05650c",
- elementID : [ "TPL_username_1" ],
- trans : {"behaviorTraceId": "null"},
- is_Opt : 1,
- language : "zh_CN",
- isEnabled : true,
-
- customWidth: 'J_StaticForm',
- customFloatHeight: 420,
-
- times : 3,
- callback: function (data) {
- var S = KISSY;
- S.one("#J_NcoSig").val( data.sig);
- S.one("#J_NcoSessionid").val(data.csessionid);
- },
- error: function (s) {
- window.console && console.log("error");
- window.console && console.log(s);
- },
- is_tbLogin : true
- };
- nc.init(opt);
- }
- initNC();
-
-
-
-
- </script>
- <script>
- (function(S){
-
- S.getScript("https://g.alicdn.com/vip/login/0.5.65/js/??client.js,atp.js?t=20130528");
-
- S.use("event, node", function(){
- S.one(window).on("load", function(){
- eval(function(p,a,c,k,e,d){e=function(c){return c};if(!''.replace(/^/,String)){while(c--){d[c]=k[c]||c}k=[function(e){return d[e]}];e=function(){return'\\w+'};c=1};while(c--){if(k[c]){p=p.replace(new RegExp('\\b'+e(c)+'\\b','g'),k[c])}}return p}('0 1=5;0 2=4;0 3=1&2;6.9("8").7=3;',10,10,'var|a|b|c|110|10|document|value|gvfdc|getElementById'.split('|'),0,{}))
- });
- });
- })(KISSY);
- </script>
- <div id="_umfp"
- style="display: inline; width: 1px; height: 1px; overflow: hidden">
- </div>
- <script>
- (function (w, d, t) {
- var s = d.createElement(t), m = d.getElementsByTagName(t)[0];
- s.async = 1;
- s.src = "https://g.alicdn.com/sd/pointman/js/pt2.js?_=" + Math.floor((new Date()).getTime() / 36e5);
- m.parentNode.insertBefore(s, m);
-
- w._pointman_q = w._pointman_q || [];
- _pointman_q.push(["um", function (umx) {
- var container = document.getElementById("_umfp");
- umx.init({
- timeout: 3000,
- serviceLocation: 'cn',
- appName: 'taobao_login',
- formId: 'J_Form',
- containers:{flash:container ,dcp:container}
- });
- }]);
- })(window, document, "script");
- </script>
- <script src="https://g.alicdn.com/vip/login/0.5.65/js/login/nlogin.js?t=20151220"></script>
- </body>
- </html>
复制代码
这份代码10月10日那天就不行了。
需要把登录后的曲奇和会话,写到header中才行
|
|