鱼C论坛

 找回密码
 立即注册
查看: 2825|回复: 7

关于python 正则表达式匹配html标签的问题

[复制链接]
发表于 2016-11-9 20:15:03 | 显示全部楼层 |阅读模式

马上注册,结交更多好友,享用更多功能^_^

您需要 登录 才可以下载或查看,没有账号?立即注册

x
本帖最后由 牵着猫的老鼠 于 2016-11-14 08:58 编辑
  1. 'html = <input type="hidden" name="_xsrf" value="a50f76d70f703eb69b201b4fe9db94bd">'

  2. reg = re.findall(r'<input type="hidden" name="_xsrf" value="(\w+)">',html)
  3.     print(reg)
复制代码


然后打印出来是空的列表[]
我在其他语言试过能够匹配到(php  js )
请问出现什么问题了  ,并且我<input type="hidden" name="_xsrf" value="(\w+)">这个去掉两个<> 变成input type="hidden" name="_xsrf" value="(\w+)"  python  又能够匹配到了
小甲鱼最新课程 -> https://ilovefishc.com
回复

使用道具 举报

发表于 2016-11-10 08:33:46 | 显示全部楼层
我本地测试是可以找到的 Python3.4
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

 楼主| 发表于 2016-11-10 08:48:53 | 显示全部楼层
hldh214 发表于 2016-11-10 08:33
我本地测试是可以找到的 Python3.4

我试了下,单独这两行是没问题的,但是我读取的是整个网页 然后匹配,发现就匹配不了
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2016-11-10 09:13:20 | 显示全部楼层
这都是测试的小伙伴么
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2016-11-10 09:37:04 | 显示全部楼层
牵着猫的老鼠 发表于 2016-11-10 08:48
我试了下,单独这两行是没问题的,但是我读取的是整个网页 然后匹配,发现就匹配不了

把整个网页源代码贴上来
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

 楼主| 发表于 2016-11-11 11:13:13 | 显示全部楼层
hldh214 发表于 2016-11-10 09:37
把整个网页源代码贴上来
  1. <!DOCTYPE html>
  2. <html lang="zh-CN" class="">
  3. <head>
  4. <meta charset="utf-8">
  5. <meta name="apple-itunes-app" content="app-id=432274380">
  6. <meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1" />
  7. <meta name="renderer" content="webkit" />
  8. <meta name="description" content="一个真实的网络问答社区,帮助你寻找答案,分享知识。"/>
  9. <meta name="viewport" content="user-scalable=no, width=device-width, initial-scale=1.0, maximum-scale=1.0"/>
  10. <title>知乎 - 与世界分享你的知识、经验和见解</title>



  11. <link rel="apple-touch-icon"  sizes="152x152">
  12. <link rel="apple-touch-icon"  sizes="120x120">
  13. <link rel="apple-touch-icon"  sizes="76x76">
  14. <link rel="apple-touch-icon"  sizes="60x60">

  15. <link rel="shortcut icon"  type="image/x-icon" />
  16. <link rel="dns-prefetch" href="p1.zhimg.com"/>
  17. <link rel="dns-prefetch" href="p2.zhimg.com"/>
  18. <link rel="dns-prefetch" href="p3.zhimg.com"/>
  19. <link rel="dns-prefetch" href="p4.zhimg.com"/>
  20. <link rel="dns-prefetch" href="comet.zhihu.com"/>
  21. <link rel="dns-prefetch" href="static.zhihu.com"/>
  22. <link rel="dns-prefetch" href="upload.zhihu.com"/>
  23. <link rel="stylesheet" >
  24. <meta name="google-site-verification" content="FTeR0c8arOPKh8c5DYh_9uu98_zJbaWw53J-Sch9MTg" />
  25. <meta name="baidu-site-verification" content="KPFppAFoYF4Kkdv9" />
  26. <meta property="qc:admins" content="00544670776201056375" />
  27. <link rel="canonical"  />
  28. <meta id="znonce" name="znonce" content="53c572b88d2f453791f2010f4009680a">
  29. <!--[if lt IE 9]>
  30. <script src="http://static.zhihu.com/static/components/respond/dest/respond.min.js"></script>
  31. <link  id="respond-proxy" rel="respond-proxy" />
  32. <link href="/static/components/respond/cross-domain/respond.proxy.gif" id="respond-redirect" rel="respond-redirect" />
  33. <script src="/static/components/respond/cross-domain/respond.proxy.js"></script>
  34. <![endif]-->
  35. <script src="http://static.zhihu.com/static/revved/-/js/instant.14757a4a.js"></script>
  36. </head>
  37. <body class="zhi ">



  38. <div class="index-main">
  39. <div class="index-main-body">
  40. <div class="index-header">

  41. <h1 class="logo hide-text">知乎</h1>

  42. <h2 class="subtitle">与世界分享你的知识、经验和见解</h2>
  43. </div>

  44. <div class="desk-front sign-flow clearfix sign-flow-simple">

  45. <div class="index-tab-navs">
  46. <div class="navs-slider">
  47. <a href="#signup" class="active">注册</a>
  48. <a href="#signin">登录</a>
  49. <span class="navs-slider-bar"></span>
  50. </div>
  51. </div>


  52. <div class="view view-signin" data-za-module="SignInForm">
  53. <form method="POST">
  54. <input type="hidden" name="_xsrf" value="079a87ded8977d0601da2dd2da17eb09"/>
  55. <div class="group-inputs">

  56. <div class="email input-wrapper">

  57. <input type="text" name="account" aria-label="手机号或邮箱" placeholder="手机号或邮箱" required>
  58. </div>
  59. <div class="input-wrapper">
  60. <input type="password" name="password" aria-label="密码" placeholder="密码" required>
  61. </div>

  62. <div class="input-wrapper captcha-module" data-type="en" >
  63. <input id="captcha" name="captcha" placeholder="验证码" required data-rule-required="true" data-msg-required="请填写验证码">
  64. <div class="captcha-container">

  65. <img class="js-refreshCaptcha captcha" width="120" height="30" data-tooltip="s$t$看不清楚?换一张"  alt="验证码">
  66. </div>
  67. </div>

  68. </div>
  69. <div class="button-wrapper command">
  70. <button class="sign-button submit" type="submit">登录</button>
  71. </div>
  72. <div class="signin-misc-wrapper clearfix">
  73. <label class="remember-me">
  74. <input type="checkbox" name="remember_me" checked value="true"> 记住我
  75. </label>
  76. <a class="unable-login" href="#">无法登录?</a>
  77. </div>

  78. <div class="social-signup-wrapper" data-za-module="SNSSignIn">
  79. <span class="name js-toggle-sns-buttons">社交帐号登录</span>

  80. <div class="sns-buttons">
  81. <a title="微信登录" class="js-bindwechat" href="#"><i class="sprite-index-icon-wechat"></i></a>
  82. <a title="微博登录" class="js-bindweibo" href="#"><i class="sprite-index-icon-weibo"></i></a>
  83. <a title="QQ 登录" class="js-bindqq" href="#"><i class="sprite-index-icon-qq"></i></a>
  84. </div>


  85. </div>

  86. </form>

  87. <div class="QRCode">
  88. <button class="QRCode-toggleButton">
  89. <span class="sprite-global-icon-qrcode"></span>
  90. <span class="QRCode-toggleButtonText ">下载知乎 App</span>
  91. </button>
  92. <div class="QRCode-card">
  93. <div class="QRCode-image"></div>
  94. <div class="sprite-index-icon-arrow"></div>
  95. </div>
  96. </div>


  97. </div>
  98. <div class="view view-signup selected" data-za-module="SignUpForm">
  99. <form class="zu-side-login-box" action="/register/email" id="sign-form-1" autocomplete="off" method="POST">
  100. <input type="password" hidden>
  101. <input type="hidden" name="_xsrf" value="079a87ded8977d0601da2dd2da17eb09"/>
  102. <div class="group-inputs">


  103. <div class="name input-wrapper">
  104. <input required type="text" name="fullname" aria-label="姓名" placeholder="姓名">
  105. </div>
  106. <div class="email input-wrapper">

  107. <input required type="text" class="account" name="phone_num" aria-label="手机号(仅支持中国大陆)" placeholder="手机号(仅支持中国大陆)">

  108. </div>

  109. <div class="input-wrapper">
  110. <input required type="password" name="password" aria-label="密码" placeholder="密码(不少于 6 位)" autocomplete="off">
  111. </div>

  112. <div class="input-wrapper captcha-module" data-type="en" >
  113. <input id="captcha" name="captcha" placeholder="验证码" required data-rule-required="true" data-msg-required="请填写验证码">
  114. <div class="captcha-container">

  115. <img class="js-refreshCaptcha captcha" width="120" height="30" data-tooltip="s$t$看不清楚?换一张"  alt="验证码">
  116. </div>
  117. </div>

  118. </div>
  119. <div class="button-wrapper command">
  120. <button class="sign-button submit" type="submit">注册知乎</button>
  121. </div>
  122. </form>

  123. <p class="agreement-tip">点击「注册」按钮,即代表你同意<a href="/terms" target="_blank">《知乎协议》</a></p>

  124. <div class="QRCode">
  125. <button class="QRCode-toggleButton">
  126. <span class="sprite-global-icon-qrcode"></span>
  127. <span class="QRCode-toggleButtonText ">下载知乎 App</span>
  128. </button>
  129. <div class="QRCode-card">
  130. <div class="QRCode-image"></div>
  131. <div class="sprite-index-icon-arrow"></div>
  132. </div>
  133. </div>



  134. </div>
  135. </div>
  136. </div>

  137. </div>
  138. <div class="footer">

  139. <span>&copy; 2016 知乎</span>
  140. <span class="dot">·</span>
  141. <a target="_blank" href="/roundtable">知乎圆桌</a>
  142. <span class="dot">·</span>
  143. <a target="_blank" href="/explore" data-za-c="explore" data-za-a="visit_explore" data-za-l="home_bottom_explore">发现</a>
  144. <span class="dot">·</span>
  145. <a target="_blank" href="/app">移动应用</a>
  146. <span class="dot">·</span>


  147. <a href="/org/signin" class="footer-mobile-show">使用机构帐号登录</a>

  148. <span class="dot footer-mobile-show">·</span>

  149. <a href="/contact" class="footer-mobile-show">联系我们</a>
  150. <span class="dot">·</span>
  151. <a target="_blank" href="/careers">来知乎工作</a>
  152. <span class="dot">·</span>
  153. <a  target="_blank">京 ICP 备 13052560 号</a>
  154. <span class="dot">·</span>
  155. <span>京公网安备 11010802010035 号</span>

  156. </div>



  157. <script type="text/json" class="json-inline" data-name="disabled_components">["back_to_top"]</script>
  158. <script type="text/json" class="json-inline" data-name="current_user">["","","","-1","",0,0]</script>
  159. <script type="text/json" class="json-inline" data-name="env">["zhihu.com","comet.zhihu.com",false,null,false,false]</script>

  160. <script type="text/json" class="json-inline" data-name="ga_vars">{"user_created":0,"now":1478833907000,"abtest_mask":"------------------------------","user_attr":[0,0,0,"-","-"],"user_hash":0}</script>

  161. <script src="http://static.zhihu.com/static/revved/-/js/vendor.cb14a042.js"></script>
  162. <script src="http://static.zhihu.com/static/revved/-/js/closure/base.46ab974d.js"></script>

  163. <script src="http://static.zhihu.com/static/revved/-/js/closure/common.c4be407c.js"></script>
  164. <script src="http://static.zhihu.com/static/revved/-/js/closure/page-index.ed6159ab.js"></script>
  165. <meta name="entry" content="ZH.entrySignPage" data-module-id="page-index">


  166. <input type="hidden" name="_xsrf" value="079a87ded8977d0601da2dd2da17eb09"/>
  167. </body>
  168. </html>
复制代码
   然后有‘<input type="hidden" name="_xsrf" value="079a87ded8977d0601da2dd2da17eb09"/>’ 这么一段,你看看,谢谢
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2016-11-11 12:27:22 | 显示全部楼层
牵着猫的老鼠 发表于 2016-11-11 11:13
然后有‘’ 这么一段,你看看,谢谢

你的正则结尾掉了一个斜线
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

 楼主| 发表于 2016-11-13 19:06:49 | 显示全部楼层
hldh214 发表于 2016-11-11 12:27
你的正则结尾掉了一个斜线

粗心大意
小甲鱼最新课程 -> https://ilovefishc.com
回复 支持 反对

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

小黑屋|手机版|Archiver|鱼C工作室 ( 粤ICP备18085999号-1 | 粤公网安备 44051102000585号)

GMT+8, 2026-2-23 19:23

Powered by Discuz! X3.4

© 2001-2023 Discuz! Team.

快速回复 返回顶部 返回列表