|
|
马上注册,结交更多好友,享用更多功能^_^
您需要 登录 才可以下载或查看,没有账号?立即注册
x
本帖最后由 天冰 于 2016-8-29 10:40 编辑
- <table id="pid2646354" class="plhin" summary="pid2646354" cellspacing="0" cellpadding="0">
- <tbody><tr>
- <td class="pls" rowspan="2">
- <div id="favatar2646354" class="pls favatar">
- <div class="pi">
- <div class="authi"><a href="space-uid-247745.html" target="_blank" class="xw1">天冰</a>
- </div>
- </div>
- <div class="p_pop blk bui card_gender_0" id="userinfo2646354" style="display: none; margin-top: -11px;">
- <div class="m z">
- <div id="userinfo2646354_ma"></div>
- </div>
- <div class="i y">
- <div>
- <strong><a href="space-uid-247745.html" target="_blank" class="xi2">天冰</a></strong>
- <em>当前在线</em>
- </div><dl class="cl">
- <dt>UID</dt><dd><a href="?247745" target="_blank" class="xi2">247745</a></dd>
- <dt>日志</dt><dd><a href="home.php?mod=space&uid=247745&do=blog&view=me&from=space" target="_blank" class="xi2">0</a></dd>
- <dt>相册</dt><dd><a href="home.php?mod=space&uid=247745&do=album&view=me&from=space" target="_blank" class="xi2">0</a></dd>
- <dt>贡献</dt><dd>3 </dd>
- <dt>荣誉</dt><dd>19 </dd>
- <dt>技术值</dt><dd>0 </dd>
- </dl>
- <div class="imicn">
- <a href="home.php?mod=space&uid=247745&do=profile" target="_blank" title="查看详细资料"><img src="template/dreambred_c_apple/images/common//userinfo.gif" alt="查看详细资料"></a>
- </div>
- <div id="avatarfeed"><span id="threadsortswait"></span></div>
- </div>
- </div>
- <div>
- <div class="avatar" onmouseover="showauthor(this, 'userinfo2646354')"><a href="space-uid-247745.html" class="avtm" target="_blank"><img src="http://bbs.fishc.com/ucenter/avatar.php?uid=247745&size=middle"></a></div>
- </div>
- <p>签到天数: 6 天</p><p>[LV.2]偶尔看看I</p><div class="tns xg2"><table cellspacing="0" cellpadding="0"><tbody><tr><th><p><a href="home.php?mod=space&uid=247745&do=thread&type=thread&view=me&from=space" class="xi2">2</a></p>主题</th><th><p><a href="home.php?mod=space&uid=247745&do=thread&type=reply&view=me&from=space" class="xi2">21</a></p>帖子</th><td><p><a href="home.php?mod=space&uid=247745&do=profile" class="xi2">19</a></p>荣誉</td></tr></tbody></table></div>
- <p><em><a href="home.php?mod=spacecp&ac=usergroup&gid=10" target="_blank">新鱼友</a></em></p>
- <p><span id="g_up2646354" onmouseover="showMenu({'ctrlid':this.id, 'pos':'12!'});"><img src="template/dreambred_c_apple/images/common//star_level1.gif" alt="Rank: 1"></span></p>
- <div id="g_up2646354_menu" class="tip tip_4" style="display: none;"><div class="tip_horn"></div><div class="tip_c">新鱼友, 积分 24, 距离下一级还需 76 积分</div></div>
- <p><span class="pbg2" id="upgradeprogress_2646354" onmouseover="showMenu({'ctrlid':this.id, 'pos':'12!', 'menuid':'g_up2646354_menu'});"><span class="pbr2" style="width:24%;"></span></span></p>
- <div id="g_up2646354_menu" class="tip tip_4" style="display: none;"><div class="tip_horn"></div><div class="tip_c">新鱼友, 积分 24, 距离下一级还需 76 积分</div></div>
- <dl class="pil cl">
- <dt>积分</dt><dd><a href="home.php?mod=space&uid=247745&do=profile" target="_blank" class="xi2">24</a></dd>
- </dl>
- <style type="text/css">img{margin:2px;}</style>
- </div>
- </td>
-
- <td class="plc">
- <div class="pi">
- <strong>
- <a href="forum.php?mod=redirect&goto=findpost&ptid=75446&pid=2646354" id="postnum2646354" onclick="setCopy(this.href, '帖子地址复制成功');return false;">
- <em>6</em><sup>#</sup></a>
- </strong>
- <div class="pti">
- <div class="pdbt">
- </div>
- <div class="authi">
- <img class="authicn vm" id="authicon2646354" src="template/dreambred_c_apple/images/common//ico_lz.png">
- 楼主<span class="pipe">|</span>
- <em id="authorposton2646354">发表于 <span title="2016-8-27 12:02:25">6 小时前</span></em>
- <span class="pipe">|</span>
- <a href="forum.php?mod=viewthread&tid=75446&page=1&authorid=247745" rel="nofollow">只看该作者</a>
- </div>
- </div>
- </div><div class="pct"><div class="a_pt"><a target="_blank" style="font-size: 14px"><font color="#FF0000"><b><div>C语言辅导班,帮助有志青年!按月付费,减轻负担,仅需200元,穷人也能学!</div></b></font></a></div><div class="pcb">
- <div class="t_fsz">
- <table cellspacing="0" cellpadding="0"><tbody><tr><td class="t_f" id="postmessage_2646354">
- <div class="quote"><blockquote><font size="2"><a href="http://bbs.fishc.com/forum.php?mod=redirect&goto=findpost&pid=2646346&ptid=75446" target="_blank"><font color="#999999">hldh214 发表于 2016-8-27 11:37</font></a></font></blockquote></div><br>
- 是不是理解错误,我是想访问列表里的连接: item[0]<br>
- </td></tr></tbody></table>
- </div>
- <div id="comment_2646354" class="cm">
- </div>
- <div id="post_rate_div_2646354"></div>
- </div>
- </div>
- </td></tr>
- <tr><td class="plc plm">
- <div class="sign">如果您的【问题求助】得到满意的解答,请自行将分类修改为【已经解决】;如果想鼓励一下楼主或帮助到您的朋友,可以给他们【评分】鼓励;善用【论坛搜索】功能,那里可能有您想要的答案!</div>
- <div class="a_pb"><a href="http://bbs.fishc.com/thread-56921-1-1.html" target="_blank"><b><font color="#FF0000">【招生】15PB 软件安全培训开始接受第011期报名(2月28号开课)!</font></b></a></div></td>
- </tr>
- <tr id="_postposition2646354"></tr>
- <tr>
- <td class="pls"></td>
- <td class="plc" style="overflow:visible;">
- <div class="po hin">
- <span class="y">
- <label for="manage2646354">
- <input type="checkbox" id="manage2646354" class="pc" onclick="pidchecked(this);modclick(this, 2646354)" value="2646354" autocomplete="off">
- 管理
- </label>
- </span>
- <div class="pob cl">
- <em>
- <a class="fastre" href="forum.php?mod=post&action=reply&fid=173&tid=75446&repquote=2646354&extra=&page=1" onclick="showWindow('reply', this.href)">回复</a>
- <a class="editp" href="forum.php?mod=post&action=edit&fid=173&tid=75446&pid=2646354&page=1">编辑</a><a class="replyadd" href="forum.php?mod=misc&action=postreview&do=support&tid=75446&pid=2646354&hash=3a3b9b78" onclick="ajaxmenu(this, 3000, 1, 0, '43', '');return false;" onmouseover="this.title = ($('review_support_2646354').innerHTML ? $('review_support_2646354').innerHTML : 0) + ' 人 支持'">支持 <span id="review_support_2646354"></span></a>
- <a class="replysubtract" href="forum.php?mod=misc&action=postreview&do=against&tid=75446&pid=2646354&hash=3a3b9b78" onclick="ajaxmenu(this, 3000, 1, 0, '43', '');return false;" onmouseover="this.title = ($('review_against_2646354').innerHTML ? $('review_against_2646354').innerHTML : 0) + ' 人 反对'">反对 <span id="review_against_2646354"></span></a>
- </em>
- <p>
- </p>
- </div>
- </div>
- </td>
- </tr>
- <tr class="ad">
- <td class="pls">
- </td>
- <td class="plc">
- </td>
- </tr>
- </tbody></table>
复制代码
以上怎么用正则表达出来,即本人想提取出用户:天冰 某处贴子的所有回复,想用正则提取出来,但他的回复代码 如上,但不知道怎么用正则。或有没有更好的办法进行筛选。BeautifulSoup 或XPATH也可以,希望 有人指导一下谢
以上为论坛的回复提取出来的代码,想正则出一个贴子,指定人的所有回复信息,不知道怎么写正则,请教、。
即我想正则出:http://bbs.fishc.com/thread-75446-1-1.html
里面:天冰 回复的所有内容,其它人的过虑。
request=urllib2.Request(url)
response=urllib2.urlopen(request)
content = response.read().decode('utf-8')
然后就不知道怎么写正则了,正则好像没办法处理 空格,也不知道怎么条件判断本人回复贴。 |
|