<tr><th class="th40" align="left" valign="middle" style="border-color:#000; border-width: 1px 0px 0px 1px; border-style: solid"><nobr>Entry</nobr></th>
<td class="td40" style="border-color:#000; border-width: 1px 1px 0px 1px; border-style: solid"><table border="0" cellpadding="0" cellspacing="0" width="555"><tr><td align="left"><code><nobr>K06492 KO <br>
</nobr></code></td><td align="right"><br></td></tr></table></td></tr>
<tr><th class="th41" align="left" valign="top" style="border-color:#000; border-width: 1px 0px 0px 1px; border-style: solid"><nobr>Name</nobr></th>
<td class="td41" style="border-color:#000; border-width: 1px 1px 0px 1px; border-style: solid"><div style="width:555px;overflow-x:auto;overflow-y:hidden"><div style="width:555px;overflow-x:auto;overflow-y:hidden">CD58, LFA3<br>
</div></div></td></tr>
How can I use a regular expression to extract Entry, K06492, Name, and CD58, LFA3 from the HTML above?
import re

html = '''
<tr><th class="th40" align="left" valign="middle" style="border-color:#000; border-width: 1px 0px 0px 1px; border-style: solid"><nobr>Entry</nobr></th>
<td class="td40" style="border-color:#000; border-width: 1px 1px 0px 1px; border-style: solid"><table border="0" cellpadding="0" cellspacing="0" width="555"><tr><td align="left"><code><nobr>K06492 KO <br>
</nobr></code></td><td align="right"><br></td></tr></table></td></tr>
<tr><th class="th41" align="left" valign="top" style="border-color:#000; border-width: 1px 0px 0px 1px; border-style: solid"><nobr>Name</nobr></th>
<td class="td41" style="border-color:#000; border-width: 1px 1px 0px 1px; border-style: solid"><div style="width:555px;overflow-x:auto;overflow-y:hidden"><div style="width:555px;overflow-x:auto;overflow-y:hidden">CD58, LFA3<br>
</div></div></td></tr>
'''

# Each row looks like <tr><th class="thNN">label</th><td class="tdNN">value</td></tr>.
# Group 1 captures the label; group 2 captures the first run of text inside the <td>.
ptn = re.compile(
    r'<tr>[^<>]*'
    r'<th[^>]+?class="th\d\d".*?>([a-z]+)<.*?</th>[^<>]*'
    r'<td[^>]+?class="td\d\d".*?>([^<>]+)<.*?</td>[^<>]*'
    r'</tr>',
    re.I | re.S,  # ignore case; let . match newlines
)
# Replace non-breaking spaces (the &nbsp; entity or U+00A0) with plain spaces.
html = html.replace('&nbsp;', ' ').replace('\xa0', ' ')
rslt = ptn.findall(html)
for x in rslt:
    print(x)
Output:
('Entry', 'K06492 KO ')
('Name', 'CD58, LFA3')
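The tuples above still carry the trailing KO label and whitespace, while the question asks for just K06492. Here is a minimal sketch of one way to tidy them up, assuming the entry ID is always the first whitespace-separated token in the Entry value. The helper name `clean_rows` is hypothetical and not part of the original answer.

```python
def clean_rows(rows):
    """Turn (label, value) tuples from findall() into a dict of tidy strings."""
    record = {}
    for label, value in rows:
        value = value.strip()
        if label == 'Entry' and value:
            # Assumption: the ID (e.g. K06492) comes first, followed by a
            # type label such as KO, so keep only the first token.
            value = value.split()[0]
        record[label] = value
    return record


rows = [('Entry', 'K06492 KO '), ('Name', 'CD58, LFA3')]
print(clean_rows(rows))  # {'Entry': 'K06492', 'Name': 'CD58, LFA3'}
```

Passing `rslt` from the code above in place of `rows` should give the same dict, as long as the page keeps the ID first in the Entry cell.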