<p class="source">
<!--[if !IE]>收藏功能<![endif]-->
<span style="float: right; padding-right: 10px; color: #797f7f">
<a href="javascript:void(0)" onclick="saveFavorite(0,0,1139699,'Z',0,0)" class="shcang">收藏</a>
</span>
2011-12-14 8:27:00 大公报
</p>
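Could someone help me out? I need regular expressions that extract "2011-12-14 8:27:00 " and "大公报" separately from the markup above. How should I write them?
------Solution--------------------------------------------------------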
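// Timestamp: four-digit year, two "-N" groups, whitespace, then H:M:S.
var p1 = /\d{4}(\-\d+){2}\s+(\d+:){2}\d+/;
// Whitespace followed by a run of Chinese characters (the newspaper name).
// Note that the match in mth2[0] includes that leading whitespace.
var p2 = /\s+[\u4e00-\u9fa5]+/;
var ts = document.getElementsByTagName("p");
for (var i = 0; i < ts.length; i++) {
    var text = ts[i].innerText;
    var mth = p1.exec(text);
    if (mth != null)
        alert(mth[0]);
    var mth2 = p2.exec(text);
    if (mth2 != null)
        alert(mth2[0]);
}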
------Solution--------------------------------------------------------
- Python code
>>> s = """<p class="source"> <!--[if !IE]>收藏功能<![endif]--> <span style="float: right; padding-right: 10px; color: #797f7f"> <a href="javascript:void(0)" onclick="saveFavorite(0,0,1139699,'Z',0,0)" class="shcang">收藏</a> </span> 2011-12-14 8:27:00 大公报 </p>"""
>>> import re
>>> res = '\d{4}-\d{2}-\d{2} \d{1,2}:\d{1,2}:\d{1,2}'
>>> m = re.findall(res,s)
>>> m
['2011-12-14 8:27:00']
>>>
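If you want both pieces from a single pass, here is a rough sketch (written for Python 3, which is my assumption, not something the thread specifies) that uses one pattern with two capture groups. It assumes the newspaper name is a run of characters in the `\u4e00-\u9fa5` range that directly follows the timestamp; the variable names are mine.

```python
import re

# Sample markup from the question.
s = """<p class="source">
<!--[if !IE]>收藏功能<![endif]-->
<span style="float: right; padding-right: 10px; color: #797f7f">
<a href="javascript:void(0)" onclick="saveFavorite(0,0,1139699,'Z',0,0)" class="shcang">收藏</a>
</span>
2011-12-14 8:27:00 大公报
</p>"""

# Group 1: the timestamp. Group 2: the Chinese characters right after it.
# Assumption: the name contains only characters in \u4e00-\u9fa5.
pattern = re.compile(
    r'(\d{4}-\d{1,2}-\d{1,2}\s+\d{1,2}:\d{2}:\d{2})\s+([\u4e00-\u9fa5]+)'
)

match = pattern.search(s)
if match:
    timestamp, paper = match.groups()
    print(timestamp)
    print(paper)
```

Because the name is tied to the timestamp, the "收藏" link text earlier in the markup is not picked up by mistake.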
------Solution--------------------------------------------------------
Oh, and here is the newspaper name as well:
- Python code
>>> res2 = r'</span>(.*?)</p>'
>>> m2 = re.findall(res2,s,,re.S)
SyntaxError: invalid syntax
>>> m2 = re.findall(res2,s,re.S)
>>> print m2[0]
 2011-12-14 8:27:00 大公报 
>>>
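Note that `m2[0]` still holds the timestamp and the newspaper name together. One possible way to separate them, sketched for Python 3 under the assumption that the name contains no whitespace, is to strip the captured text and split once from the right. The helper name `split_source_line` and the abbreviated sample markup are hypothetical, made up for this illustration.

```python
import re

def split_source_line(html):
    """Return (timestamp, paper) from the text between </span> and </p>.

    Hypothetical helper; assumes the newspaper name has no whitespace.
    """
    found = re.search(r'</span>(.*?)</p>', html, re.S)
    if found is None:
        return None
    tail = found.group(1).strip()
    # Split once from the right: the last token is the newspaper name.
    parts = tail.rsplit(None, 1)
    if len(parts) != 2:
        return None
    return parts[0], parts[1]

# Abbreviated version of the markup from the question.
sample = ('<p class="source"><span><a class="shcang">收藏</a></span>\n'
          '2011-12-14 8:27:00 大公报\n</p>')

result = split_source_line(sample)
print(result)
```

Splitting from the right keeps the space inside the timestamp intact, so no second regular expression is needed.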