甘肃农大附近酒店:xpath提取超链

来源:百度文库 编辑:查人人中国名人网 时间:2024/05/08 13:30:27
想问一下,我想从一下这个xml文件中利用xpath提取出从文件最后数倒数第二个href的属性值,请问该怎么写 <xsl:for-each select="//body/table/tbody/td='1'......>中的......?我很着急,不知哪个高手能帮助我?谢谢啦!
(以下是xml文件)
<html>
- <head>
<meta content="HTML Tidy, see www.w3.org" name="generator" />
<title>Patent Database Search Results: semiconductor in 1976 to present</title>
<meta content="text/html; charset=gb2312" http-equiv="Content-Type" />
<meta name="GENERATOR" content="MSHTML 6.00.2900.2802" />
</head>
<a name="top" />
- <table>
- <tbody>
- <tr>
- <td>
+ <a href="http://www.uspto.gov/patft/index.html">
<img border="0" src="查询页面.files/home.gif" alt="[Home]" />
</a>
</td>
</tr>
</tbody>
</table>
- <a href="http://ebiz1.uspto.gov/vision-service/ShoppingCart_P/ShowShoppingCart?backUrl1=http%3A//164.195.100.11/netacgi/nph-Parser?Sect1%3DPTO2%26Sect2%3DHITOFF%26u%3D%252Fnetahtml%252Fsearch-adv.htm%26r%3D0%26p%3D1%26f%3DS%26l%3D50%26Query%3Dsemiconductor%26d%3Dptxt&backLabel1=Back%20to%20Document%3A%20semiconductor">
<img border="0" align="middle" src="查询页面.files/cart.gif" alt="[View Shopping Cart]" />
</a>
</div>
- <p>
<i>Searching 1976 to present...</i>
<br />
</p>
- <b>
Results of Search in 1976 to present db for:
<br />
semiconductor
</b>
: 336420 patents.
<br />
- <i>
Hits
<strong>1</strong>
through
<strong>50</strong>
out of
<strong>336420</strong>
</i>
- <p>
<br />
</p>
- <form method="get" action="/netacgi/nph-Parser">
<input name="Sect1" value="PTO2" type="hidden" />
<br />
</form>
- <form method="get" action="/netacgi/nph-Parser">
<input name="Sect1" value="PTO2" type="hidden" />
<input name="r" value="0" type="hidden" />
</form>
<br />
- <form method="get" action="/netacgi/nph-Parser">
<input name="Sect1" value="PTO2" type="hidden" />
<input name="Query" value="semiconductor" size="50" />
</form>
- <table>
- <tbody>
- <tr>
<td />
<td>PAT. NO.</td>
<td />
<td>Title</td>
</tr>
- <tr>
<td valign="top">1</td>
- <td valign="top">
<a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/search-adv.htm&r=1&p=1&f=G&l=50&d=ptxt&S1=semiconductor&OS=semiconductor&RS=semiconductor">7,007,305</a>
</td>
- <td valign="baseline">
<img border="0" src="查询页面.files/ftext.gif" alt="Full-Text" />
</td>
- <td valign="top">
<a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/search-adv.htm&r=1&p=1&f=G&l=50&d=ptxt&S1=semiconductor&OS=semiconductor&RS=semiconductor">Repeater amplifier with signal firewall protection for power line carrier communication networks</a>
</td>
</tr>

这是XML嘛?
把你的的XML文件和你的解析XML的那个基类粘出来我看下