A question about extracting hyperlinks with XPath

Source: Baidu Wenku (百度文库) Editor: 中科新闻网 Time: 2024/04/25 23:22:49
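I need to extract some hyperlinks from an XML file using XPath. Specifically, I want the URL held in the href attribute of the second-to-last link (<a> element) in the XML file shown below. How should I write an XSL stylesheet to do this? I'm in a hurry and would appreciate any pointers from someone who knows this well. Thanks!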
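One way to check the selection logic before writing the stylesheet is a small Python sketch using only the standard library's `xml.etree.ElementTree`. The `SAMPLE` document, its `example.com` URLs, and the helper name `second_to_last_href` are hypothetical stand-ins, not taken from the file in the question. In XPath 1.0 (and therefore in an XSLT `select` attribute) the corresponding expression would be `(//a[@href])[last() - 1]/@href`; the parentheses matter, because without them `last()` is evaluated per parent element rather than over the whole document.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed stand-in for the page in the question.
SAMPLE = """<html>
  <body>
    <a name="top" />
    <a href="http://example.com/first">first</a>
    <table><tbody><tr>
      <td><a href="http://example.com/second">second</a></td>
      <td><a href="http://example.com/third">third</a></td>
    </tr></tbody></table>
  </body>
</html>"""


def second_to_last_href(xml_text):
    """Return the href of the second-to-last <a href=...>, or None if there are fewer than two."""
    root = ET.fromstring(xml_text)
    # .//a[@href] matches every descendant <a> that has an href attribute,
    # in document order; anchors such as <a name="top" /> are skipped.
    links = root.findall(".//a[@href]")
    if len(links) < 2:
        return None
    return links[-2].get("href")


print(second_to_last_href(SAMPLE))
```

This only works on well-formed input. The listing as pasted in this post would need cleaning first, since it carries "- " markers from a browser tree view and raw `&` characters inside URLs; parsing the original file rather than this copy is probably the safer route.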
<?xml version="1.0" encoding="UTF-8" ?>
<html>
<head>
<meta content="HTML Tidy, see www.w3.org" name="generator" />
<title>Patent Database Search Results: semiconductor in 1976 to present</title>
<meta content="text/html; charset=gb2312" http-equiv="Content-Type" />
<meta name="GENERATOR" content="MSHTML 6.00.2900.2802" />
</head>
<body bgcolor="#ffffff">
<div style="text-align: center">
<a name="top" />
<table>
<tbody>
<tr>
<td>
<a href="http://patft.uspto.gov/netahtml/search-bool.html">
<img border="0" src="查询页面.files/boolean.gif" alt="[Boolean Search]" />
</td>
</tr>
</tbody>
</table>
<a href="http://ebiz1.uspto.gov/vision-service/ShoppingCart_P/ShowShoppingCart?backUrl1=http%3A//164.195.100.11/netacgi/nph-Parser?Sect1%3DPTO2%26Sect2%3DHITOFF%26u%3D%252Fnetahtml%252Fsearch-adv.htm%26r%3D0%26p%3D1%26f%3DS%26l%3D50%26Query%3Dsemiconductor%26d%3Dptxt&backLabel1=Back%20to%20Document%3A%20semiconductor">
</a>
</div>
<p>
<i>Searching 1976 to present...</i>
<br />
</p>
<b>
Results of Search in 1976 to present db for:
<br />
semiconductor
</b>
: 336420 patents.
<br />
<i>
Hits
<strong>1</strong>
through
<strong>50</strong>
out of
<strong>336420</strong>
</i>
<p>
<br />
</p>
<form method="get" action="/netacgi/nph-Parser">
<input name="Sect1" value="PTO2" type="hidden" />
<br />
<br />
</form>
<form method="get" action="/netacgi/nph-Parser">
<input name="Sect1" value="PTO2" type="hidden" />
</form>
<br />
<br />
<form method="get" action="/netacgi/nph-Parser">
<input name="Sect1" value="PTO2" type="hidden" />
</form>
<table>
<tbody>
<tr>
<td />
<td>PAT. NO.</td>
<td />
<td>Title</td>
</tr>
<tr>
<td valign="top">1</td>
<td valign="top">
<a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/search-adv.htm&r=1&p=1&f=G&l=50&d=ptxt&S1=semiconductor&OS=semiconductor&RS=semiconductor">7,007,305</a>
</td>
<td valign="baseline">
<img border="0" src="查询页面.files/ftext.gif" alt="Full-Text" />
</td>
<td valign="top">
<a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/search-adv.htm&r=1&p=1&f=G&l=50&d=ptxt&S1=semiconductor&OS=semiconductor&RS=semiconductor">Repeater amplifier with signal firewall protection for power line carrier communication networks</a>
</td>
</tr>
<tr>