Previous/next-page collection rule for 东南网 (fjsen.com)
Posted: March 5, 2010, 3:21 PM Views: 530
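This one is somewhat tricky: every page URL has to be collected, and in the correct order. Example page: http://www.fjsen.com/o/2010-03/04/content_2875733.htm
Look at how the pagination is rendered. There are three styles, so we need to find what they have in common and extract the URLs based on that.
Here is the final result.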
XML/HTML code
- ████████████████████████████████████
- █This page contains multiple sub-pages:
- █1:http://www.fjsen.com/o/2010-03/04/content_2875733.htm
- █2:http://www.fjsen.com/o/2010-03/04/content_2875733_2.htm
- █3:http://www.fjsen.com/o/2010-03/04/content_2875733_3.htm
- █4:http://www.fjsen.com/o/2010-03/04/content_2875733_4.htm
- █5:http://www.fjsen.com/o/2010-03/04/content_2875733_5.htm
- █6:http://www.fjsen.com/o/2010-03/04/content_2875733_6.htm
- █7:http://www.fjsen.com/o/2010-03/04/content_2875733_7.htm
- █8:http://www.fjsen.com/o/2010-03/04/content_2875733_8.htm
- █9:http://www.fjsen.com/o/2010-03/04/content_2875733_9.htm
- █10:http://www.fjsen.com/o/2010-03/04/content_2875733_10.htm
- █11:http://www.fjsen.com/o/2010-03/04/content_2875733_11.htm
- █12:http://www.fjsen.com/o/2010-03/04/content_2875733_12.htm
- █13:http://www.fjsen.com/o/2010-03/04/content_2875733_13.htm
- █14:http://www.fjsen.com/o/2010-03/04/content_2875733_14.htm
- █15:http://www.fjsen.com/o/2010-03/04/content_2875733_15.htm
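- █For tags that must be matched on every sub-page (the content tag, for example), be sure to tick [Match this tag in pagination] in the tag editor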
- ████████████████████████████████████
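The idea described above (find what the pagination links have in common, then collect them in order) can be sketched outside the collector in Python. This is only an illustration, not the rule stored in the attached .ljob file. It assumes that every sub-page link reuses the first page's file name with an optional `_N` suffix, as the result list suggests. The function name `collect_page_urls` and the sample markup in the usage note are hypothetical.

```python
import re
from urllib.parse import urljoin


def collect_page_urls(html: str, first_page_url: str) -> list[str]:
    """Return the article's page URLs, ordered by page number.

    Assumption: sub-pages are named <stem>_<N>.<ext>, where <stem>.<ext>
    is the file name of the first page.
    """
    # Split ".../content_2875733.htm" into stem and extension.
    m = re.search(r"/([^/]+)\.(html?)$", first_page_url)
    if m is None:
        raise ValueError("unexpected first-page URL: " + first_page_url)
    stem, ext = m.group(1), m.group(2)

    # The common point of all pagination styles: an href that ends in
    # <stem>.<ext> or <stem>_<N>.<ext>, whether quoted or not,
    # relative or absolute.
    link_re = re.compile(
        r'href=["\']?([^"\'\s>]*' + re.escape(stem)
        + r'(?:_(\d+))?\.' + re.escape(ext) + r')',
        re.IGNORECASE,
    )

    pages = {1: first_page_url}  # page number -> absolute URL
    for match in link_re.finditer(html):
        href, number = match.group(1), match.group(2)
        page_no = int(number) if number else 1
        # Keep the first URL seen for each page number (deduplication).
        pages.setdefault(page_no, urljoin(first_page_url, href))

    # Sort numerically so that page 10 comes after page 9, not after page 1.
    return [pages[n] for n in sorted(pages)]
```

Calling `collect_page_urls(html, "http://www.fjsen.com/o/2010-03/04/content_2875733.htm")` on the source of the example page should yield the ordered list shown above, provided the naming assumption holds. Keying the dictionary by page number is what handles both requirements from the text: duplicates from the different pagination styles collapse into one entry, and the numeric sort restores the order.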
Attachment: 上下页采集.ljob (1.81 K, downloads: 67)
Related information
- Keywords: pagination, previous/next page
- Original link: http://www.caijibbs.com/show.php?id=134



