php在线采集网页数据库 php采集系统( 三 ) _关键字

}
curl_close($ch);
$linkarr = _striplinks($html);
// 主机部分，补全用
$host = '';
if (is_array($linkarr)) {
foreach ($linkarr as $k = $v) {
$linkresult[$k] = _expandlinks($v, $host);
}
}
printf("p此页面的所有链接为：/ppre%s/pren", var_export($linkresult , true));
?
function.php内容如下（即为上两篇中两个函数的合集）：
?php
function _striplinks($document) {
preg_match_all("'s*as.*?hrefs*=s*(["'])?(?(1) (.*?)\1 | ([^s]+))'isx", $document, $links);
// catenate the non-empty matches from the conditional subpattern
while (list($key, $val) = each($links[2])) {
if (!empty($val))
$match[] = $val;
} while (list($key, $val) = each($links[3])) {
if (!empty($val))
$match[] = $val;
}
// return the links
return $match;
}
/*===================================================================*
Function: _expandlinks
Purpose: expand each link into a fully qualified URL
Input:$linksthe links to qualify
$URIthe full URI to get the base from
Output:$expandedLinks the expanded links
*===================================================================*/
function _expandlinks($links,$URI)
{
$URI_PARTS = parse_url($URI);
$host = $URI_PARTS["host"];
preg_match("/^[^?]+/",$URI,$match);
$match = preg_replace("|/[^/.]+.[^/.]+$|","",$match[0]);
$match = preg_replace("|/$|","",$match);
$match_part = parse_url($match);
$match_root =
$match_part["scheme"]."://".$match_part["host"];
$search = array("|^http://".preg_quote($host)."|i",
"|^(/)|i",
"|^(?!http://)(?!mailto:)|i",
"|/./|",
"|/[^/]+/../|"
);
$replace = array( "",
$match_root."/",
$match."/",
"/",
"/"
);
$expandedLinks = preg_replace($search,$replace,$links);
return $expandedLinks;
}
?
php在线采集网页数据库的介绍就聊到这里吧，感谢你花时间阅读本站内容，更多关于php采集系统、php在线采集网页数据库的信息别忘了在本站进行查找喔。

php在线采集网页数据库 php采集系统( 三 )

推荐阅读

盐水加白醋泡脚的好处舒服每一晚

床单起球是质量问题吗

马上消费金融额度是多少

上环后图片避孕环是什么样子图片

SpringBoot配置并使用Redis缓存服务

1秒钟等于几毫秒一秒等于多少毫秒多少微秒

罗汉松有哪些品种如何区分

中年之殇

电脑里编程的软件有哪些，目前计算机编程的常用软件有什么

网易怒斥暴雪:离婚不离身暴雪是不是网易的

cfree里面怎么一步一步分析结果

山东良法领导干部知识竞赛题目是什么？领导干部知识题库大全

95和98的暗语是什么意思 98是什么

vxlan 格式分析

如何在手机上连接黑魂2服务器？黑魂2服务器怎么用手机

医生竟会对孕检女子做出这事，女子怀孕去医院检查

崩坏3新版本新增内容汇总 3.4版本相关调整内容前瞻

中国民俗的“鬼节”有哪些？中元节和清明节有什么区分？

张裕干红葡萄酒口感怎么样张裕特选级干红葡萄酒怎么样

看财报|苏酒老二今世缘百亿冲刺第一战：7%的省外营收如何撬动全国化市场？｜看财报