文章详情

  • 游戏榜单
  • 软件榜单
关闭导航
热搜榜
热门下载
热门标签
php爱好者> php文档>web crawler project---Heritrix(introduction)

web crawler project---Heritrix(introduction)

时间:2007-09-12  来源:luoxb

Introduction

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.

more info

http://crawler.archive.org/

相关阅读 更多 +
排行榜 更多 +
掌上皇御

掌上皇御

金融理财 下载
天翼校园

天翼校园

系统软件 下载
源新闻

源新闻

浏览阅读 下载