
Web crawler project: Heritrix (introduction)

Date: 2007-09-12  Source: luoxb

Introduction

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix, or misspelled or mispronounced as heratrix/heritix/heretix/heratix) is an archaic word for heiress (a woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.

More info:

http://crawler.archive.org/
