文章详情

  • 游戏榜单
  • 软件榜单
关闭导航
热搜榜
热门下载
热门标签
php爱好者> php文档>web crawler project---Heritrix(introduction)

web crawler project---Heritrix(introduction)

时间:2007-09-12  来源:luoxb

Introduction

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.

more info

http://crawler.archive.org/

相关阅读 更多 +
排行榜 更多 +
星漫

星漫

浏览阅读 下载
百姓文化云

百姓文化云

生活实用 下载
万捷云

万捷云

商务办公 下载