Web crawler project: Heritrix (introduction)

Date: 2007-09-12  Source: luoxb

Introduction

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix, or misspelled or mispronounced as heratrix, heritix, heretix, or heratix) is an archaic word for heiress (a woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, the name seemed apt.

More info:

http://crawler.archive.org/
