Yahoo! JAPAN独自的爬虫
时间:2008-09-27 来源:GREED
Yahoo! JAPAN独自爬虫——2个可疑的User-Agent。
Y!J-DSC 1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html )
Y!J-BSC/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
Y!J-DSCIP :
211.14.8.254
ont211014008254.yahoo.co.jp
User-Agent :
Y!J-DSC 1.0; http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html
爬取对象:
Yahoo! JAPAN上“カテゴリ”(雅虎大全)登録的网站的首页
DSC的意思:
Directory Site Crawler?
Y!J-BSCIP :
203.141.52.34
?
User-Agent :
Y!J-BSC/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
爬取对象:
博客、Feed、特定网页
BSC的意思:
Blog Site Crawler?
Y!J-DSC 1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html )
Y!J-BSC/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
Y!J-DSCIP :
211.14.8.254
ont211014008254.yahoo.co.jp
User-Agent :
Y!J-DSC 1.0; http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html
爬取对象:
Yahoo! JAPAN上“カテゴリ”(雅虎大全)登録的网站的首页
DSC的意思:
Directory Site Crawler?
Y!J-BSCIP :
203.141.52.34
?
User-Agent :
Y!J-BSC/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
爬取对象:
博客、Feed、特定网页
BSC的意思:
Blog Site Crawler?
相关阅读 更多 +