Download List

项目描述

Ebot is a scalable and distribuited Web crawler. The URLs are saved to a NOSQL database (which supports map/reduce queries) that you can query via RESTful HTTP requests or using your preferred programming languages. The URLs that need to be analyzed are sent to AMQP queues. In this way, it is possible to run several crawlers in parallel and stop and start them without losing URLs.

系统要求

System requirement is not defined
Information regarding Project Releases and Project Resources. Note that the information here is a quote from Freecode.com page, and the downloads themselves may not be hosted on OSDN.

2010-12-30 13:03
0.9

一个更好的插件架构。保存图像URL和HTML页面标题的新插件。
A better plugin architecture. New plugins for saving image URLs and titles of HTML pages.

2010-12-18 18:59
0.8

此版本已更新至RabbitMQ的(2.2.0)和couchbeam最新版本。有些错误是固定的。
This release was updated to the latest releases of rabbitmq (2.2.0) and couchbeam. Some bugs were fixed.

2010-09-18 07:05
0.7

兼容性与Erlang R14A。测试与Debian测试。
Compatibility with Erlang R14A. Tested with Debian Testing.

2010-08-27 07:24
0.6

与最新的(发展)兼容性释放出需要的库和软件(RabbitMQ的,CouchDB的,webmachine,riak等)
Compatibility with the latest (development) releases of required libraries and software (rabbitmq, couchdb, webmachine, riak, etc.)

2010-06-20 23:18
0.5

为了更好的扩展性,采用新AMQP成功队列(ebot.new .* ebot.fetched .* ebot.completed .* ebot.refused .*)和履带式(ebot_web旧核心)现在是在两个不同的模块和并行分裂/进程(ebot_html和ebot_web)。
For better scalability, new amqp queues are used (ebot.new.* ebot.fetched.* ebot.completed.* ebot.refused.*) and the old core of the crawler (ebot_web) is now split in two different and parallel modules/processes (ebot_html and ebot_web).

Project Resources