Maman is a Rust Web Crawler saving pages on Redis.
Pages are send to list <MAMAN_ENV>:queue:maman
using
Sidekiq job format:
json
{
"class": "Maman",
"jid": "b4a577edbccf1d805744efa9",
"retry": true,
"created_at": 1461789979, "enqueued_at": 1461789979,
"args": {
"document":"<html><body><a href='#' /><a href='/new' /></html>",
"headers": {"content-type": "text/html"},
"url": "http://example.net/"
}
}
~~~ cargo install maman ~~~
~~~ REDIS_URL="redis://127.0.0.1/" maman URL ~~~
The MIT License
Copyright (c) 2016 Laurent Arnoud laurent@spkdev.net