Provides simple access to the Archive.is Capturing Service. Archive any url and get the corresponding archive.is link in return.
The ArchiveClient
is build with hyper
and therefor uses futures for its services.
```rust extern crate archiveis; extern crate futures; extern crate tokio_core;
use archiveis::ArchiveClient; use futures::future::Future; use tokio_core::reactor::Core;
fn main() { let mut core = Core::new().unwrap();
let client = ArchiveClient::new(Some("archiveis (https://github.com/MattsSe/archiveis-rs)")); let url = "http://example.com/"; let capture = client.capture(url).andthen(|archived| { println!("targeted url: {}", archived.targeturl); println!("url of archived site: {}", archived.archivedurl); println!("archive.is submit token: {}", archived.submittoken); Ok(()) });
core.run(capture).unwrap(); } ```
archive.is uses a temporary token to validate a archive request.
The ArchiveClient
capture
function first obtains the token via a GET request.
The token is usually valid several minutes, and even if archive.is switches to a new token,the older ones are still valid. So if we need to archive multiple links, we can only need to obtain the token once and then invoke the capturing service directly with capture_with_token
for each url. This can be done using the future::join
functionality.
In the following case the designated join_all
function is used to get Future of a Vec<Archived>
.
An undesired sideeffect if the join_all
is that this returns an Error if any of the futures failed.
The Capturing service should work fine in most cases but if individual error handling is desired, the capturing futures can be wrapped inside another Result
. In an And_Then
we can handle those failures.
```rust extern crate archiveis; extern crate futures; extern crate tokio_core;
use archiveis::ArchiveClient; use futures::future::{joinall, Future}; use tokiocore::reactor::Core; fn main() { let mut core = Core::new().unwrap();
let client = ArchiveClient::new(Some("archiveis (https://github.com/MattsSe/archiveis-rs)"));
// the urls to capture
let urls = vec![
"http://example.com/",
"https://github.com/MattsSe/archiveis-rs",
"https://crates.io",
];
let capture = client
.get_unique_token()
.and_then(|token| {
let mut futures = Vec::new();
for u in urls.into_iter() {
// optionally wrap the capturing result in another Result, to handle the failures in the next step
futures.push(client.capture_with_token(u, &token).then(|x| Ok(x)));
}
join_all(futures)
}).and_then(|archives| {
let failures: Vec<_> = archives.iter().map(Result::as_ref).filter(Result::is_err).map(Result::unwrap_err).collect();
if failures.is_empty() {
println!("all links successfully archived.");
} else {
for err in failures {
if let archiveis::Error::MissingUrl(url) = err {
println!("Failed to archive url: {}", url);
}
}
}
Ok(())
});
core.run(capture).unwrap();
} ```
archiveis
also comes as commandline application:
Coming soon