Salami not included
Paperoni is a web article downloader written in Rust. The downloaded articles are then exported as EPUB files.
This project is in an alpha release, so it might crash when you use it. Please open an issue on GitHub if it does.
Check the releases page for precompiled binaries. Currently there are only builds for Debian and Arch.
Paperoni is published on crates.io. If you have cargo installed, then run:
```sh
cargo install paperoni --version 0.3.0-alpha1
```
Paperoni is still in alpha, so the `--version` flag has to be passed.
This project uses `async/.await`, so it must be compiled with at least Rust 1.39, the release that stabilized that syntax. Preferably use the latest version of Rust.
```sh
git clone https://github.com/hipstermojo/paperoni.git
cd paperoni
# Build and install Paperoni locally
cargo install --path .
# Or run it from within the project directory
cargo run -- # pass your url here
```
```sh
paperoni https://en.wikipedia.org/wiki/Pepperoni
```
Paperoni also supports passing multiple links as arguments.
```sh
paperoni https://en.wikipedia.org/wiki/Pepperoni https://en.wikipedia.org/wiki/Salami
```
Alternatively, if you are on a Unix-like OS, you can pipe the contents of a file of links to Paperoni with `xargs`:
```sh
cat links.txt | xargs paperoni
```
Links can also be read from a file using the `-f`/`--file` flag.
```sh
paperoni -f links.txt
```
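As an illustration of what reading links from a file involves, here is a minimal Rust sketch. It assumes the file holds one URL per line and that blank lines are ignored; the format Paperoni actually accepts is not specified here. The `parse_links` function is hypothetical and is not taken from Paperoni's source.

```rust
/// Parse the contents of a links file into a list of URLs.
/// Assumption: one URL per line; surrounding whitespace and blank lines are ignored.
/// This is an illustrative helper, not Paperoni's own code.
fn parse_links(contents: &str) -> Vec<String> {
    contents
        .lines()
        .map(str::trim)
        .filter(|line| !line.is_empty())
        .map(String::from)
        .collect()
}

fn main() {
    // Stand-in for the contents of a file such as links.txt.
    let contents = "https://en.wikipedia.org/wiki/Pepperoni\n\nhttps://en.wikipedia.org/wiki/Salami\n";
    for link in parse_links(contents) {
        println!("{}", link);
    }
}
```

Trimming each line keeps stray spaces or Windows line endings from ending up inside a URL.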
By default, Paperoni generates an EPUB file for each link. You can also merge multiple links into a single EPUB using the `--merge` flag and specifying the output file.
```sh
paperoni -f links.txt --merge out.epub
```
The URL passed to Paperoni is fetched and the returned HTML response is passed to the extractor. The extractor tries to identify the article content using a port of the Mozilla Readability algorithm. The extracted article is then saved as an EPUB.
The port of the algorithm is still unstable, so it does not yet work on every website that Readability itself can handle.
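The fetch, extract, and export stages described above can be sketched roughly as follows. Every function name here (`fetch`, `extract_article`, `export`) is a hypothetical stand-in, not Paperoni's code. In particular, the extraction step just takes the text between `<article>` tags, which is far simpler than Readability, and no network request or EPUB file is actually produced.

```rust
/// Stage 1 (stand-in): a real implementation would perform an HTTP GET here.
/// This sketch returns a fixed HTML document instead.
fn fetch(_url: &str) -> String {
    "<html><body><nav>Menu</nav><article><h1>Pepperoni</h1><p>A cured sausage.</p></article></body></html>"
        .to_string()
}

/// Stage 2 (stand-in): return the content between the first `<article>` and
/// the following `</article>`, or `None` if either tag is missing.
/// This is NOT the Readability algorithm, only a placeholder for it.
fn extract_article(html: &str) -> Option<&str> {
    let open = "<article>";
    let close = "</article>";
    let start = html.find(open)? + open.len();
    let end = start + html[start..].find(close)?;
    Some(html[start..end].trim())
}

/// Stage 3 (stand-in): a real implementation would package the article
/// into an EPUB file. This sketch only labels the extracted content.
fn export(article: &str) -> String {
    format!("[EPUB placeholder] {}", article)
}

fn main() {
    let html = fetch("https://en.wikipedia.org/wiki/Pepperoni");
    match extract_article(&html) {
        Some(article) => println!("{}", export(article)),
        None => eprintln!("no article found"),
    }
}
```

Returning an `Option` from the extraction step reflects the point that an article may not be found on every page, in which case there is nothing to export.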
This program is still in alpha so a number of things won't work: