monolith build status on GNU/Linux monolith build status on macOS monolith build status on Windows

_____ ______________ __________ ___________________ ___ | \ / \ | | | | | | | \_/ __ \_| __ | | ___ ___ |__| | | | | | | | | | | | | | | |\ /| |__| _ |__| |____| | | | | __ | | | \___/ | | \ | | | | | | | |___| |__________| \_____________________| |___| |___| |___|

A data hoarder’s dream come true: bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive.

Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share.

If compared to saving websites with wget -mpk, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.


Installation

Using Cargo (cross-platform)

console cargo install monolith

Via Homebrew (macOS and GNU/Linux)

console brew install monolith

Via MacPorts (macOS)

console sudo port install monolith

Using Snapcraft (GNU/Linux)

console snap install monolith

Using FreeBSD packages (FreeBSD)

console pkg install monolith

Using FreeBSD ports (FreeBSD)

console cd /usr/ports/www/monolith/ make install clean

Using pkgsrc (NetBSD, OpenBSD, Haiku, etc)

console cd /usr/pkgsrc/www/monolith make install clean

Using containers

console docker build -t Y2Z/monolith . sudo install -b dist/run-in-container.sh /usr/local/bin/monolith

From source

Dependency: libssl

console git clone https://github.com/Y2Z/monolith.git cd monolith make install

Using pre-built binaries (Windows, ARM-based devices, etc)

Every release contains pre-built binaries for Windows, GNU/Linux, as well as platforms with non-standard CPU architecture.


Usage

console monolith https://lyrics.github.io/db/P/Portishead/Dummy/Roads/ -o portishead-roads-lyrics.html

console cat index.html | monolith -aIiFfcMv -b https://original.site/ - > result.html


Options


Whitelisting and blacklisting domains

Options -d and -B provide control over what domains can be used to retrieve assets from. E.g.:

console monolith -I -d example.com -d www.example.com https://example.com -o example-only.html

console monolith -I -B -d .googleusercontent.com -d googleanalytics.com -d .google.com https://example.com -o example-no-ads.html


Dynamic content

Monolith doesn't feature a JavaScript engine, hence websites that retrieve and display data after initial load may require usage of additional tools.

For example, Chromium (Chrome) can be used to act as a pre-processor for such pages:

console chromium --headless --incognito --dump-dom https://github.com | monolith - -I -b https://github.com -o github.html


Proxies

Please set https_proxy, http_proxy, and no_proxy environment variables.


Contributing

Please open an issue if something is wrong, that helps make this project better.


Related projects


License

To the extent possible under law, the author(s) have dedicated all copyright related and neighboring rights to this software to the public domain worldwide. This software is distributed without any warranty.


Keep in mind that monolith is not aware of your browser’s session