Puppeteer for Rust. It looks a little something like this:
A high-level API to control headless Chrome or Chromium over the DevTools Protocol. It is the Rust equivalent of Puppeteer, a Node library maintained by the Chrome DevTools team.
It is not 100% feature compatible with Puppeteer, but there's enough here to satisfy most browser testing / web crawling use cases, and there are several 'advanced' features such as:
```rust use headless_chrome::{Browser, protocol::page::ScreenshotFormat};
fn browse_wikipedia() -> Result<(), failure::Error> { let browser = Browser::default()?;
let tab = browser.wait_for_initial_tab()?;
/// Navigate to wikipedia
tab.navigate_to("https://www.wikipedia.org")?;
/// Wait for network/javascript/dom to make the search-box available
/// and click it.
tab.wait_for_element("input#searchInput")?.click()?;
/// Type in a query and press `Enter`
tab.type_str("WebKit")?.press_key("Enter")?;
/// We should end up on the WebKit-page once navigated
tab.wait_for_element("#firstHeading")?;
assert!(tab.get_url().ends_with("WebKit"));
/// Take a screenshot of the entire browser window
let _jpeg_data = tab.capture_screenshot(
ScreenshotFormat::JPEG(Some(75)),
None,
true)?;
/// Take a screenshot of just the WebKit-Infobox
let _png_data = tab
.wait_for_element("#mw-content-text > div > table.infobox.vevent")?
.capture_screenshot(ScreenshotFormat::PNG)?;
Ok(())
}
assert!(browsewikipedia().isok()); ```
For fuller examples, take a look at tests/simple.rs
and examples/real_world.rs
.
The Chrome DevTools Protocol is huge. Currently, Puppeteer supports way more of it than we do. Some of the missing features include:
headless_chrome
, which has a synchronous API and is just implemented using plain old threads. Fantoccini has also been around longer and is more battle-tested. It doesn't support Chrome DevTools-specific functionality like JS Coverage.For debug output, set these environment variables before running cargo test
:
RUST_BACKTRACE=1 RUST_LOG=headless_chrome=trace
Starting with v0.2.0, we're trying to follow SemVar strictly.
If you get errors related to timeouts, you likely need to enable sandboxing either in the kernel or as a setuid sandbox. Puppeteer has some information about how to do that here
By default, headless_chrome
will download a compatible version of chrome to XDG_DATA_HOME
(or equivalent on Windows/Mac). This behaviour can be optionally turned off, and you can use the system version of chrome (assuming you have chrome installed) by disabling the default feature in your Cargo.toml
:
toml
[dependencies.headless_chrome]
default-features = false
window.alert
handlersPull requests and issues are most welcome, even if they're just experience reports. If you find anything frustrating or confusing, let me know!