cargo-mutants

https://github.com/sourcefrog/cargo-mutants

Tests crates.io Maturity: Beta

cargo-mutants is a mutation testing tool for Rust. It guides you to missing test coverage by finding functions whose implementation could be replaced by something trivial and the tests would all still pass.

Coverage measurements can be helpful, but they really tell you what code is reached by a test, and not whether the test really assert anything about the behavior of the code. Mutation tests give different, perhaps richer information, about whether the tests really check the code's behavior.

CAUTION: This tool builds and runs code with machine-generated modifications. If the code under test, or the test suite, has side effects such as writing or deleting files, running it with mutations may be dangerous. Think first about what side effects the test suite could possibly have, and/or run it in a restricted or disposable environment.

NOTE: cargo-mutants is still pretty new! It can find some interesting results, but because it has a very basic idea of which functions to mutate and how, it generates significant false positives and false negatives. The proof-of-concept is successful, though, and I think the results can be iteratively improved.

Install

cargo install cargo-mutants

Using cargo-mutants

Just run cargo mutants in a Rust source directory, and it will point out functions that may be inadequately tested:

% cargo mutants --dir ~/src/unix_mode/
baseline test with no mutations ... ok
replace type_bits with Default::default() in src/lib.rs:42:32 ... caught
replace is_file with Default::default() in src/lib.rs:52:35 ... caught
replace is_dir with Default::default() in src/lib.rs:62:34 ... caught
replace is_symlink with Default::default() in src/lib.rs:72:38 ... caught
replace is_fifo with Default::default() in src/lib.rs:77:35 ... caught
replace is_char_device with Default::default() in src/lib.rs:82:42 ... caught
replace is_block_device with Default::default() in src/lib.rs:87:43 ... NOT CAUGHT!
...

In this version of the unix_mode crate, the is_block_device function was indeed untested.

To see what mutants could be generated without running them, use --list. --list also supports a --json option to make the output more machine-readable, and a --diff option to show the replacement.

Understanding the results

If tests fail in a clean copy of the tree, there might be an (intermittent) failure in the source directory, or there might be some problem that stops them passing when run from a different location, such as a relative path in Cargo.toml. Fix this first.

Otherwise, cargo mutants generates every mutant it can and prints the result of trying each one:

Skipping functions

To mark functions so they are not mutated:

  1. Add a Cargo dependency on the mutants crate.

  2. Mark functions with #[mutants::skip].

The crate is tiny and the attribute has no effect on the compiled code. It only flags the function for cargo-mutants.

Exit codes

mutants.out

A mutants.out directory is created in the source directory. It contains:

Hangs and timeouts

Some mutations to the tree can cause the test suite to hang. For example, in this code, cargo-mutants might try changing should_stop to always return false:

rust while !should_stop() { // something }

cargo mutants automatically sets a timeout when running tests with mutations applied, and reports mutations that hit a timeout. The automatic timeout is the maximum of 5 seconds, or 3x the time to run tests with no mutations.

You can also set an explicit timout with the --timeout option. In this case the timeout is also applied to tests run with no mutation.

The timeout does not apply to cargo check or cargo build, only cargo test.

When a test times out, you can mark it with #[mutants::skip] so that future cargo mutants runs go faster.

Tips

Performance

Anything you can do to make the cargo build and cargo test suite faster will have a multiplicative effect on cargo mutants run time, and of course will also make normal development more pleasant. There's lots of good advice on the web.

In particular, on Linux, using the Mold linker can improve build times significantly: because cargo-mutants does many incremental builds, link time is important.

Hard-to-test cases

Some functions don't cause a test suite failure if emptied, but also cannot be removed. For example, functions to do with managing caches or that have other performance side effects.

Ideally, these should be tested, but doing so in a way that's not flaky can be difficult. cargo-mutants can help in a few ways:

Goals

cargo-mutants it easy to run on any Rust source tree, and will tell you something interesting about areas where bugs might be lurking or the tests might be insufficient.

Being easy to use means:

Interesting results mean:

Limitations, caveats, known bugs, and future enhancements

How it works

The basic approach is:

The nice thing about Default is that it's defined on many commonly-used types including (), so cargo-mutants does not need to really understand the function return type at this early stage. Some functions will fail to build because they return a type that does not implement Default, and that's OK.

The file is parsed using the syn crate, but mutations are applied textually, rather than to the token stream, so that unmutated code retains its prior formatting, comments, line numbers, etc. This makes it possible to show a text diff of the mutation and should make it easier to understand any error messages from the build of the mutated code.

For more details, see DESIGN.md.

Related work

cargo-mutants was inspired by reading about the Descartes mutation-testing tool for Java described in Increment magazine's testing issue.

It's an interesting insight that mutation at the level of a whole function is a practical sweet-spot to discover missing tests, while (at least at moderate-size trees) still making it feasible to exhaustively generate every mutant.

Mutagen

There's an existing Rust mutation testing tool called Mutagen.

Some differences are:

Stability

cargo-mutants is in alpha and behavior, output formats, command-line syntax, json output formats, etc, may change from one release to the next.