https://github.com/sourcefrog/cargo-mutants
cargo-mutants is a mutation testing tool for Rust. It guides you to missing test coverage by finding functions whose implementation could be replaced by something trivial and the tests would all still pass.
CAUTION: This tool builds and runs code with arbitrary changes. If the test suite has side effects such as writing or deleting files, running it with mutations may be dangerous: for example it might write to or delete files outside of the source tree. Eventually, cargo-mutants might support running tests inside a jail or sandbox; for now think first about what side effects the test suite could possibly have and/or run it in a restricted or disposable environment.
This is inspired by the Decartes mutation-testing tool for Java described in https://increment.com/reliability/testing-beyond-coverage/. I think it's an interesting insight that mutation at the level of a whole function is a practical sweet-spot to discover missing tests, while (at least at moderate-size trees) still making it feasible to exhaustively generate every mutant.
cargo install cargo-mutants
Just run cargo mutants
in a Rust source directory, and it will point out
functions that may be inadequately tested:
% cargo mutants --dir ~/src/unix_mode/
baseline test with no mutations ... ok
replace type_bits with Default::default() in src/lib.rs:42:32 ... caught
replace is_file with Default::default() in src/lib.rs:52:35 ... caught
replace is_dir with Default::default() in src/lib.rs:62:34 ... caught
replace is_symlink with Default::default() in src/lib.rs:72:38 ... caught
replace is_fifo with Default::default() in src/lib.rs:77:35 ... caught
replace is_char_device with Default::default() in src/lib.rs:82:42 ... caught
replace is_block_device with Default::default() in src/lib.rs:87:43 ... NOT CAUGHT!
replace is_socket with Default::default() in src/lib.rs:92:37 ... caught
replace is_setuid with Default::default() in src/lib.rs:97:37 ... NOT CAUGHT!
replace is_setgid with Default::default() in src/lib.rs:102:37 ... NOT CAUGHT!
replace is_sticky with Default::default() in src/lib.rs:107:37 ... caught
replace to_string with Default::default() in src/lib.rs:130:39 ... caught
replace bitset with Default::default() in src/lib.rs:134:39 ... caught
replace permch with Default::default() in src/lib.rs:138:52 ... caught
replace file_mode with Default::default() in src/lib.rs:213:47 ... caught
The Cargo output is logged into target/mutants/
within the original source
directory, so you can see why individual tests failed.
deny
style lints such as unused parameters are likely to fail to
build when mutated, without really saying much about the value of the tests.
I suggest you don't statically deny warnings in your source code, but rather
set RUSTFLAGS
when you do want to check this.while
loop. cargo-mutants currently detects this but does
not kill the test process, so you'll need to find and kill it yourself. (On
Unix we might need to use setpgrp
.)The basic approach is:
cargo test
in the tree, saving output to a log file.The list of possible mutations is generated by:
#[mutants::skip]
attribute.#[test]
.#[cfg(test)]
or inside a mod
so marked.Default::default()
.Result
, instead replace the body with Ok(Default::default())
.The nice thing about Default
is that it's defined on many commonly-used types including ()
,
so cargo-mutants does not need to really understand the function return type at this early stage.
Some functions will fail to build because they return a type that does not implement Default
,
and that's OK.
The file is parsed using the syn
crate, but mutations
are applied textually, rather than to the token stream, so that unmutated code
retains its prior formatting, comments, line numbers, etc. This makes it
possible to show a text diff of the mutation and should make it easier to
understand any error messages from the build of the mutated code.
To make this faster on large trees, we could keep several scratch trees and test them in parallel, which is likely to exploit CPU resources more thoroughly than Cargo's own parallelism: in particular Cargo tends to fall down to a single task during linking.
One class of functions that have a reason to exist but may not cause a test suite failure if emptied out are those that exist for performance reasons, or more generally that have effects other than on the directly observable side effects of calling the function. For example, functions to do with managing caches or memory.
Ideally, these should be tested, but doing so in a way that's not flaky can be difficult. cargo-mutants can help in a few ways:
#[mutants::skip]
annotation can be added to suppress warnings and explain the decision.