The comparable
crate defines the trait [Comparable
], along with a derive
macro for auto-generating instances of this trait for most data types.
Primarily the purpose of this trait is to offer a method,
[Comparable::comparison
], by which two values of any type supporting that
trait can yield a summary of the differences between them.
Note that unlike other crates that do data differencing (primarily between
scalars and collections), comparable
has been written primarily with testing
in mind. That is, the purpose of generating such change descriptions is to
enable writing tests that assert the set of expected changes after some
operation between an initial state and the resulting state. This goal also
means that some types, like
HashMap
,
must be differenced after ordering the keys first, so that the set of changes
produced can be made deterministic and thus expressible as a test expectation.
To these ends, the function [assert_changes
] is also provided, taking two
values of the same type along with an expected "change description" as
returned by foo.comparison(&bar)
. This function uses the
pretty_assertions
crate under
the hood so that minute differences within deep structures can be easily seen
in the failure output.
If you want to get started quickly with the [Comparable
] crate to enhance unit
testing, do the following:
comparable
crate as a dependency, enabling features = ["derive"]
.Comparable
trait on as many structs and enums as needed.assert_changes
] between the initial state and the resulting state
to assert that whatever happened is exactly what you expected to happen.The main benefit of this approach over the usual method of "probing" the resulting state -- to ensure it changed as you expected it to-- is that it asserts against the exhaustive set of changes to ensure that no unintended side-effects occurred beyond what you expected to happen. In this way, it is both a positive and a negative test: checking for what you expect to see as well as what you don't expect to see.
The [Comparable
] trait has two associated types and two methods, one pair
corresponding to value descriptions and the other to value changes:
```rust pub trait Comparable { type Desc: std::cmp::PartialEq + std::fmt::Debug; fn describe(&self) -> Self::Desc;
type Change: std::cmp::PartialEq + std::fmt::Debug;
fn comparison(&self, other: &Self) -> comparable::Changed<Self::Change>;
} ```
Comparable::Desc
] associated typeValue descriptions (the [Comparable::Desc
] associated type) are needed
because value hierarchies can involve many types. Perhaps some of these types
implement PartialEq
and Debug
, but not all. To work around this
limitation, the [Comparable
] derive macro creates a "mirror" of your data
structure with all the same constructors ands field, but using the
[Comparable::Desc
] associated type for each of its contained types.
```
struct Foo { bar: u32, baz: u32 } ```
This generates a description that mirrors the original type, but using type descriptions rather than the types themselves:
struct FooDesc {
bar: <u32 as comparable::Comparable>::Desc,
baz: <u32 as comparable::Comparable>::Desc
}
You may also choose an alternate description type, such as a reduced form of a
value or some other type entirely. For example, complex structures could
describe themselves by the set of changes they represent from a Default
value. This is so common, that it's supported via a compare_default
macro
attribute provided by comparable
:
```
struct Foo { /* ...lots of fields... */ }
impl Default for Foo { fn default() -> Self { Foo {} } } ```
For scalars, the [Comparable::Desc
] type is the same as the type it's
describing, and these are called "self-describing".
There are other macro attributes provided for customizing things even further, which are covered below, beginning at the section on Structures.
Comparable::Change
] associated typeWhen two values of a type differ, this difference gets represented using the
associated type [Comparable::Change
]. Such values are produced by the
[Comparable::comparison
] method, which actually returns Changed<Change>
since the result may be either Changed::Unchanged
or
Changed::Changed(_changes_)
.[^option]
[^option] Changed
is just a different flavor of the Option
type, created
to make changesets clearer than just seeing Some
in various places.
The primary purpose of a [Comparable::Change
] value is to compare it to a
set of changes you expected to see, so design choices have been made to
optimize for clarity and printing rather than, say, the ability to transform
one value into another by applying a changeset. This is entirely possible give
a dataset and a change description, but no work has been done to achieve this
goal.
How changes are represented can differ greatly between scalars, collections, structs and enums, so more detail is given below in the section discussing each of these types.
[Comparable
] traits have been implemented for all of the basic scalar types.
These are self-describing, and use a [Comparable::Change
] structure named
after the type that holds the previous and changed values. For example, the
following assertions hold:
```
assertchanges(&100, &100, Changed::Unchanged); assertchanges(&100, &200, Changed::Changed(I32Change(100, 200))); assertchanges(&true, &false, Changed::Changed(BoolChange(true, false))); assertchanges( &"foo", &"bar", Changed::Changed(StringChange("foo".tostring(), "bar".tostring())), ); ```
The set collections for which [Comparable
] has been implemented are: Vec
,
HashSet
, and BTreeSet
.
The Vec
uses Vec<VecChange>
to report all of the indices at which changes
happened. Note that it cannot detect insertions in the middle, and so will
likely report every item as changed from there until the end of the vector, at
which point it will report an added member.
HashSet
and BTreeSet
types both report changes the same way, using the
SetChange
type. Note that in order for HashSet
change results to be
deterministic, the values in a HashSet
must support the Ord
trait so they
can be sorted prior to comparison. Sets cannot tell when specific members have
change, and so only report changes in terms of SetChange::Added
and
SetChange::Removed
.
Here are a few examples, taken from the comparable_test
test suite:
```
// Vectors assertchanges( &vec![1 as i32, 2], &vec![1 as i32, 2, 3], Changed::Changed(vec![VecChange::Added(2, 3)]), ); assertchanges( &vec![1 as i32, 3], &vec![1 as i32, 2, 3], Changed::Changed(vec![ VecChange::Changed(1, I32Change(3, 2)), VecChange::Added(2, 3), ]), ); assertchanges( &vec![1 as i32, 2, 3], &vec![1 as i32, 3], Changed::Changed(vec![ VecChange::Changed(1, I32Change(2, 3)), VecChange::Removed(2, 3), ]), ); assertchanges( &vec![1 as i32, 2, 3], &vec![1 as i32, 4, 3], Changed::Changed(vec![VecChange::Changed(1, I32Change(2, 4))]), );
// Sets assertchanges( &HashSet::from(vec![1 as i32, 2].intoiter().collect()), &HashSet::from(vec![1 as i32, 2, 3].intoiter().collect()), Changed::Changed(vec![SetChange::Added(3)]), ); assertchanges( &HashSet::from(vec![1 as i32, 3].intoiter().collect()), &HashSet::from(vec![1 as i32, 2, 3].intoiter().collect()), Changed::Changed(vec![SetChange::Added(2)]), ); assertchanges( &HashSet::from(vec![1 as i32, 2, 3].intoiter().collect()), &HashSet::from(vec![1 as i32, 3].intoiter().collect()), Changed::Changed(vec![SetChange::Removed(2)]), ); assertchanges( &HashSet::from(vec![1 as i32, 2, 3].intoiter().collect()), &HashSet::from(vec![1 as i32, 4, 3].intoiter().collect()), Changed::Changed(vec![SetChange::Added(4), SetChange::Removed(2)]), ); ```
Note that if the first VecChange::Change
above had used an index of 1
instead of 0, the resulting failure would look something like this:
```text running 1 test test testcomparablebar ... FAILED
failures:
---- testcomparablebar stdout ----
thread 'testcomparablebar' panicked at 'assertion failed: (left == right)
Diff < left / right > : Changed( [ Change( < 1,
0, I32Change( 100, 200, ), ), ],
)
', /Users/johnw/src/comparable/comparable/src/lib.rs:19:5
note: run with RUST_BACKTRACE=1
environment variable to display a backtrace
failures: testcomparablebar ```
TODO: jww (2021-11-01): Need content here.
Differencing arbitrary structures was the original motive for creating
comparable
. This is made feasible using a [Comparable
] derive macro that
auto-generates code needed for such comparisons. The purpose of this section
is to explain how this macro works, and the various attribute macros that can
be used to guide the process. If all else fails, manual trait implementations
are always an alternative.
For the purpose of the following sub-sections, we consider the following structure:
```
struct Foo {
bar: u32,
baz: u32,
#[comparable_ignore]
quux: Box The first attribute macro you'll notice that can be applied to individual
fields is TODO: jww (2021-11-01): Allow the [ TODO: jww (2021-11-01): For each multi-field variant in an enum, generate
a helper [ TODO: jww (2021-11-01): Provide an attribute macro TODO: jww (2021-11-01): Provide an attribute macro
By default, deriving [ ``` struct FooDesc {
bar: This process can be influenced using several attribute macros. If the Note the following traits are required for self-describing types: If you want no description at all for a type, since you only care about how it
has changed and never want to report a description of the value in any other
context, then you can use ```ignore
type Desc = (); fn describe(&self) -> Self::Desc {
()
}
``` It is assumed that when this is appropriate, such values will never appear in
any change output, so consider a different approach if you see lots of units
turning up. You can have more control over description by specifying exactly the text that
should appear for the [ ```ignore struct Foo {
bar: u32,
baz: u32
}
``` The following is generated: ```ignore
type Desc = T; fn describe(&self) -> Self::Desc {
B
}
``` This also means that the expression argument passed to ``` struct Foo {}
``` This same approach could be used to represent large blobs of data by their
checksum hash, for example, or large data structures that you don't need to
ever display by their Merkle root hash. When the ```ignore impl comparable::Comparable for Foo {
type Desc = Self::Change; }
``` Note that changes for structures are always a vector, since this allows
changes to be reported separately for each field. More on this in the
following section. By default for structs, deriving [ ``` struct Foo {
bar: u32,
baz: u32
}
``` The [ ```ignore enum FooChange {
Bar( impl comparable::Comparable for Foo {
type Desc = FooDesc;
type Change = Vec Here is an abbreviated example of how this looks when asserting changes: If the field hasn't been changed it won't appear in the vector, and each field
appears at most once. The reason for taking this approach is that structures
with many, many fields can be represented by a small change set if most of the
other fields were left untouched. If a struct has no fields it can never change, and so only a unitary
[ If a struct has only one field, there is no reason to specify changes using a
vector, since either the struct is unchanged or just that one field has
changed. For this reason, singleton structs optimize away the vector and use
By default, the auto-generated [ Enumerations are handled quite differently from structures, for the main
reason that while a To unpack that a bit: By a product of fields, it is meant that a Meanwhile, an ``` enum MyEnum {
One(bool),
Two { two: Vec Here we see variant that has a variant with no fields ( Instead, the following gets generated, which can end up being a bit verbose,
but captures the full nature of any differences: Note that variants with singleton fields do not use [ If a enum has no variants it cannot be constructed, so both the
[ Unions cannot derive [#[comparable_ignore]
, which must be used if the type in question
cannot be compared for differences.Comparable::Desc
] and
[Comparable::Change
] suffixes to both be changed.Comparable::Change
] struct and set that variant's type for the
enum's [Comparable::Change
] to be Vec<Change>
.#[comparable_wrap]
that defines a wrapping type that can be used for comparison. When the field
is encountered during [Comparable::comparison
], construct a temporary value
using the wrapper and then call [Comparable::comparison
] on that.#[comparable_view(function)]
for defining synthetic properties that receive
&self
as an argument and return a type implementing [Comparable
] that can
be differenced.Deriving Comparable for structs: the Desc type
Comparable
] for a structure will create a "mirror" of
that structure, with all the same fields, but replacing every type T
with
<T as Comparable>::Desc
:use comparable::*;
self_describing
self_describing
attribute is used, the [Comparable::Desc
] type is
set to be the type itself, and the [Comparable::describe
] method return a
clone of the value.Clone
,
Debug
and PartialEq
.no_description
#[no_description]
. This sets the
[Comparable::Desc
] type to be unit, and the [Comparable::describe
] method
accordingly:describe_type
and describe_body
Comparable::Desc
] type and the body of the
[Comparable::describe
] function. Basically, for the following definition:use comparable_derive::*;
[derive(Comparable)]
[describe_type(T)]
[describe_body(B)]
describe_body
may
reference the self
parameter. Here is a real-world use case:use comparable_derive::*;
[cfg_attr(feature = "comparable",
derive(comparable::Comparable),
describe_type(String),
describe_body(self.to_string()))]
compare_default
#[compare_default]
attribute macro is used, the
[Comparable::Desc
] type is defined to be the same as the
[Comparable::Change
] type, with the [Comparable::describe
] method being
implemented as a comparison against the value of Default::default()
:use comparable::*;
fn describe(&self) -> Self::Desc {
Foo::default().comparison(self).unwrap_or_default()
}
type Change = Vec<FooChange>;
/* ... */
Deriving Comparable for structs: the Change type
Comparable
] creates an enum
with
variants for each field in the struct
, and it represents changes using a
vector of such values. This means that for the following definition:use comparable_derive::*;
[derive(Comparable)]
Comparable::Change
] type is defined to be Vec<FooChange>
, with
FooChange
as follows:[derive(PartialEq, Debug)]
ignore
assert_changes(
&initial_foo, &later_foo,
Changed::Changed(vec![
FooChange::Bar(...),
FooChange::Baz(...),
]));
Special case: Unit structs
Comparable::Desc
] type is generated.Special case: Singleton structs
type Change = [type]Change
in their [Comparable
] derivation, rather than
type Change = Vec<[type]Change>
as for multi-field structs.comparable_public
and comparable_private
Comparable::Desc
] and [Comparable::Change
]
types have the same visibility as their parent. This may not be appropriate,
however, if you want to keep the original data type private but allow
exporting of descriptions and change sets. To support this -- and the converse
-- you can use #[comparable_public]
and #[comparable_private]
to be
explicit about the visibility of these generated types.Enumerations
struct
is always a product of fields, an enum
can be
more than a sum of variants -- but also a sum of products.struct
is a
simple grouping of typed fields, where the same fields are available for
every value of such a structure.enum
is a sum, or choice, among variants, but some of these
variants can themselves contain groups of fields, as though there were an
unnamed structure embedded in the variant. Consider the following enum
,
which will be used for all the following examples:use comparable_derive::*;
[derive(Comparable)]
Three
), one with
unnamed fields (One
), and one with named fields like a usual structure
(Two
). The problem, though, is that these embedded structures are never
represented as independent types, so we can't define [Comparable
] for them
and just compute the differences between the enum arguments. Nor can we just
create a copy of the field type with a real name and generate [Comparable
]
for it, because not every value is copyable or clonable, and it gets very
tricky to auto-generate a new hierarchy built out fields with reference types
all the way down...ignore
enum MyEnumChange {
BothOne(<bool as comparable::Comparable>::Change),
BothTwo {
two: Changed<<Vec<bool> as comparable::Comparable>::Change>,
two_more: Changed<Baz as comparable::Comparable>::Change
},
BothThree,
Different(
<MyEnum as comparable::Comparable>::Desc,
<MyEnum as comparable::Comparable>::Desc
),
}
Comparable::Change
],
since that information is already reflected when the variant is reported as
having changed at all using, for example, BothOne
. In the case of BothTwo
,
each of the field types is wrapped in Changed
because it's possible that
either one or both of the fields may changed.Special case: Empty enums
Comparable::Desc
] or [Comparable::Change
] types are omitted and it is
always reported as unchanged.Unions
Comparable
] instances at the present time.