Summary

Introduce new combinators for Result, with perticular interest in recoverable error-handling in favor of static dispatch.

The new constructs are:

Motivation and overview

Error-handling in manner of static dispatch has some fundamental advantages:

This RFC discusses topics about error definition, must-have traits, error conversions and logging to conclude the general error-handling approach, in spirit of static dispatch and orthogonality.

Definition of recoverable error

In Rust, a recoverable error is a Result::Err. Neither panics nor exceptions (stack unwinding) are considered as recoverable error-handling.

From now on, "error" is used as the synonym of "recoverable error", and "the static approach" is short for "the proposed approach of error-handling in manner of static dispatch".

Traits that a plain error must have

The essence of plain error is to indicating branches for its caller, while composite errors do more, e.g. conversion and logging.

In general, a plain error should have no mandatory trait. In specific cases, errors may be expected to have certain traits. The static approach should allow plain errors with no traits, meanwhile be capable of adding traits as needed.

Library-provided error conversions

An error-handling library must provide error conversions in a systematical way. The static approach must provide such mechanism in favor of enum rather than trait object, because the latter will force users to implement mandatory traits.

An enum of errors for summarizing and converting is regarded as composite, while its variant types are plain errors.

To support enum conversions systematically, an EnumX proc-macro derive will be introduced and explained later.

Library-provided logging

The static approach must provide logging mechanisms, which

The example

Consider an artificial example. The task is to read 3 integer values a, b, and c from 3 different files and check if they satisfied the equation a * b == c.

To read integers from files, a read_u32() function is defined. It returns a u32 if everything is ok, or some file operations fails or parsing integer from text fails, resulting std::io::Error or std::num::ParseIntError.

To check the equation, an a_mul_b_eq_c() function is defined. It calls read_u32() and do the calculating, which introduces a new possible error type defined as struct MulOverflow( u32, u32 ) to indicate a multiplication overflow.

The three errors std::io::Error, std::num::ParseIntError and MulOverflow are considered as plain errors. To do error convertion, some composite errors should be defined with EnumX derived.

```rust

[derive( EnumX )]

enum ReadU32Error { IO( std::io::Error ), Parse( std::num::ParseIntError ), }

fn read_u32( filename: &'static str ) -> Result {/**/}

[derive( EnumX )]

enum AMulBEqCError { IO( std::io::Error ), Parse( std::num::ParseIntError ), Overflow( MulOverflow ), }

fn amulbeqc( filea: &'static str, fileb: &'static str, file_c: &'static str ) -> Result {/**/} ```

ReadU32Error and AMulBEqCError are regarded as composite errors.

EnumX derive

The #[derive( EnumX )] attributes will tell proc-macro to generate convertions for ReadU32Error and AmulBEqCError:

Error wrapper/combinator

```rust fn read_u32( filename: &'static str ) -> Result { use std::io::Read;

let mut f = std::fs::File::open( filename ).map_error()?;
let mut s = String::new();
f.read_to_string( &mut s ).map_error()?;
let number = s.trim().parse::<u32>().map_error()?;
Ok( number )

}

fn amulbeqc( filea: &'static str, fileb: &'static str, filec: &'static str ) -> Result { let a = readu32( filea ).maperror()?;

let b = match read_u32( file_b ) {
    Ok(  value ) => value,
    Err( err ) => {
        if a == 0 {
            0 // 0 * b == 0, no matter what b is.
        } else {
            return err.error();
        }
    },
};

let c = match read_u32( file_c ) {
    Ok(  value ) => value,
    Err( err   ) => match err {
        ReadU32Error::IO(    _ ) => 0, // default to 0 if file is missing.
        ReadU32Error::Parse( e ) => return e.error(),
    },
};

a.checked_mul( b )
 .ok_or( MulOverflow(a,b) )
 .map( |result| result == c )
 .map_error()

} ```

Logging

The static approach does not provide the ability to do backtracing via predefined methods of plain errors. The users are free to do so, as long as they want, using whatever traits they like.

Instead, it will ask users to change the definitions of the composite errors, to turn on library-supported backtrace/logging.

```rust

[derive( EnumX, Logger, Debug )]

enum ReadU32Error { IO( Log ), Parse( Log ), }

[derive( EnumX, Logger, Debug )]

enum AMulBEqCError { IO( Log ), Parse( Log ), Overflow( Log ), } ```

Note that a wrapper struct, Log and a derivable Logger trait are introduced. The enum with #[derive(Logger)] must use the same log items among all of its variants.

```rust

[derive( EnumX, Logger, Debug )]

fn read_u32( filename: &'static str ) -> Result { use std::io::Read;

let mut f = std::fs::File::open( filename )
    .map_err_to_log(frame!())
    .map_error()?;

let mut s = String::new();
f.read_to_string( &mut s )
    .map_err_to_log(frame!())
    .map_error()?;

let number = s.trim().parse::<u32>()
    .map_err_to_log(frame!())
    .map_error()?;

Ok( number )

}

fn amulbeqc( filea: &'static str, fileb: &'static str, filec: &'static str ) -> Result { let a = readu32( filea ).maperrlog(frame!()).maperror()?;

let b = match read_u32( file_b ) {
    Ok(  value ) => value,
    Err( err ) => {
        if a == 0 {
            0 // 0 * b == 0, no matter what b is.
        } else {
            return err.log(frame!()).error();
        }
    },
};

let c = match read_u32( file_c ) {
    Ok(  value ) => value,
    Err( err   ) => match err {
        ReadU32Error::IO(    _ ) => 0, // default to 0 if file is missing.
        ReadU32Error::Parse( e ) => return e.log(frame!()).error(),
    },
};

a.checked_mul( b )
 .ok_or( MulOverflow(a,b) )
 .map( |result| result == c )
 .map_err_to_log( frame!() )
 .map_error()

} ```

Log combinators

To summarize:

Log agent and items

A log agent can be an in-memory String, Vec, a file on disk, or any thing that the user is willing to use to collect log items. Both the log agent and its items are generialized and fully customizable.

```rust pub trait LogAgent { type Item;

fn new() -> Self;
fn create_log( item: Self::Item ) -> Self;
fn append_log( &mut self, item: Self::Item );

} ```

Users can implement this trait to customize the behaviours on to_log() using create_log(), and log() using append_log().

Note that log item is generalized and not restricted to strings. The static approach provides a typed logging system that has the power to support both common logging tasks and highly customized ones.

Some common log items could be predefined for convenience, e.g. A Frame to store the source of the error with file name, module path, line/column numbers, and an optional context info. A call similar to .log( frame!( "An unexpected {:?} was detect.", local_var )) is up to the job. The frame!() macro uses the same syntax with format!(), but results in a Frame struct rather than a string.

Users can switch between type aliases to choose different log agents or items.

rust type Log<E> = cex::Log<E,Vec<Frame>>; // Vec agent, Frame item

rust type Log<E> = cex::Log<E,Vec<String>>; // Vec agent, String item

rust type Log<E> = cex::Log<E,String>; // String agent, String item

rust type Log<E> = cex::Log<E,MyLogAgent>; // User-defined agent and item

Compile time opt-in logging

Using type aliases to turn on/off logging at compile time.

rust type Log<E> = cex::Log<E>; // do logging

rust type Log<E> = cex::NoLog<E>; // no logging

Runtime opt-in logging: Logging-level

The type alias to turn on logging-level support.

rust type Log<E> = cex::Log<E, Env<Vec<Frame>>>;

The logging-level is defined by an environment variable CEX_LOG_LEVEL. Client code providing a value no greater than it, is allowed to do logging.

rust let mut f = std::fs::File::open( filename ) .map_err_to_log(( LogLevel::Debug, frame!() )) .map_error()?;

Note that the item is a tuple, the first field of which is the level.

The ergonomic issue of extra annotations

To address the issue, users can tag the fn with an #[cex], with optional arguments.

The modified example:

```rust

[derive( EnumX, Logger, Debug )]

enum ReadU32Error { IO( Log ), Parse( Log ), }

[cex(to_log)]

fn read_u32( filename: &'static str ) -> Result { use std::io::Read;

let mut f = std::fs::File::open( filename )?;
let mut s = String::new();
f.read_to_string( &mut s )?;
let number = s.trim().parse::<u32>()?;
Ok( number )

}

[derive( Debug, PartialEq, Eq )]

struct MulOverflow( u32, u32 );

[derive( EnumX, Logger, Debug )]

enum AMulBEqCError { IO( Log ), Parse( Log ), Overflow( Log ), }

[cex(log)]

fn amulbeqc( filea: &'static str, fileb: &'static str, filec: &'static str ) -> Result { let a = readu32( file_a )?;

let b = match read_u32( file_b ) {
    Ok(  value ) => value,
    Err( err ) => {
        if a == 0 {
            0 // 0 * b == 0, no matter what b is.
        } else {
            return err.error();
        }
    },
};

let c = match read_u32( file_c ) {
    Ok(  value ) => value,
    Err( err   ) => match err {
        ReadU32Error::IO(    _ ) => 0, // default to 0 if file is missing.
        ReadU32Error::Parse( e ) => return e.error(),
    },
};

Ok( a.checked_mul( b )
    .ok_or( MulOverflow(a,b) )
    .map( |result| result == c )
    .map_err_to_log( frame!() ) // this is a plain error, needs `to_log`
? )

} ```

Example for nested #[cex]:

```rust use cex_derive::cex;

[derive( EnumX, Debug, PartialEq, Eq )]

enum CexErr { Code(i32), Text(&'static str), }

[cex]

fn misc() -> Result<(),CexErr> { fn bar() -> Result<(),i32> { Err(42)? } let _bar = || -> Result<(),i32> { Ok( bar()? )}; let _bar: Result<(),i32> = try { Err(42)? };

#[cex] fn _baz() -> Result<(),CexErr> { Err(42)? }
let _baz = #[cex] || -> Result<(),CexErr> { Ok( bar()? )};
let _baz: Result<(),CexErr> = #[cex] try { Err(42)? };

Err(42)?

}

assert_eq!( misc(), Err( CexErr::Code( 42 ))); ```

Example for log level support in #[cex]:

```rust type Log = super::Log>>;

[derive( EnumX, Logger, Debug, PartialEq, Eq )]

enum CexErr { Code( Log ), Text( Log<&'static str> ), }

[cex(to_log(( LogLevel::Debug, frame!() )))]

fn cexto_log() -> Result<(),CexErr> { Err(42)? }

[cex(log(( LogLevel::Info, frame!() )))]

fn cexlog() -> Result<(),CexErr> { Ok( cexto_log()? )} ```

Guidelines

Put plain errors in public API's composite Err type, as long as they are part of the design constraints. Changing them may cause compile errors in client code, which is guaranteed by type system. These errors are regarded as intrinsic errors of the API.

If the API author wants to add new errors in the future without breaking compatibility, he could tag the enum with #[non_exhausted].

To treat extrinsic errors as extrinsic, some trait object could be utilized to erase their types and treat them as one variant in the composite error. For example, if the API author want erase std::io::Error in the API, the function signatures can be written as:

```rust

[derive( EnumX, Debug )]

enum ReadU32Error { Parse( std::num::ParseIntError ), Other( cex::DynErr ), }

[cex]

fn read_u32( filename: &'static str ) -> Result { use std::io::Read;

let mut f = std::fs::File::open( filename ).map_dyn_err()?;

let mut s = String::new();
f.read_to_string( &mut s ).map_dyn_err()?;
let number = s.trim().parse::<u32>()?;
Ok( number )

} ```

The cex::DynErr is a wrapper of some trait object. Current library implementation utilizes failure::Error.

This is an example how the static approach, as the foundamental error-handling mechanism, is able to work with dynamic dispatch approaches as needed. If users had picked up the dynamic approach as the foundamental, it is not possible to adopt the static approach if needed.

Fallback to fat enum

A classic way of using enums in error-handling is "wrapping errors". It collects all the possible errors into a fat enum, as "the crate error", and every public API in the crate will return Result<T>, which is a type alias type Result<T> = Result<T,crate::Error>;.

The main advantage of the static approach over it, is expressing errors excactly in function signatures. However, if the users feel it is too weight to use the static approach, they can simply define a fat enum and use the result type alias again. The code can still be benefited from logging mechanisms provided by the static approach.

This is a proof that "wrapping errors" is a special case of the static approach.

Detailed design

Minimum structural enum

The EnumX derive is the key technology to support error conversions. It is essentially a library-implemented language feature: a minimum supported structural enum, aka enum exchange.

We use "minimum supported" because it does not support:

It is obvious that these are not possible to implement with enum unless type system changes.

To avoid overlapping impls in generialized plain errors, a phantom generic argument is utilized.

```rust pub trait IntoEnumx { fn into_enumx( self ) -> Dest; }

pub trait MapError where Self : Into> { fn maperror( self ) -> Result where Src : Sized + IntoEnumx { self.into().maperr( |src| src.into_enumx() ) } } ```

Neither std::convert::From nor std::ops::Try provides such phantom argument. It is the root cause that map_error() is required.

Interaction with other features

The static approach is the nature result of applying EnumX on Err, implemented in Result combinators. So it won't cause issues other than ergonomic, such as asyn issues, thread safety issues, etc.

Drawbacks

Rationale and alternatives

The static approach is best effort in separating mechanisms from policies. Any other error-handling approaches could be considered as special cases and utilized as fallback.

Prior art

The key idea of enum exchange, is inspired by "union types" in Typed Racket.

Its original implementation with generics support is referenced to frunk_core::coproduct.

Unresolved questions

Intrusive backtrace via traits implemented by plain errors are unresolved by design.

Future possibilities

Ok combinators(available in library implementation) may find the usecases in the future.

Fully language-supported "union types" will make the implementation trivial, and get rid of #[cex].

Changing in std::ops::Try traits may help getting rid of #[cex] too.

License

Licensed under MIT.