A flexible, simple to use, immutable, clone-efficient String
replacement for
Rust. It unifies literals, inlined, and heap allocated strings into a single
type.
Rust is great, but it's String
type is optimized as a mutable string
buffer, not for typical string use cases. Most string use cases don't
modify their contents, often need to copy strings around as if
they were cheap like integers, typically concatenate instead of modify, and
often end up being cloned with identical contents. Additionally, String
isn't able to wrap a string literal without additional allocation and copying.
Rust needs a new string type to unify usage of both literals and
allocated strings in these typical use cases. This crate creates a new string
type
that is optimized for those use cases, while retaining the usage simplicity of
String
.
This type is not inherently "better" than String
, but different. It
works best in 'typical' string use cases (immutability, concatenation, cheap
multi ownership) whereas String
works better in "string buffer" use cases
(mutability, string building, single ownership).
NOTE: The serde feature is optional and only included when specified.
toml
[dependencies]
flexstr = { version = "0.4", features = ["serde"] }
```rust use flexstr::{flex_fmt, FlexStr, IntoFlexStr, ToCase, ToFlexStr};
fn main() {
// Use an into
function to wrap a literal, no allocation or copying
let staticstr = "This will not allocate or copy".intoflexstr();
assert!(staticstr.is_static());
// Strings up to 22 bytes (on 64-bit) will be inlined automatically
// (demo only, use into
for literals as above)
let inlinestr = "inlined".toflexstr();
assert!(inlinestr.is_inlined());
// When a string is too long to be wrapped/inlined, it will heap allocate
// (demo only, use into
for literals as above)
let rcstr = "This is too long to be inlined".toflexstr();
assert!(rcstr.is_heap());
// You can efficiently create a new FlexStr
(without creating a String
)
// This is equivalent to the stdlib format!
macro
let inlinestr2 = flexfmt!("in{}", "lined");
assert!(inlinestr2.isinlined());
asserteq!(inlinestr, inline_str2);
// We can upper/lowercase strings without converting to a String
// This doesn't heap allocate
let inlinestr3: FlexStr = "INLINED".toasciilower();
assert!(inlinestr3.isinlined());
asserteq!(inlinestr, inlinestr3);
// Concatenation doesn't even copy if we can fit it in the inline string let inlinestr4 = inlinestr3 + "!!!"; assert!(inlinestr4.isinlined()); asserteq!(inlinestr4, "inlined!!!");
// Clone is almost free, and never allocates // (at most it is a ref count increment for heap allocated strings) let staticstr2 = staticstr.clone(); assert!(staticstr2.isstatic());
// Regardless of storage type, these all operate seamlessly together // and choose storage as required let heapstr2 = staticstr2 + &inlinestr; assert!(heapstr2.isheap()); asserteq!(heap_str2, "This will not allocate or copyinlined"); } ```
Internally, FlexStr
uses an enum with these variants:
Static
- A simple wrapper around a static string literal (&'static str
)Inlined
- An inlined string (no heap allocation for small strings)Heap
- A heap allocated (reference counted) stringThe type automatically chooses the best storage and allows you to use them interchangeably as a single string type.
String
(64-bit: 24 bytes)serde
serialization support (feature = "serde")std
)FlexStr
) and multi-thread safe
(AFlexStr
) optionsFlexStr
- regular usage
Heap
storage based on Rc
AFlexStr
- provides Send
/ Sync
for multi-threaded use
Heap
storage based on Arc
```rust use flexstr::IntoFlexStr;
fn main() { // From literal - no copying or allocation let world = "world!".intoflexstr();
println!("Hello {world}"); } ```
```rust use flexstr::{IntoAFlexStr, IntoFlexStr, ToFlexStr};
fn main() {
// From literal - no copying or allocation
// NOTE: to_flex_str
will copy, so use into_flex_str
for literals
let literal = "literal".intoflexstr();
// From borrowed string - Copied into inline string let owned = "inlined".tostring(); let strtoinlined = (&owned).toflex_str();
// From borrowed String - copied into str
wrapped in Rc
let owned = "A bit too long to be inlined!!!".tostring();
let strtowrapped = (&owned).toflex_str();
// From String - copied into inline string (String
storage released)
let inlined = "inlined".tostring().intoflex_str();
// From String - str
wrapped in Rc
(String
storage released)
let counted = "A bit too long to be inlined!!!".tostring().intoflex_str();
// * If you want a Send/Sync type you need AFlexStr
instead *
// From FlexStr wrapped literal - no copying or allocation let literal = literal.intoaflex_str();
// From FlexStr inlined string - no allocation let inlined = inlined.intoaflex_str();
// From FlexStr Rc
wrapped str
- copies into str
wrapped in Arc
let counted = counted.intoaflex_str();
}
```
This has always been a confusing situation in Rust, but it is easy with
FlexStr
since multi ownership is cheap.
```rust use flexstr::{IntoFlexStr, FlexStr};
struct MyStruct { s: FlexStr }
impl MyStruct { fn toownornottoown(s: &FlexStr) -> Self { let s = if s == "ownme" { // Since a wrapped literal, no copy or allocation s.clone() } else { // Wrapped literal - no copy or allocation "own_me".into() };
Self { s }
} }
fn main() { // Wrapped literals - no copy or allocation let s = "borrow me".intoflexstr(); let s2 = "own me".intoflexstr();
let struct1 = MyStruct::toownornottoown(&s); let struct2 = MyStruct::toownornottoown(&s2);
asserteq!(s2, struct1.s); asserteq!(s2, struct2.s); } ```
into()
and never copiedinto()
on a String
will result in an inline string (if
short) otherwise copied into a str
wrapped in Rc
/Arc
(which will allocate, copy, and then release original String
storage)into_flex_str()
and into_a_flex_str()
are equivalent to calling into()
on both literals and String
(they are present primarily for let
bindings so there is no need to declare a type)to_flex_str()
and to_a_flex_str()
are meant for taking ownership of
borrowed strings and always copy into either an inline string (for short strings) or
an Rc
/Arc
wrapped str
(which will allocate)to_string
always copies into a new String
AFlexStr
and FlexStr
using into()
are cheap when using wrapped literals or inlined strings
Rc
or Arc
Heap creates are fairly expensive still compared to String
, Rc<str>
and
Arc<str>
, but inline/static creation is very fast as expected.
create_static_normal time: [3.7062 ns 3.7213 ns 3.7422 ns]
create_inline_small time: [3.8932 ns 3.9004 ns 3.9084 ns]
create_heap_normal time: [13.533 ns 13.557 ns 13.587 ns]
create_heap_large time: [18.605 ns 18.635 ns 18.664 ns]
create_heap_arc_normal time: [18.535 ns 18.551 ns 18.568 ns]
create_heap_arc_large time: [26.794 ns 26.861 ns 26.937 ns]
create_string_small time: [7.4377 ns 7.4572 ns 7.4794 ns]
create_string_normal time: [8.0550 ns 8.0605 ns 8.0667 ns]
create_string_large time: [12.940 ns 12.955 ns 12.973 ns]
create_rc_small time: [8.0525 ns 8.0577 ns 8.0639 ns]
create_rc_normal time: [8.2438 ns 8.2512 ns 8.2604 ns]
create_rc_large time: [13.139 ns 13.153 ns 13.168 ns]
create_arc_small time: [8.7128 ns 8.7231 ns 8.7341 ns]
create_arc_normal time: [8.7454 ns 8.7851 ns 8.8446 ns]
create_arc_large time: [13.827 ns 13.855 ns 13.886 ns]
Clones are MUCH cheaper than String
(except when using Arc
). Interested
to find out why the single branch op causes such a large differential between
the wrapped Rc<str>
/Arc<str>
and the raw version.
clone_static_normal time: [3.9540 ns 3.9572 ns 3.9610 ns]
clone_inline_small time: [4.4717 ns 4.4763 ns 4.4819 ns]
clone_heap_normal time: [4.4738 ns 4.4839 ns 4.4965 ns]
clone_heap_arc_normal time: [10.596 ns 10.607 ns 10.618 ns]
clone_string_small time: [11.774 ns 11.789 ns 11.807 ns]
clone_string_normal time: [12.289 ns 12.422 ns 12.540 ns]
clone_string_large time: [14.931 ns 15.013 ns 15.116 ns]
clone_rc_normal time: [652.97 ps 653.58 ps 654.30 ps]
clone_arc_normal time: [3.2948 ns 3.2986 ns 3.3021 ns]
There is no free lunch:
Rc
(or Arc
), when on-boarding String
it will need to
reallocate and copyFlexStr
is not Send
or Sync
, there is a need to consider
single-threaded (FlexStr
) and multi-threaded (AFlexStr
) use cases and
convert accordinglyThis is currently beta quality and still needs testing. The API may very possibly change but semantic versioning will be followed.
This project is licensed optionally under either: