![Latest version] ![CI] ![Docs]
crop is an implementation of a UTF-8 text rope, a data structure designed to be used in applications that need to handle frequent edits to arbitrarily large buffers, such as text editors.
crop's Rope
is backed by a B-tree,
ensuring that the time complexity of inserting, deleting or replacing a piece
of text is always logarithmic in the size of the Rope
.
crop places an extreme focus on performance: check out the benchmarks to see how it stacks up against similar projects.
Rope
s use thread-safe reference counting to share data between threads.
Cloning a Rope
takes up only 16 extra bytes of memory, and its copy-on-write
semantics allow the actual text contents to be duplicated only when different
clones diverge due to user edits.
This allows to send snapshots of a Rope
to background threads to perform any
IO or CPU-intensive computations, while the main thread is kept responsive and
always ready for the next batch of edits.
``rust
// A
Ropecan be created either directly from a string or incrementally
// using the
RopeBuilder`.
let mut builder = RopeBuilder::new();
builder .append("I am a 🦀\n") .append("Who walks the shore\n") .append("And pinches toes all day.\n") .append("\n") .append("If I were you\n") .append("I'd wear some 👟\n") .append("And not get in my way.\n");
let mut rope = builder.build();
// Rope
s can be sliced to obtain RopeSlice
s.
//
// A RopeSlice
is to a Rope
as a &str
is to a String
: the former in
// each pair are borrowed references of the latter.
// A Rope
can be sliced using either byte offsets:
let byteslice: RopeSlice = rope.byteslice(..32);
asserteq!(byteslice, "I am a 🦀\nWho walks the shore\n");
// or line offsets:
let lineslice: RopeSlice = rope.lineslice(..2);
asserteq!(lineslice, byte_slice);
// We can also get a RopeSlice
by asking the Rope
for a specific line
// index:
assert_eq!(rope.line(5), "I'd wear some 👟");
// We can modify that line by getting its start/end byte offsets:
let start: usize = rope.byteofline(5);
let end: usize = rope.byteofline(6);
// and replacing that byte range with some other text:
rope.replace(start..end, "I'd rock some 👠\n");
assert_eq!(rope.line(5), "I'd rock some 👠");
// Rope
s use Arc
s to share data between threads, so cloning them is
// extremely cheap.
let snapshot = rope.clone();
// This allows to save a Rope
to disk in a background thread while
// keeping the main thread responsive.
thread::spawn(move || { let mut file = BufWriter::new(File::create("mylittlepoem.txt").unwrap());
// The text content is stored in the leaves of the B-tree, where each
// chunk can store up to 1KB of data.
//
// We can iterate over the leaves using the `Chunks` iterator which
// yields the chunks of the `Rope` as string slices.
for chunk in snapshot.chunks() {
file.write_all(chunk.as_bytes()).unwrap();
}
}) .join() .unwrap(); ```
Check out the docs for a more in-depth overview of the crate.
A significant portion of crop's public API was inspired by the excellent Ropey crate (from which I also borrowed some test vectors).
Unlike crop, Ropey uses code points (char
s in Rust-speak) as its primary
indexing metric. If you'd prefer to work with char
offsets rather than
byte offsets Ropey might be a great alternative.