Moka is a fast, concurrent cache library for Rust. Moka is inspired by Caffeine (Java) and Ristretto (Go).
Moka provides a cache that supports full concurrency of retrievals and a high expected concurrency for updates. It also provides a segmented cache for increased concurrent update performance. These caches perform a best-effort bounding of a map using an entry replacement algorithm to determine which entries to evict when the capacity is exceeded.
Moka currently does not provide async
optimized caches. The synchronous (blocking)
caches in the current version can be safely used in async runtime such as Tokio or
async-std, but will not produce optimal performance under heavy updates. See
this example for more details. A near future version of Moka will
provide async
optimized caches in addition to the sync caches.
Cache entries are manually added using insert
method, and are stored in the cache
until either evicted or manually invalidated.
Here's an example that reads and updates a cache by using multiple threads:
```rust use moka::sync::Cache;
use std::thread;
fn value(n: usize) -> String { format!("value {}", n) }
fn main() { const NUMTHREADS: usize = 16; const NUMKEYSPERTHREAD: usize = 64;
// Create a cache that can store up to 10,000 elements.
let cache = Cache::new(10_000);
// Spawn threads and read and update the cache simultaneously.
let threads: Vec<_> = (0..NUM_THREADS)
.map(|i| {
// To share the same cache across the threads, clone it.
// This is a cheap operation.
let my_cache = cache.clone();
let start = i * NUM_KEYS_PER_THREAD;
let end = (i + 1) * NUM_KEYS_PER_THREAD;
thread::spawn(move || {
// Insert 64 elements. (NUM_KEYS_PER_THREAD = 64)
for key in start..end {
my_cache.insert(key, value(key));
// get() returns Option<String>, a clone of the stored value.
assert_eq!(my_cache.get(&key), Some(value(key)));
}
// Invalidate every 4 element of the inserted elements.
for key in (start..end).step_by(4) {
my_cache.invalidate(&key);
}
})
})
.collect();
// Wait for all threads to complete.
threads.into_iter().for_each(|t| t.join().expect("Failed"));
// Verify the result.
for key in 0..(NUM_THREADS * NUM_KEYS_PER_THREAD) {
if key % 4 == 0 {
assert_eq!(cache.get(&key), None);
} else {
assert_eq!(cache.get(&key), Some(value(key)));
}
}
} ```
get
return a clone of the stored valueNote that the return type of get
method is Option<V>
instead of Option<&V>
,
where V
is the value type. Every time get
is called for an existing key, it
creates a clone of the stored value V
and returns it.
Because of the nature of concurrent cache, get
cannot return Option<&V>
. A value
stored in a cache can be dropped or replaced at any time by any other thread
including cache's eviction thread. So it is impossible to create a &V
, a reference
to a value, and guarantee the value outlives the reference.
If you want to store values that will be expensive to clone, wrap them by
std::sync::Arc
before storing in a cache. Arc
is a thread-safe
reference counted pointer.
```rust,ignore use std::sync::Arc;
let key = ... let large_value = vec![0u8; 2 * 1024 * 1024]; // 2 MiB
// When insert, wrap the largevalue by Arc. cache.insert(key.clone(), Arc::new(largevalue));
// get() will call Arc::clone() on the store value, which is cheap. cache.get(&key); ```
Currently, Moka does not provide async
optimized caches. An update operation
(insert
or invalidate
methods) can be blocked for a short time under heavy
updates. They employ locks, mpsc channels and thread sleeps that are not aware of the
Future trait in std. While insert
or invalidate
can be safely
called in an async fn
or async
block, they will not produce optimal performance
as they may prevent async tasks from switching while acquiring a lock.
Some of the async runtime libraries such as Tokio and async-std provide APIs to
off-load a blocking operation to a dedicated thread pool. You may want to use them
when calling insert
or invalidate
although it is not required.
spawn-blocking
functionspawn-blocking
functionHere is a similar program to the previous example, but using Tokio runtime with
spawn-blocking
:
```rust // Cargo.toml // // [dependencies] // tokio = { version = "1.1", features = ["rt-multi-thread", "macros" ] }
use moka::sync::Cache;
use tokio::task;
async fn main() { const NUMTASKS: usize = 16; const NUMKEYSPERTASK: usize = 64;
fn value(n: usize) -> String {
format!("value {}", n)
}
// Create a cache that can store up to 10,000 elements.
let cache = Cache::new(10_000);
// Spawn async tasks and write to and read from the cache.
let tasks: Vec<_> = (0..NUM_TASKS)
.map(|i| {
// To share the same cache across the async tasks, clone it.
// This is a cheap operation.
let my_cache = cache.clone();
let start = i * NUM_KEYS_PER_TASK;
let end = (i + 1) * NUM_KEYS_PER_TASK;
tokio::spawn(async move {
// Insert 64 elements. (NUM_KEYS_PER_TASK = 64)
for key in start..end {
// Use spawn_blocking() for insert() as it internally uses locks
// that are not async aware.
let my_cache1 = my_cache.clone();
task::spawn_blocking(move || my_cache1.insert(key, value(key)))
.await
.unwrap();
// get() returns Option<String>, a clone of the stored value.
assert_eq!(my_cache.get(&key), Some(value(key)));
}
// Invalidate every 4 element of the inserted elements.
for key in (start..end).step_by(4) {
// Use spawn_blocking() for invalidate() as it internally uses locks
// that are not async aware.
let my_cache1 = my_cache.clone();
task::spawn_blocking(move || my_cache1.invalidate(&key))
.await
.unwrap();
}
})
})
.collect();
// Wait for all tasks to complete.
for task in tasks {
task.await.expect("Failed");
}
// Verify the result.
for key in 0..(NUM_TASKS * NUM_KEYS_PER_TASK) {
if key % 4 == 0 {
assert_eq!(cache.get(&key), None);
} else {
assert_eq!(cache.get(&key), Some(value(key)));
}
}
} ```
A near future version of Moka will provide async
optimized caches in addition to
the synchronous caches.
Moka supports the following expiration policies:
insert
.get
or insert
.To set them, use the cache Builder
.
```rust use moka::sync::Builder;
use std::time::Duration;
fn main() { let cache = Builder::new(10000) // Max 10,000 elements // Time to live (TTL): 30 minutes .timetolive(Duration::fromsecs(30 * 60)) // Time to idle (TTI): 5 minutes .timetoidle(Duration::from_secs( 5 * 60)) // Create the cache. .build();
// This entry will expire after 5 minutes (TTI) if there is no get().
cache.insert(0, "zero");
// This get() will extend the entry life for another 5 minutes.
cache.get(&0);
// Even though we keep calling get(), the entry will expire
// after 30 minutes (TTL) from the insert().
} ```
Moka caches maintain internal data structures for entry replacement algorithms. These
structures are guarded by a lock and operations are applied in batches using a
dedicated worker thread to avoid lock contention. Under heavy updates, the worker
thread may not be able to catch up to the updates. When this happens, insert
or
invalidate
call will be paused (blocked) for a short time.
If this pause happen very often, you may want to switch to a segmented cache. You can
use segments
method of the builder to create such a cache.
By default, a cache uses a hashing algorithm selected to provide resistance against HashDoS attacks.
The default hashing algorithm is the one used by std::collections::HashMap
, which
is currently SipHash 1-3, though this is subject to change at any point in the
future.
While its performance is very competitive for medium sized keys, other hashing algorithms will outperform it for small keys such as integers as well as large keys such as long strings. However those algorithms will typically not protect against attacks such as HashDoS.
The hashing algorithm can be replaced on a per-Cache
basis using the with_hasher
method of the cache Builder
. Many alternative algorithms are available on
crates.io, such as the aHash crate.
This crate's minimum supported Rust version (MSRV) is 1.45.2.
If no feature is enabled, MSRV will be updated conservatively. When using other
features, like async
(which is not available yet), MSRV might be updated more
frequently, up to the latest stable. In both cases, increasing MSRV is not
considered a semver-breaking change.
Moka is named after the moka pot, a stove-top coffee maker that brews espresso-like coffee using boiling water pressurized by steam.
Moka is distributed under the terms of both the MIT license and the Apache License (Version 2.0).
See LICENSE-MIT and LICENSE-APACHE for details.