SCC: Scalable Concurrent Containers

SCC offers scalable concurrent containers written in Rust. The data structures in SCC are designed for use by database management software running on a server, and therefore they may not work efficiently with small data sets.

scc::HashMap

scc::HashMap is a scalable in-memory unique key-value store targeted at highly concurrent, heavy workloads. Unlike most concurrent hash maps, it does not distribute data across multiple shards; instead, it has a single array of entries and a corresponding array of metadata cells. The metadata management strategy is similar to that of Swisstable: a metadata cell, stored separately from the key-value array, is a 64-byte data structure that manages sixteen consecutive entries in the key-value array. The metadata cell also holds a linked list of entry arrays for hash collision resolution. scc::HashMap automatically grows and shrinks its internal array, and resizing does not block other operations or threads. To keep the latency of each operation predictable, it does not rehash every entry in the container at once when resizing; instead, it distributes the resizing workload across future accesses to the data structure.
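The idea of spreading the resizing workload over future accesses can be sketched as follows. This is a minimal, single-threaded illustration built only on the standard library, not SCC's actual implementation: `IncrementalMap`, `MIGRATE_PER_OP`, `bucket_of`, and `find` are hypothetical names, and the metadata cells, linked entry arrays, shrinking, and all concurrency control are left out.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical number of old buckets migrated by each insert.
const MIGRATE_PER_OP: usize = 2;

/// Hypothetical map that rehashes a few buckets per insert instead of all at once.
pub struct IncrementalMap<K, V> {
    current: Vec<Vec<(K, V)>>,
    /// Array being drained while a resize is in progress.
    old: Option<Vec<Vec<(K, V)>>>,
    /// Index of the next old bucket to migrate.
    next_old_bucket: usize,
    len: usize,
}

fn bucket_of<K: Hash>(key: &K, num_buckets: usize) -> usize {
    let mut hasher = DefaultHasher::new();
    key.hash(&mut hasher);
    (hasher.finish() as usize) % num_buckets
}

fn find<'a, K: Hash + Eq, V>(buckets: &'a [Vec<(K, V)>], key: &K) -> Option<&'a V> {
    let bucket = &buckets[bucket_of(key, buckets.len())];
    bucket.iter().find(|(k, _)| k == key).map(|(_, v)| v)
}

impl<K: Hash + Eq, V> IncrementalMap<K, V> {
    pub fn new() -> Self {
        let current = (0..4).map(|_| Vec::new()).collect();
        Self { current, old: None, next_old_bucket: 0, len: 0 }
    }

    pub fn len(&self) -> usize {
        self.len
    }

    /// Moves at most MIGRATE_PER_OP buckets from the old array into the current one.
    fn migrate_some(&mut self) {
        let mut finished = false;
        if let Some(old) = self.old.as_mut() {
            for _ in 0..MIGRATE_PER_OP {
                if self.next_old_bucket == old.len() {
                    break;
                }
                for (k, v) in std::mem::take(&mut old[self.next_old_bucket]) {
                    let i = bucket_of(&k, self.current.len());
                    self.current[i].push((k, v));
                }
                self.next_old_bucket += 1;
            }
            finished = self.next_old_bucket == old.len();
        }
        if finished {
            self.old = None;
        }
    }

    /// Inserts a unique key; returns false if the key already exists.
    pub fn insert(&mut self, key: K, value: V) -> bool {
        self.migrate_some();
        if self.get(&key).is_some() {
            return false;
        }
        // Start a resize only when the array is full and no resize is running.
        if self.old.is_none() && self.len >= self.current.len() {
            let bigger = (0..self.current.len() * 2).map(|_| Vec::new()).collect();
            self.old = Some(std::mem::replace(&mut self.current, bigger));
            self.next_old_bucket = 0;
        }
        let i = bucket_of(&key, self.current.len());
        self.current[i].push((key, value));
        self.len += 1;
        true
    }

    /// Looks in the current array first, then in the not-yet-migrated old array.
    pub fn get(&self, key: &K) -> Option<&V> {
        find(&self.current, key)
            .or_else(|| self.old.as_deref().and_then(|old| find(old, key)))
    }
}

fn main() {
    let mut map = IncrementalMap::new();
    for i in 0..100u32 {
        assert!(map.insert(i, i * 10));
    }
    assert!(!map.insert(7, 0)); // keys are unique
    assert_eq!(map.get(&42), Some(&420));
    println!("entries: {}", map.len());
}
```

In this sketch no single insert ever moves more than `MIGRATE_PER_OP` buckets, so the cost of a resize is bounded per operation at the price of lookups occasionally consulting two arrays.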

Performance

Test setup

Test data

Test workload: local

|        | 11 threads     | 22 threads     | 44 threads     | 88 threads     |
|--------|----------------|----------------|----------------|----------------|
| Insert | 156.423361787s | 187.157442477s | 264.075874751s | 463.032489985s |
| Read   | 81.03393205s   | 92.933046817s  | 109.303575217s | 137.802145824s |
| Remove | 85.563265194s  | 102.896206291s | 117.072458551s | 167.450069665s |

Test workload: local-remote

|        | 11 threads     | 22 threads     | 44 threads     | 88 threads     |
|--------|----------------|----------------|----------------|----------------|
| Insert | 272.420310927s | 314.424537182s | 432.493505328s | 772.267595819s |
| Mixed  | 326.767954659s | 350.603202721s | 375.987412301s | 433.899012681s |
| Remove | 164.857461617s | 184.528933216s | 199.187884668s | 250.735616868s |

scc::TreeIndex

Work-in-progress

Only two APIs are implemented.

- scc::TreeIndex::insert
- scc::TreeIndex::read

scc::TreeIndex is a B+-tree variant optimized for read operations. Locks are acquired only on structural changes, and read/scan operations are neither blocked nor interrupted by other threads. The semantics of each API are similar to the read-committed isolation level in database management software: a reader may not see data that is newer than its read snapshot.
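The snapshot-style read semantics can be illustrated with a deliberately simpler technique than a B+-tree: copy-on-write over a standard `BTreeMap`. The sketch below is not how scc::TreeIndex is implemented, and `SnapshotIndex` and its methods are hypothetical names. Readers here take a brief read lock to obtain a snapshot (whereas the text above says TreeIndex readers are never blocked), and every insert copies the whole map, so it only demonstrates the visibility rule, not the performance characteristics.

```rust
use std::collections::BTreeMap;
use std::sync::{Arc, Mutex, RwLock};
use std::thread;

/// Hypothetical index in which readers work on an immutable snapshot.
pub struct SnapshotIndex<K, V> {
    /// The most recently committed version of the data.
    published: RwLock<Arc<BTreeMap<K, V>>>,
    /// Serializes writers so that no update is lost.
    writer: Mutex<()>,
}

impl<K: Ord + Clone, V: Clone> SnapshotIndex<K, V> {
    pub fn new() -> Self {
        Self {
            published: RwLock::new(Arc::new(BTreeMap::new())),
            writer: Mutex::new(()),
        }
    }

    /// Returns the currently committed version; later writes do not affect it.
    pub fn snapshot(&self) -> Arc<BTreeMap<K, V>> {
        let guard = self.published.read().unwrap();
        Arc::clone(&*guard)
    }

    /// Inserts a unique key by publishing a modified copy (O(n) per insert).
    pub fn insert(&self, key: K, value: V) -> bool {
        let _writer = self.writer.lock().unwrap();
        let mut next = (*self.snapshot()).clone();
        if next.contains_key(&key) {
            return false;
        }
        next.insert(key, value);
        *self.published.write().unwrap() = Arc::new(next);
        true
    }

    /// Applies `reader` to the value committed at the time of the call.
    pub fn read<R>(&self, key: &K, reader: impl FnOnce(&V) -> R) -> Option<R> {
        let snapshot = self.snapshot();
        snapshot.get(key).map(reader)
    }
}

fn main() {
    let index = Arc::new(SnapshotIndex::new());
    assert!(index.insert(1, "one"));

    // A reader that took its snapshot before the next insert.
    let old = index.snapshot();

    let writer = {
        let index = Arc::clone(&index);
        thread::spawn(move || index.insert(2, "two"))
    };
    assert!(writer.join().unwrap());

    // The old snapshot does not see the newer entry; a fresh read does.
    assert_eq!(old.get(&2), None);
    assert_eq!(index.read(&2, |v| *v), Some("two"));
}
```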

Changelog

0.3.6

Make scc::TreeIndex::insert work correctly on aarch64

0.3.5

Fix a data race issue with scc::TreeIndex::insert

0.3.4

Implement scc::TreeIndex::insert and scc::TreeIndex::read

0.3.2

Make scc::HashMap pass Valgrind by deallocating arrays immediately on drop

0.3.1

APIs stabilized

0.2.13

Add 'contains' and 'hasher' APIs

0.2.12

Update crossbeam_epoch to 0.9.1

0.2.11

Remove libc dependencies

Adjust memory alignment

0.2.10

Fix memory leak

0.2.8

Make scc::HashMap stack-unwinding-safe, meaning that it does not leave resources (memory, locks) unreleased after stack unwinding, on one condition: moving instances of the K and V types must always succeed (in C++ terms, K and V satisfy std::is_nothrow_move_constructible).

Refine resizing strategies

0.2.7

Remove unnecessary heap allocation during read

Milestones
