Bartleby: a symbol renaming tool.

This repository contains the source code for Bartleby, a library and a tool to rename symbols across several object files and static libraries.

Why?

If you develop a library, your end product will contain:

Bartleby helps you ship a static library exposing only the symbols you want exposed and without polluting the namespace with the symbols of your own dependencies. It is similar to objcopy --prefix-symbols=__private but with "intelligent" (see the "How?" section below) automated selection of which symbols to prefix (for instance a reference to puts won't be renamed private_puts because it would lead to unresolved symbols).

This helps solving the diamond dependency problem: if a user of your library wants to link against OpenSSL, which also happens to be a library you link against, how do you make sure that the user remains in control of the version of OpenSSL they use without having name collisions with your own use of OpenSSL? You use Bartleby.

How?

Bartleby takes a set of files as input. These can be static libraries (.a) or objects (.o). Bartleby figures out which symbols are considered private or not by iterating over all these objects (the ones directly supplied by the user, or the ones contained in the archives), and collects the visibility and the definedness of each symbols found. Using these pieces of information, a map of symbols is built as follows:

Finally, for each symbol marked as defined and global, Bartleby considers them as private and adds a prefix to its name, and rename all its references across all the objects. The output is a single static library that contains all the processed objects.

Example

Let's say we have the following three C files, along with their header:

```c

include

include

attribute((visibility("default"))) void myapi(void) { puts("myapi called, calling external dep"); char buf[0x41]; external_api(buf, sizeof(buf)); }

void internalimpl(void) { puts("internal implementation, calling my_api"); myapi(); } ```

```c

include

include "api_1.h"

attribute((visibility("default"))) void anotherapi(void) { puts("anotherapi called, calling myapi"); myapi(); } ```

```c

include

include

attribute((visibility("default"))) void externalapi(void *src, const sizet n) { puts("external_api called, calling external private impl"); memset(src, 0, n); }

void externalprivateimpl(void) { puts("external private implementation called"); } ```

We compile the first two objects from our API:

shell $ clang -fvisibility=hidden -c api_1.c api_2.c -isystem. $ file api_1.o api_2.o api_1.o: Mach-O 64-bit object arm64 api_2.o: Mach-O 64-bit object arm64

Then, we create a static library out of them:

shell $ ar rvs libapi.a api_1.o api_2.o

Now, let's build our simple external dependency:

shell $ clang -fvisibility=hidden -c external_dep.c $ ar rvs libexternal.a external_dep.o

libapi.a is the library we want to have its symbols prefixed with our custom prefix. libexternal.a is one of our external dependencies, and we want the user to be able to link against libapi.a without encountering any C symbol collisions because libexternal might be used in another place. We run Bartleby by giving libapi.a and libexternal.a as an input. The output will be libapi_v1.1.a:

shell $ bartleby --if libapi.a \ --if libexternal.a \ --of libapi_v1.1.a \ --prefix __impl_v1.1_ 5 symbol(s) prefixed libapi_v1.1.a produced.

Now, we use nm to inspect the produced archive:

```shell $ nm libapiv1.1.a pi1.o: U implv1.1externalapi 0000000000000064 T implv1.1internalimpl 0000000000000000 T implv1.1myapi U stackchkfail U _stackchkguard U _puts 0000000000000084 r l.str 00000000000000a8 r l_.str.1

api2.o: 0000000000000000 T implv1.1anotherapi U _implv1.1myapi U puts 0000000000000020 r l.str

externaldep.o: 0000000000000000 T implv1.1externalapi 000000000000001c T _implv1.1externalprivateimpl U _puts 0000000000000038 r l.str 000000000000006b r l_.str.1 ```

We can now see that defined symbols have been prefixed with __impl_v1.1. However, puts hasn’t been prefixed, because even if it’s a global symbol, no definition has been found in either libapi.a or libexternal.a, therefore Bartleby didn’t prefix it.

Getting Started

Bartleby can be compiled with Bazel or CMake.

Using CMake

LLVM >= 15.0 is required to build Bartleby. See Getting Started to find more information about how to build LLVM, and LLVM releases to find all the releases. APT packages are also available on the LLVM Debian/Ubuntu packages page.

shell $ cmake -B build -DCMAKE_BUILD_TYPE=Release -DLLVM_DIR=/path/to/llvm-15/lib/cmake $ cmake --build build $ ./build/bin/bartleby

If an higher version of LLVM is installed, BARTLEBY_LLVM_VERSION must be defined:

shell $ cmake -B build -DCMAKE_BUILD_TYPE=Release -DBARTLEBY_LLVM_VERSION=16.0 -DLLVM_DIR=/path/to/llvm-16/lib/cmake $ cmake --build build

Using Bazel

It is highly recommended to use [bazelisk].

shell $ bazelisk build -c opt bartleby/... $ ./bazel-bin/bartleby/tools/Bartleby/bartleby

Importing Bartleby in your Bazel project

It is also possible to import Bartleby in an existing Bazel workspace using [http_archive]. One can use it as a tool, or can also use it through the [bartleby starlark rule]. See examples/bazel for more information.

License

See [LICENSE].