pqrs Travis GitHub release GitHub tag license

protobuf to json deserializer, written in Rust

pqrs is a tool which deserializes compiled protobuf messages given a set of pre-compiled .fdset files.

Now on crates.io Crates.io Crates.io

Contents

  1. Usage
  2. Forced decoding
  3. Message guessing
  4. Streaming
  5. Portability with musl
  6. Dependencies
  7. Tests

Usage

  1. Put your *.fdset files in ~/.pq:

$ protoc -o dog.fdset dog.proto $ protoc -o person.fdset person.proto $ cp *.fdset ~/.pq/

  1. Pipe a single compiled protobuf message to pq:

sevag:pqrs $ ./py-test/generate_random_proto.py | pq | jq { "age": 4, "breed": "poodle", "temperament": "excited" } sevag:pqrs $ ./py-test/generate_random_proto.py | pq | jq { "id": 2, "name": "raffi" }

Read the manpage.

Forced decoding

Use --force to try to brute-force decode a message. E.g., given a 20-byte message:

``` while (don't have decode result) if (decode [0:19]) == success: return

# try chopping off 1 byte
if (decode([0:18]) == success or\
    decode([1:19]) == success): return

# try chopping off 2 consecutive bytes
if (decode([0:17]) == success or\
    decode([1:18]) == success) or\
    decode([2:19]) == success): return

# repeat until success or no bytes left

```

Message guessing

pqrs by default will guess the message type. You can make it use a specific type by passing the fully qualified message name, e.g. pq --msgtype="com.example.dog.Dog".

Guessing strategy:

Since protobuf treats fields as positional, similar protos (e.g. Dog: , Person: ) are indistinguishable.

Streaming

I treat streams the way it's done by Google in delimitedmessageutil.

Basically, each message is preceded with a raw varint32 containing the length of the message. Different streaming strategies are possible. I need to come up with a way to accomodate them all.

In length.rs I got started on an enum to allow different types of length-delimiting. For instance I'm planning on adding leb128 in addition to varint32 which is implemented. This will probably be toggleable with a switch (i.e. --delimiter="leb128|varint32").

Portability with musl

First, clone and compile musl-gcc on your system:

$ git clone git://git.musl-libc.org/musl $ ./configure && make && sudo make install

Then, run make in this repo - this downloads a local ./rust toolchain with the x86_64-unknown-linux-musl target and runs ./rust/bin/cargo --target=x86_64-unknown-linux-musl to build pqrs.

The result is a static binary:

$ ldd ./target/x86_64-unknown-linux-musl/debug/pq not a dynamic executable $ $ file ./target/x86_64-unknown-linux-musl/debug/pq ./target/x86_64-unknown-linux-musl/debug/pq: ELF 64-bit LSB executable, x86-64, version 1 (GNU/Linux), statically linked, BuildID[sha1]=3aa843efe79d0082aacb674a28e8d1ed8105a5e5, not stripped

Alternatively, you can download a static musl-compiled binary from the releases page.

Dependencies

[dependencies] docopt = "0.7" rustc-serialize = "0.3" serde = "0.9.12" serde-value = "0.4.0" serde_json = "0.9.9" serde-protobuf = "0.5" protobuf = "1.2.1"

Tests

The testing tools are ./py-test for a Python random compiled protobuf generator (py-test README), and ./tests for Rust integration tests. The integration tests invoke the pqrs binary using std::process and checks return codes, stdout, etc. - inspired by the xsv test suite.

There is no linting in the Travis-CI job because it takes too long, but there is a make target (make lint). This is a bit hacky - it switches to rust-stable to run cargo fmt, rust-nightly to run cargo clippy (and then back to rust-stable). Run this before submitting a PR, or alternatively, run cargo fmt and cargo clippy however you prefer.

Goals

The original goal was to make a UNIX-y tool for generalized protobuf pretty-printing. Since jq already exists, I dropped the pretty-printing requirement and just output ugly JSON.