pqrs
is a tool which deserializes compiled protobuf messages given a set of pre-compiled .fdset
files.
*.fdset
files in ~/.pq
:
$ protoc -o dog.fdset dog.proto
$ protoc -o person.fdset person.proto
$ cp *.fdset ~/.pq/
sevag:pqrs $ ./py-test/generate_random_proto.py | pq | jq
{
"age": 4,
"breed": "poodle",
"temperament": "excited"
}
sevag:pqrs $ ./py-test/generate_random_proto.py | pq | jq
{
"id": 2,
"name": "raffi"
}
Use --force
to try to brute-force decode a message. E.g., given a 20-byte message:
``` while (don't have decode result) if (decode [0:19]) == success: return
# try chopping off 1 byte
if (decode([0:18]) == success or\
decode([1:19]) == success): return
# try chopping off 2 consecutive bytes
if (decode([0:17]) == success or\
decode([1:18]) == success) or\
decode([2:19]) == success): return
# repeat until success or no bytes left
```
pqrs
by default will guess the message type. You can make it use a specific type by passing the fully qualified message name, e.g. pq --msgtype="com.example.dog.Dog"
.
Guessing strategy:
~/.pq/*.fdset
, try to decode the message with itserde_value::Value::Unit
in the codebase), discard the decoded resultBTreeMap
in a vectorBTreeMap
from the vector which has the most fieldsSince protobuf treats fields as positional, similar protos (e.g. Dog:
I treat streams the way it's done by Google in delimitedmessageutil.
Basically, each message is preceded with a raw varint32
containing the length of the message. Different streaming strategies are possible. I need to come up with a way to accomodate them all.
In length.rs I got started on an enum to allow different types of length-delimiting. For instance I'm planning on adding leb128
in addition to varint32
which is implemented. This will probably be toggleable with a switch (i.e. --delimiter="leb128|varint32"
).
First, clone and compile musl-gcc
on your system:
$ git clone git://git.musl-libc.org/musl
$ ./configure && make && sudo make install
Then, run make
in this repo - this downloads a local ./rust
toolchain with the x86_64-unknown-linux-musl
target and runs ./rust/bin/cargo --target=x86_64-unknown-linux-musl
to build pqrs
.
The result is a static binary:
$ ldd ./target/x86_64-unknown-linux-musl/debug/pq
not a dynamic executable
$
$ file ./target/x86_64-unknown-linux-musl/debug/pq
./target/x86_64-unknown-linux-musl/debug/pq: ELF 64-bit LSB executable, x86-64, version 1 (GNU/Linux), statically linked, BuildID[sha1]=3aa843efe79d0082aacb674a28e8d1ed8105a5e5, not stripped
Alternatively, you can download a static musl
-compiled binary from the releases page.
[dependencies]
docopt = "0.7"
rustc-serialize = "0.3"
serde = "0.9.12"
serde-value = "0.4.0"
serde_json = "0.9.9"
serde-protobuf = "0.5"
protobuf = "1.2.1"
The testing tools are ./py-test for a Python random compiled protobuf generator (py-test README), and ./tests for Rust integration tests. The integration tests invoke the pqrs
binary using std::process
and checks return codes, stdout, etc. - inspired by the xsv test suite.
There is no linting in the Travis-CI job because it takes too long, but there is a make target (make lint
). This is a bit hacky - it switches to rust-stable to run cargo fmt
, rust-nightly to run cargo clippy
(and then back to rust-stable). Run this before submitting a PR, or alternatively, run cargo fmt
and cargo clippy
however you prefer.
The original goal was to make a UNIX-y tool for generalized protobuf pretty-printing. Since jq
already exists, I dropped the pretty-printing requirement and just output ugly JSON.