ANTLR4 runtime for Rust programming language
Generator part is currently located in rust-target branch of my antlr4 fork rrevenantt/antlr4/tree/rust-target
For examples you can see grammars, tests/gen for corresponding generated code and tests/my_tests.rs for actual usage examples
Everything is implemented, "business" logic is quite stable and well tested, but user facing API is not very robust yet an very likely will have some changes.
For now development is going on in this repository but eventually it will be merged to main ANTLR4 repo
Currently requires nightly version of rust.
This very likely will be the case until specialization
,try_blocks
and unsize
features are stabilized.
Remaining things before merge:
- API stabilization
- [ ] Rust api guidelines compliance
- [ ] more tests for API because it is quite different from Java
- make parsing zero copy(i.e. use &str(or Cow) instead String in token and &Token in tree nodes)
- more generic PredictionContext
- generic over ownership for string
- profiling and performance optimizations
- generate enum for labeled alternatives without redundant Error
option
- ? option to generate fields instead of getters by default
Can be done after merge: - Documentation - [ ] Quite some things are already documented but still far from perfect - Code quality - [ ] Rustfmt fails to run currently - [ ] Clippy sanitation - [ ] Not all warning are fixed - visitor - build.rs integration + example - run rustfmt on generated parser
Add to Cargo.toml
toml
[dependencies]
lazy_static = "1.4"
antlr-rust = "0.1"
and #![feature(try_blocks)]
in your project root module
It is possible to generate idiomatic Rust syntax trees. For this you would need to use labels feature of ANTLR. You can see Labels grammar for example. Consider following rule : ```text e : a=e op='*' b=e # mult | left=e '+' b=e # add
``
For such rule ANTLR will generate enum
EContextAllcontaining
multand
addalternatives,
so you will be able to match on them in your code.
Also corresponding struct for each alternative will contain fields you labeled.
I.e. for
MultContextstruct will contain
aand
bfields containing child subtrees and
opfield with
TerminalNodetype which corresponds to individual
Token.
It also is possible to disable generic parse tree creation to keep only selected children via
parser.buildparsetrees = false`.
Although Rust runtime API is made as close as possible to Java, there are quite some differences because Rust is not an OOP language and is much more explicit.
Default + Clone
.ListenerId
otherwise ParseTreeWalker
should be used.recog
variable instead of self
.
This is because predicate have to be inserted into two syntactically different places in generated parser Currently unsafe is used only to cast from trait object back to original type
and to update data inside Rc via get_mut_unchecked
(returned mutable reference is used immediately and not stored anywhere)
BSD 3-clause