[
](https://docs.rs/html-streaming-editor/)
Run (simple) manipulations on HTML files, like extracting parts. Use CSS selectors to define which parts of the HTML to operator on, use different commands in pipes to perform the desired operations.
The basic syntax is:
COMMAND{ SELECTOR } | COMMAND{ SELECTOR }
Some COMMAND
use sub-pipelines. There are two kind of COMMANDS
with this:
- "iterate"/"forEach": For each (sub) node matching the inner selector the sub-pipeline is processed, but the elements themselves are not changed
COMMAND{ SELECTOR ↦ COMMAND{ SELECTOR } | COMMAND { SELECTOR } }
COMMAND{ SELECTOR => COMMAND{ SELECTOR } | COMMAND { SELECTOR } }
"map"/"replace": For each (sub) node matching the inner selector the sub-pipeline is processed, and the pipelines result is used instead of the element
COMMAND{ SELECTOR ↤ COMMAND{ SELECTOR } | COMMAND { SELECTOR } }
COMMAND{ SELECTOR <= COMMAND{ SELECTOR } | COMMAND { SELECTOR } }
The SELECTOR
is a CSS selector.
Currently supported:
ONLY
: remove everything not matching the CSS selector (alias: SELECT
)WITHOUT
: remove everything matching the CSS selector (alias: FILTER
)FOR-EACH
: run a sub-pipeline on all sub-elements matching a CSS selector but return the previously selected elements (alias: FOR
)CLEAR-ATTR
: removes a given attribute from the previously selected elements CLEAR-CONTENT
: clears all children from the previously selected elementsSET-ATTR
: Sets a given attribute to a specified valueSET-TEXT-CONTENT
: removes previous children and replaces it with exactly one given text childADD-TEXT-CONTENT
: appends a new text childADD-COMMENT
: appends a new comment childADD-ELEMENT
: appends a new tag/element childREPLACE
: replace all elements matching a CSS selector with new elements (alias: MAP
)CREATE-ELEMENT
: creates a new, empty element, mainly in combination with ADD-ELEMENT
or REPLACE-WITH
(alias: NEW
)Planned commands:
READ-FROM
: reads a DOM from a different file, mainly in combination with ADD-ELEMENT
or REPLACE-WITH
(alias: SOURCE
) The binary is called hse
and supports following options:
```
USAGE:
hse [OPTIONS]
ARGS:
OPTIONS:
-h, --help Print help information
-i, --input File name of the Input. -
for stdin (default)
-o, --output
```shell
hse -i index.html 'ONLY{header .content}' ```
```shell
<main>
or element with CSS class main
, but without any <script>
defined insidehse -i index.html 'ONLY{main, .main} | WITHOUT{script}' ```