rew

Rew is a text processing CLI tool that rewrites FS paths according to a pattern.

Build status Code coverage

Contents

:bulb: What rew does

  1. Reads values from standard input.
  2. Rewrites them according to a pattern.
  3. Prints results to standard output.

Input values are assumed to be FS paths, however, rew is able to process any UTF-8 encoded text.

What rew does

:package: Installation

:rocket: Usage

By default, input values are read as lines from standard input.

bash cmd | rew [options] [pattern]

Input values can be also passed as additional arguments.

bash rew [options] [pattern] [--] <value>...

Use -h flag to print short help, --help to print detailed help.

:pencil: Pattern

Pattern is a string describing how to generate output from an input.

Use --explain flag to print detailed explanation what a certain pattern does.

bash rew --explain 'file_{c|<3:0}.{e}'

By default, pattern characters are directly copied to output.

| Input | Pattern | Output | | ----- | ------- | ------ | | * | abc | abc |

Characters { and } form an expression which is evaluated and replaced in output.

Empty expression {} evaluates directly to input value.

| Input | Pattern | Output | | ------- | ------------ | --------------- | | world | {} | world | | world | Hello_{}_! | Hello_world_! |

Expression may contain one or more filters, delimited by |, which are consecutively applied on input value.

| Input | Pattern | Output | Description | | ---------- | ----------------- | ---------- | ---------------------------------- | | old.JPEG | new.{e} | new.JPEG | Extension | | old.JPEG | new.{e\|l} | new.jpeg | Extension + Lowercase | | old.JPEG | new.{e\|l\|r:e} | new.jpg | Extension + Lowercase + Remove e |

Character # starts an escape sequence.

| Sequence | Description | | -------- |--------------------------- | | #/ | System directory separator
\ on Windows
/ everywhere else | | #n | New line | | #r | Carriage return | | #t | Horizontal tab | | #0 | Null | | #{ | Escaped { | | #\| | Escaped \| | | #} | Escaped { | | ## | Escaped # |

Use --escape option to set a different escape character.

bash rew '{R:#t: }' # Replace tabs with spaces rew '{R:\t: }' --escape='\' # Same thing, different escape character

If no pattern is provided, input values are directly copied to output.

bash printf 'a\0b' | rew -z # Convert NUL bytes to newlines

:railway_track: Path filters

| Filter | Description | | ------ | ------------------------- | | w | Working directory | | a | Absolute path | | A | Relative path | | p | Normalized path | | P | Canonical path | | d | Parent directory | | D | Remove last name | | f | File name | | F | Last name | | b | Base name | | B | Remove extension | | e | Extension | | E | Extension with dot | | z | Ensure trailing separator | | Z | Remove trailing separator |

Path filters assume that their input value is a FS path. To get a specific portion of a path, use one of dD, fF, bB, eE filters.

| Pattern | Output | | ------------ | ----------------------- | | {} | /home/alice/notes.txt | | {d}, {D} | /home/alice | | {f}, {F} | notes.txt | | {b} | notes | | {B} | /home/alice/notes | | {e} | txt | | {E} | .txt |

Parent directory d might give a different result than D which removes last name of a path. Similarly, file name f might not be the same as last name F which is a complement of D.

| Input | {d} | {D} | {f} | {F} | | --------- | ------- | --------- | ----------| ----------| | / | / | / | (empty) | (empty) | | /a | / | / | a | a | | a/b | a | a | b | b | | a | . | (empty) | a | a | | . | ./.. | (empty) | (empty) | . | | .. | ../.. | (empty) | (empty) | .. | | (empty) | .. | (empty) | (empty) | (empty) |

Extension with dot E can be useful when dealing with files with no extension.

| Input | new.{e} | new{E} | | --------- | --------- | --------- | | old.txt | new.txt | new.txt | | old | new. | new |

Absolute path a and relative path A are both resolved against working directory w.

| {w} | Input | {a} | {A} | | ------------- | ----------- | ----------- | -------- | | /home/alice | /home/bob | /home/bob | ../bob | | /home/alice | ../bob | /home/bob | ../bob |

By default, working directory w is set to your current working directory. You can change that using the -w, --working-directory option. w filter will always output an absolute path, even if you set a relative one using the -w option.

bash rew -w '/home/alice' '{w}' # Absolute path rew -w '../alice' '{w}' # Relative to your current working directory

Normalized path p is constructed using the following rules:

| Input | Output | | Input | Output | | --------- |------- | - | --------- |------- | | (empty) | . | | / | / | | . | . | | /. | / | | .. | .. | | /.. | / | | a/ | a | | /a/ | /a | | a// | a | | /a// | /a | | a/. | a | | /a/. | /a | | a/.. | . | | /a/.. | / | | ./a | a | | /./a | /a | | ../a | ../a | | /../a | /a | | a//b | a/b | | /a//b | /a/b | | a/./b | a/b | | /a/./b | /a/b | | a/../b | b | | /a/../b | /b |

Canonical path P works similarly to p but has some differences:

Trailing separator filters z and Z can be useful when dealing with root and unnormalized paths.

| Input | {}b | {}/b | {z}b | {Z}/b | | ------ | ----- | -------| ------ | ------- | | / | /b | //b | /b | /b | | a | ab | a/b | a/b | a/b | | a/ | a/b | a//b | a/b | a/b |

:ab: Substring filters

| Filter | Description | | ------ | ------------------------------------------------- | | nA-B | Substring from index A to B.
Indices start from 1 and are both inclusive. | | nA- | Substring from index A to end. | | nA | Character at index A.
Equivalent to nA-A. | | N | Same as n but with backward indexing. |

Examples:

| Input | Pattern | Output | | ------- | -------- | ------ | | abcde | {n2-3} | bc | | abcde | {N2-3} | cd | | abcde | {n2-} | bcde | | abcde | {N2-} | abcd | | abcde | {n2} | b | | abcde | {N2} | d |

:mag: Replace filters

| Filter | Description | | ------- | ------------------------------------------------------- | | r:X:Y | Replace first occurrence of X with Y.
Any other character than : can be also used as a delimiter. | | r:X | Remove first occurrence of X.
Equivalent to r:X: | | R | Same as r but replaces/removes all occurrences. | | ?D | Replace empty value with D. |

Examples:

| Input | Pattern | Output | | --------- | ----------- | ------- | | ab_ab | {r:ab:xy} | xy_ab | | ab_ab | {R:ab:xy} | xy_xy | | ab_ab | {r:ab} | _ab | | ab_ab | {R:ab} | _ | | abc | {?def} | abc | | (empty) | {?def} | def |

:star: Regex filters

| Filter | Description | | ------------- | ------------------------------------------------ | | =E | Match of a regular expression E. | | s:X:Y | Replace first match of a regular expression X with Y.
Y can reference capture groups from X using $1, $2, ...
Any other character than : can be also used as a delimiter. | | s:X | Remove first match of a regular expression X.
Equivalent to s:X:. | | S | Same as s but replaces/removes all matches. | | 1, 2, ... | Capture group of an external regular expression. |

Examples:

| Input | Pattern | Output | | --------- | ---------------------| ------- | | 12_34 | {=\d+} | 12 | | 12_34 | {s:\d+:x} | x_34 | | 12_34 | {S:\d+:x} | x_x | | 12_34 | {s:(\d)(\d):$2$1} | 21_34 | | 12_34 | {S:(\d)(\d):$2$1} | 21_43 |

bash echo 'a/b.c' | rew -e '([a-z])' '{1}' # Will print 'a' echo 'a/b.c' | rew -E '([a-z])' '{1}' # Will print 'b'

:art: Format filters

| Filter | Description | | ------ | -------------------------------------- | | t | Trim white-spaces from both sides. | | u | Convert to uppercase. | | l | Convert to lowercase. | | i | Convert non-ASCII characters to ASCII. | | I | Remove non-ASCII characters. | | <<M | Left pad with mask M. | | <N:M | Left pad with N times repeated mask M.
Any other non-digit than : can be also used as a delimiter. | | >>M | Right pad with mask M. | | >N:M | Right pad with N times repeated mask M.
Any other non-digit than : can be also used as a delimiter. |

Examples:

| Input | Pattern | Output | | ---------- | ------------ | -------- | | ..a..b.. | {t} | a..b (dots are white-spaces) | | aBčĎ | {u} | ABČĎ | | aBčĎ | {l} | abčď | | aBčĎ | {a} | aBcD | | aBčĎ | {A} | aB | | abc | {<<123456} | 123abc | | abc | {>>123456} | abc456 | | abc | {<3:XY} | XYXabc | | abc | {>3:XY} | abcYXY |

:infinity: Generators

| Filter | Description | | ------ | -------------------------------------------------- | | *N:V | Repeat N times V.
Any other non-digit than : can be also used as a delimiter. | | c | Local counter | | C | Global counter | | uA-B | Random number from interval [A, B] | | uA- | Random number from interval [A, 264) | | u | Random number from interval [0, 264) | | U | Random UUID |

Examples:

| Pattern | Output | | --------- | ------------------------------------------------- | | {*3:ab} | ababab | | {c} | (see below) | | {C} | (see below) | | {u0-99} | (random number between 0-99) | | {U} | 5eefc76d-0ca1-4631-8fd0-62eeb401c432 (random) |

| Input | Global counter | Local counter | | ----- | -------------- | ------------- | | A/1 | 1 | 1 | | A/2 | 2 | 2 | | B/1 | 3 | 1 | | B/2 | 4 | 2 |

bash rew -c0 '{c}' # Start from 0, increment by 1 rew -c2:3 '{c}' # Start from 2, increment by 3

:keyboard: Input

By default, input values are read as lines from standard input. LF or CR+LF is auto-detected as a delimiter, independent of platform.

bash find | rew '{a}' # Convert output of find command to absolute paths find -print0 | rew -z '{a}' # Use NUL delimiter in case paths contain newlines echo "$PATH" | rew -d: # Split PATH variable entries delimited by colon rew -r 'A{}B' <data.txt # Read file as a whole, prepend 'A', append 'B'

Input values can be also passed as additional arguments, after a pattern.

bash rew '{a}' *.txt # Wildcard expansion is done by shell

:speech_balloon: Output

By default, results are printed as lines to standard output. LF is used as a delimiter.

bash rew '{D}' | xargs mkdir -p # Pass extracted directories to mkdir command rew -Z '{D}' | xargs -0 mkdir -p # Use NUL delimiter in case paths contain newlines rew -D$'\r\n' # Convert newlines to CR+LF using custom output delimiter rew -R '{}#r#n' # Same thing but output delimiter is in the pattern rew -TD+ '{}' a b c # Join input values to string "a+b+c"

Apart from this (standard) mode, there are also two other output modes.

:robot: Diff mode

```text value1

outputvalue1 value2 outputvalue2 ... valueN outputvalueN ```

Such output can be processed by accompanying mvb and cpb utilities to perform bulk move/copy.

bash find -name '*.jpeg' | rew -b '{B}.jpg' | mvb # Rename all *.jpeg files to *.jpg find -name '*.txt' | rew -b '{}.bak' | cpb # Make backup copy of each *.txt file

:rose: Pretty mode

text input_value_1 -> output_value_1 input_value_2 -> output_value_2 ... input_value_N -> output_value_N

:microscope: Comparison with similar tools

rew vs rename / prename

bash find -name '*.jpeg' | xargs rename .jpeg .jpg # Rename *.jpeg files to *.jpg find -name '*.jpeg' | rew '{B}.jpg' -b | mvb # Same thing using rew + mvb find -name '*.jpeg' | rew 'mv "{}" "{B}.jpg"' | sh # Same thing using rew + mv + sh

rew vs coreutils

Like pwd, rew is able to print your current working directory.

bash pwd # Print your current working directory rew '{w}' '' # Same thing using rew

Like basename, rew is able to strip directory and suffix from a path.

bash basename 'dir/file.txt' '.txt' # Print base name without the ".txt" extension rew '{b}' 'dir/file.txt' # Same thing using rew, no need to specify an extension

Like dirname, rew is able to strip last component from a path.

bash dirname 'dir/file.txt' # Print directory name rew '{D}' 'dir/file.txt' # Same thing using rew

Like realpath, rew is able to resolve a path.

bash realpath -e '/usr/../home' # Print canonical path rew '{P}' '/usr/../home' # Same thing using rew realpath --relative-to='/home' '/usr' # Print path relative to a directory rew -w '/home' '{A}' '/usr' # Same thing using rew

rew vs grep

Like grep, rew is able to print match of a regular expression.

bash echo "123 abc 456" | grep -Po '\d+' # Extract all numbers from a string echo "123 abc 456" | rew '{=\d+}' # Same thing using rew (but only the first number)

rew vs sed / sd

Like sed or sd, rew is able to replace text using a regular expression.

bash echo "123 abc 456" | sed -E 's/([0-9]+)/_\1_/g' # Put underscores around numbers echo "123 abc 456" | sd '(\d+)' '_${1}_' # Same thing using sd echo "123 abc 456" | rew '{S:(\d+):_$1_}' # Same thing using rew

:cardfilebox: Examples

:information_source: Use rew --explain <pattern> to print detailed explanation what a certain pattern does.

Print contents of your current working directory as absolute paths.

bash rew '{a}' * # Paths are passed as arguments, wildcard expansion is done by shell ls | rew '{a}' # Paths are read from standard input

Rename all *.jpeg files to *.jpg.

bash find -name '*.jpeg' | rew -b '{B}.jpg' | mvb -v

Same thing but we use rew to generate executable shell code.

bash find -name '*.jpeg' | rew 'mv -v "{}" "{B}.jpg"' | sh

Make backup copy of each *.txt file with .txt.bak extension in the same directory.

bash find -name '*.txt' | rew -b '{}.bak' | cpb -v

Copy *.txt files (keep directory structure) to the ~/Backup directory.

bash find -name '*.txt' | rew -b "$HOME/Backup/{p}" | cpb -v

Copy *.txt files (flatten directory structure) to the ~/Backup directory.

bash find -name '*.txt' | rew -b "$HOME/Backup/{f}" | cpb -v

Same thing but we append randomly generated suffix after base name to avoid name collisions.

bash find -name '*.txt' | rew -b "$HOME/Backup/{b}_{U}.{e}" | cpb -v

Flatten directory structure ./dir/subdir/ to ./dir_subdir/.

bash find -mindepth 2 -maxdepth 2 -type d | rew -b '{D}_{F}' | mvb -v

Normalize base names of files to file_001, file_002, ...

bash find -type f | rew -b '{d}/file_{C|<3:0}{E}' | mvb -v

Print the first word of each line with removed diacritics (accents).

bash rew '{=\S+|i}' <input.txt

Swap the first and second column in a CSV file.

bash rew -e'([^:]*):([^:]*):(.*)' '{2}:{1}:{3}' <input.csv >output.csv

Same thing but we use regex replace filter.

bash rew '{s/([^:]*):([^:]*):(.*)/$2:$1:$3}' <input.csv >output.csv

Print PATH variable entries as lines.

bash echo "$PATH" | rew -d: # PATH entries are delimited by ':'

Replace tabs with 4 spaces in a file.

bash rew -rR '{R:#t: }' <input.txt >output.txt # Read/write file content as a whole

Normalize line endings in a file to LF.

bash rew <input.txt >output.txt # LF is the default output delimiter

Normalize line endings in a file to CR+LF.

bash rew -D$'\r\n' <input.txt >output.txt # CR+LF delimiter using -D option rew -R '{}#r#n' <input.txt >output.txt # CR+LF delimiter in pattern

:pagefacingup: License

Rew is licensed under the MIT license.