example-01
├── collagen.json
└── images
└── smiley.jpg
This project is a work in progress, but it’s coming along well! Testing and documentation definitely need to be beefed up, but the functionality is nearly all there. |
Collagen is a program that takes as input a folder containing zero or more image files
(.jpeg
, .png
, etc.) and a JSON manifest file describing the layout of these images
along with SVG components such as shapes and text, and produces as output a single SVG
file with all assets embedded.
There are several widely used image formats, perhaps the three best known of which are JPEG, PNG, and SVG. JPEG and PNG are raster formats, which means they correspond to a rectangular grid of pixels. On the other hand, SVG is a vector format, which means it describes perfectly precise curves that can be displayed with arbitrarily high precision. These three formats are each optimized for a different use case:
JPEG uses lossy compression that preserves visual quality on most "real-life" images — images that contain smoothly-varying gradients — but which produces visible artifacts when used on other kinds of images, especially ones containing hard edges and/or text.
PNG uses lossless compression that handles images with few distinct colors well, but requires an inordinate amount of space for storing images with many colors.
SVG is a vector graphics format which can nevertheless contain embedded raster images; however, doing so requires base64 encoding the raster image.
Because each of these image formats is optimized for only a single use case, they cannot be easily combined. For instance, overlaying text on a JPEG image will introduce compression artifacts that were not present in the original text, while overlaying a JPEG image on a PNG will cause the file size to balloon.
Collagen — from "collage" and "generate"[1] — aims to fix this problem by packing all types of images, as well as vector shapes and text, into a single SVG file, the actual rendering of which is left to an SVG viewer application (which includes most web browsers). This allows a user to combine several graphics into a single file that can be displayed as an image without compromising on visual quality or file size.[2]
The input to Collagen is a folder containing at the very least a collagen.json
manifest file describing the layout of the resulting SVG.
If the manifest specifies any image files (by their path relative to the folder), then those image files must also be present at the expected path in the folder.
An input folder satisfying these criteria will be referred to as a skeleton.
An example of a simple input-output pair is below.
Suppose you have the following simple skeleton at directory example-01
:
example-01
├── collagen.json
└── images
└── smiley.jpg
Where images/smiley.jpg
is the following image (whose native size is 380×380 pixels):
And where collagen.json
contains the following:
{
"vars": { (1)
"bubble_text": "Collagen!!",
"nose_color": "#f00",
"text_color": "#000"
},
"attrs": { "viewBox": "0 0 500 400" },
"children": [
{
"image_path": "images/smiley.jpg", (2)
"attrs": { "transform": "translate(0 100) scale(1.3)" }
},
{
"tag": "circle", (3)
"attrs": {
"cx": 123,
"cy": 240,
"r": 15,
"fill": "{nose_color}",
"stroke": "#000",
"stroke-width": 3
}
},
{
"tag": "path",
"attrs": {
"d": "M 230 140 L 265 120 A 100 40 0 1 0 235 110 Z",
"stroke": "#000",
"stroke-width": 3,
"fill": "#fff"
}
},
{
"tag": "text",
"text": "{bubble_text}", (4)
"attrs": {
"x": 250,
"y": 97,
"text-anchor": "start",
"dominant-baseline": "top",
"font-family": "Impact",
"font-size": 30,
"fill": "{text_color}"
}
}
]
}
1 | A dictionary of variables. In text and attrs values, variables enclosed in curly brackets will have their value substituted.
For example, if vars contains values for dx and dy , then a translation could be applied with transform: translate({dx} {dy}) .
If a child’s vars doesn’t contain a variable that the child needs the value of, the variable’s value will be looked up by walking up the list of ancestors until the root is reached (i.e., variable scopes are nested). |
2 | To include an image, just give its relative path. |
3 | Most other tags are specified with the tag field, which contains the name of the SVG tag to use. |
4 | If a tag has a text field, the given text will be the content of the tag, as in <text>your text here</text> . |
Then, running the following command:[3]
cargo run -- -i example-01 -o example-01.svg
Will produce the following file, examples-01.svg
:
If you zoom in, you’ll see the smiley face’s pixels.
But because the nose and speech bubble are SVG elements (i.e., vector graphics, not raster) they look nice and smooth and crisp even when zoomed in.
That’s the whole point!
Perfectly precise vector graphics can coexist alongside raster graphics.
(This simple example shows just one image, but of course we could include arbitrarily many by simply adding more children of the form {"image_path": <path>}
.)
As we’ve seen, we can include raster images in skeletons; it would be silly if we couldn’t also include other skeletons!
Nested skeletons can be included by adding a child of the form {"clgn_path": <path>}
.
(Whereas a standalone skeleton gets turned into a <svg>
tag, a nested skeleton will reside in a <g>
tag.)
Let’s include the above skeleton in another (and just for fun, let’s add a photo of a kitten (source) too, because why not):
example-02
├── collagen.json
├── example-01
│ ├── collagen.json
│ └── images
│ └── smiley.jpg
└── kitty.jpg
Where example-02/collagen.json
is below:
{
"attrs": { "viewBox": "0 0 300 250" },
"children": [
{
"tag": "rect",
"attrs": {
"x": "10",
"y": "10",
"width": "275",
"height": "225",
"fill": "#ddd",
"stroke": "#00f",
"stroke-width": "10",
"stroke-dasharray": "10 10"
}
},
{
"tag": "g",
"attrs": { "transform": "translate(50 25) scale(.5)" },
"children": [
{
"clgn_path": "./example-01"
}
]
},
{
"image_path": "./kitty.jpg",
"attrs": { "transform": "translate(180 150) scale(.15)" }
}
]
}
Here’s the result when you run cargo run -- -i example-02 -o example-02.svg
:
So, as far as Collagen is concerned, skeletons act more or less the same as raster images, in the sense that the path is sufficient to include them.
The only difference is that the path to a skeleton child is given by the key clgn_path
instead of image_path
.
A format that makes it easy to place text on images? Sounds like it would be perfect for memes.
{
"attrs": { "viewBox": "0 0 800 650" },
"children": [
{
"fonts": [
{
"name": "Impact",
"path": "./impact.woff2" (1)
}
]
},
{
"image_path": "./drake-small.jpg",
"attrs": {
"width": 800
}
},
{
"vars": {
"x": 550,
"dy": 50
},
"tag": "text",
"attrs": {
"font-family": "Impact", (2)
"font-size": 50,
"color": "black",
"text-anchor": "middle",
"vertical-align": "top",
"x": "{x}",
"y": 420
},
"children": [
{
"tag": "tspan",
"text": "Using SVG-based text,",
"attrs": {
"x": "{x}",
"dy": 0
}
},
{
"tag": "tspan",
"text": "which is infinitely",
"attrs": {
"x": "{x}",
"dy": "{dy}"
}
},
{
"tag": "tspan",
"text": "zoomable and has",
"attrs": {
"x": "{x}",
"dy": "{dy}"
}
},
{
"tag": "tspan",
"text": "no artifacts",
"attrs": {
"x": "{x}",
"dy": "{dy}"
}
}
]
}
]
}
1 | Hmm, why might we need the path to a woff2 file? |
2 | It’s not a meme unless it uses the Impact font. But what if our device doesn’t have Impact on it? (iPhones don’t, for instance.) |
example-03/collagen.json
produces the following meme:
If you’re on a device that doesn’t include the Impact font (which includes iPhones, for one), you might wonder what magic occurred that made the bottom pane’s font show up correctly — as Impact and not, say, Times New Roman.
After all, if the specified font-face
is not available — and Impact is not available on iPhones — the browser will fall back to another font.
So, for maximum portability, Collagen allows embedding fonts in SVGs — that’s how we got Impact to show up on devices that don’t have the font natively.
Of course, if you stick to web-safe fonts or you know that the recipient has all the fonts you want to use, then you can just refer to the fonts by name and they’ll show up correctly.
But if you want to use fonts that aren’t on the receiving device, then you can still get a portable file by embedding the font in the SVG.
For reference, here’s the file above but without the font embedded.
{
"attrs": { "viewBox": "0 0 800 650" },
"children": [ (1)
{
"image_path": "./drake-small.jpg",
"attrs": {
"width": 800
}
},
{
"vars": {
"x": 550,
"dy": 50
},
"tag": "text",
"attrs": {
"font-family": "Impact",
"font-size": 50,
"color": "black",
"text-anchor": "middle",
"vertical-align": "top",
"x": "{x}",
"y": 420
},
"children": [
{
"tag": "tspan",
"text": "Using SVG-based text,",
"attrs": {
"x": "{x}",
"dy": 0
}
},
{
"tag": "tspan",
"text": "which is infinitely",
"attrs": {
"x": "{x}",
"dy": "{dy}"
}
},
{
"tag": "tspan",
"text": "zoomable and has",
"attrs": {
"x": "{x}",
"dy": "{dy}"
}
},
{
"tag": "tspan",
"text": "no artifacts",
"attrs": {
"x": "{x}",
"dy": "{dy}"
}
}
]
}
]
}
1 | This time, we didn’t embed Impact. |
Now, if you view the result in a desktop browser, it should look the same as above, but on a mobile device the font in the bottom pane might be Times New Roman (or some other fallback font) instead of Impact.
So it’s nice to be able to embed fonts (although it’s not great for the resulting file size…).
Wait, so all this does is base64 encode assets and put them in an SVG with other SVG elements?
It adds some additional features, such as nesting of skeletons and the use of tag-wide variables and interpolation of these variables in attributes. But yes, for the most part, all this project does is allow raster images to coexist with each other and with vector graphics. If you need to embed fonts in an SVG, Collagen lets you do that, too.
Couldn’t I just do the base64 encoding and create the SVG myself?
Yes. All Collagen does it automate this.
I want to put some text on a JPEG. What’s so bad about just opening an image editor, adding the text, and pressing save?
The text will look bad because
It will no be longer an infinitely zoomable vector entity, but instead will have been rasterized, i.e., rendered onto a fixed pixel grid that is only finitely zoomable.
JPEG in particular is not optimized for text, so artifacts will be visible (see here or the Drake meme above).
I’m ok with text being rasterized. This means I can convert my JPEG to PNG and avoid #2 above, right?
Yes and no. While the text will look sort of ok (when not zoomed in), you now have the problem that your JPEG is being stored as a PNG. Chances are that this will cause the resulting file size to explode because PNG is simply not meant to store the kind of images that JPEG is meant to store. For instance, the JPEG below (source) is 57KB, whereas the PNG is 434KB.
But surely just placing black text on top of an all-white PNG is fine? Because it’s stored losslessly?
Sure, if you don’t mind your text being rasterized, i.e., not perfectly precise and infinitely zoomable. The image below is black text on a white background.
You don’t have to zoom in very far to see the text get fuzzy. And if this image undergoes additional rounds of editing and compression, this problem will only get worse. In contrast, the text in the smiley-face image above (and, naturally, the text on this webpage) is perfectly precise and will retain all of its detail at arbitrary magnification.
s/protein/tool/;s/body/images/