Zyx is an open-source tensor library designed to be a zero-cost abstraction and to provide many compile-time guarantees.
From the user's perspective it works much like PyTorch, and most function names are the same, so you can pick up zyx quickly if you are already familiar with PyTorch.
We want to provide automatic differentiation and backpropagation for any datatype, whether scalars, arrays, matrices, or tensors. Zyx aims to be a zero-cost abstraction and to use plain Rust syntax for this autodiff and backprop.
By passing a datatype into the .with_grad() function you create a Variable. A Variable stores your datatype and adds a gradient to it; the gradient has the same type as your datatype. Access to the gradient is managed through UnsafeCell, because gradients must be accessed from different places and we saw significant performance improvements over RefCell in certain benchmarks.
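Conceptually, a Variable is just your data paired with a gradient of the same type behind an UnsafeCell. The sketch below illustrates the idea only; it is not zyx's actual definition, and the field names are hypothetical:

```rust
use core::cell::UnsafeCell;

// Minimal sketch of the idea behind Variable; the real type in zyx has more
// trait bounds and methods.
pub struct Variable<S> {
    data: S,             // your datatype (scalar, array, matrix, tensor, ...)
    grad: UnsafeCell<S>, // gradient of the same type, mutated during backprop
}
```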
A Tensor is the result of a mathematical or other operation performed on a Variable. Tensors build the graph needed for backpropagation at compile time.
Tensors are immutable and all operations are executed eagerly.
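To give a feel for how the graph lives in the type system, here is a rough sketch with hypothetical type names (zyx's actual types differ): every operation returns a Tensor whose second type parameter records the operation that produced it, so nesting operations nests the types.

```rust
// Hypothetical sketch of a compile-time graph node; not zyx's real definitions.
pub struct Tensor<S, G> {
    data: S,    // result of the eagerly executed operation
    grad_fn: G, // graph node describing how to backpropagate through that operation
}

// Applying e.g. ReLU to a value with graph `G` would yield Tensor<_, ReLUBackward<G>>,
// so the type of the final loss spells out the whole graph.
pub struct ReLUBackward<G> {
    input: G,
}
```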
TL;DR: by zero-cost abstraction we mean zero dyn, zero Rc, zero RefCell, and a minimal number of branches.
The syntax you will be using as a user is very close to PyTorch. Although the graph is created at compile time, it behaves completely dynamically (i.e. RNNs are easy); you don't need any graph.compile or graph.execute calls. Buffer, Tensor and Variable are immutable.
Thanks to the typestate API there are ZERO runtime errors when running on the CPU. Well, technically you can still run out of RAM...
If you have a supported IDE, you can look at your whole graph just by hovering over your loss variable and inspecting its type. The second generic parameter of Tensor represents the graph.
GPU acceleration uses OpenCL through the ocl crate.
The current architecture makes it easy to add other accelerators should the need arise: new accelerators can be added gradually, and the number of required operations is low.
You can also turn custom datatypes into tensors by calling .with_grad(); they will run on the CPU.
Convolution is not currently possible on stable Rust, because we need generic const expressions to calculate the output shape.
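To see why, here is a sketch (not zyx code) of a 1D "valid" convolution whose output length `N - K + 1` has to appear in the return type. This only compiles on nightly with the `generic_const_exprs` feature:

```rust
#![feature(generic_const_exprs)] // nightly-only; this is what stable Rust is missing

// Sketch only, not part of zyx. Assumes K <= N, otherwise the shape expression underflows.
fn conv1d_valid<const N: usize, const K: usize>(
    input: [f32; N],
    kernel: [f32; K],
) -> [f32; N - K + 1]
where
    [(); N - K + 1]:, // the output shape must be evaluated at compile time
{
    let mut out = [0f32; N - K + 1];
    for i in 0..out.len() {
        let mut acc = 0f32;
        for j in 0..K {
            acc += input[i + j] * kernel[j];
        }
        out[i] = acc;
    }
    out
}
```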
For examples of linear neural networks, look at the examples directory.
If you want to accelerate matrix multiplication using the matrixmultiply crate, use --features=matrimultiply.
```rust
use zyx::prelude::*;
use zyx::device::cpu; // If you want this to run on GPU, just use zyx::device::opencl;
use zyx::tensor::Variable;
use zyx::shape::{Sh4, Ax4};

let device = cpu::Device::default();

// NOTE: the type parameters, initializers and summed axes below were truncated
// in the original snippet. The buffer type, the 4D shapes, the `device.randn()`
// call and the `Ax4<0, 1, 2, 3>` axes are illustrative stand-ins, not the exact
// original code.
let x: Variable<cpu::Buffer<'_, f32, Sh4<2, 3, 2, 3>>> = device.randn().with_grad();
let y: Variable<cpu::Buffer<'_, f32, Sh4<2, 3, 3, 4>>> = device.randn().with_grad();

let z = x.matmul(&y).sum::<Ax4<0, 1, 2, 3>>();

z.backward();

println!("{}", x.grad());
println!("{}", y.grad());
```
Want to use scalars? Just give them gradients!
```rust
use zyx::prelude::*;

let x = 3f32.with_grad();
let y = 5.;
let z = (&x + y).relu();
z.backward();
println!("{}", x.grad());
```
Zyx is available on crates.io: https://crates.io/crates/zyx
Many features are not yet implemented and many tests are missing, so zyx cannot be considered stable yet. We would appreciate help writing more thorough tests, as well as feature requests so that we know where to direct our focus. That said, we have no plans to significantly change APIs that have already been written.
We would advise you to first look at the ops module, where all the basic operations are defined. Any device that implements these operations automatically implements the operations in tensor::ops, which perform the automatic gradient calculations. Tensors can use optimizers to update their values using the calculated gradients. Many other functors, such as losses and other higher-level functions, live in the nn module.
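As a rough illustration of that layering (the trait names and the hidden_layer helper below are hypothetical, chosen only to show the pattern; the real definitions live in the ops and tensor::ops modules):

```rust
// Hypothetical sketch of the pattern only; zyx's real traits are in `zyx::ops`.

// Primitive operations a new accelerator's buffer type has to provide.
pub trait ReLU {
    type Output;
    fn relu(self) -> Self::Output;
}

pub trait MatMul<Rhs> {
    type Output;
    fn matmul(self, rhs: Rhs) -> Self::Output;
}

// Higher-level, gradient-aware operations can then be written once, generically
// over any buffer that implements the primitives, so adding a new device means
// implementing only the small primitive set for its own buffer type.
pub fn hidden_layer<X, W>(x: X, w: W) -> <X::Output as ReLU>::Output
where
    X: MatMul<W>,
    X::Output: ReLU,
{
    x.matmul(w).relu()
}
```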
Any opinions, issue reports, feature requests as well as code contributions are very welcome.