This crate helps you retrieve data files from the Internet. Files are only downloaded when needed. The contents of the files are always verified via a SHA256 hash.
The ability to download data files makes creating sample code easier. For example, here we download a genomics file and print its size:
```rust use fetchhash::samplefile;
let path = sample_file("small.fam")?; println!("{}", std::fs::metadata(path)?.len()); // Prints 85
```
ureq
to download files via blocking I/O.Create a registry.txt
file containing a whitespace-delimited list of files
and their hashes. (This is the same format as Pooch). You can put this file anywhere in your project. I put
it in tests/registry.txt
and I put data files in tests/data
.
See Registry Creation for more information.
In your code, create a global static FetchHash
instance that
reads your registry.txt
file. You should also give it:
a qualifier, organization, and application used to create a local data directory when the environment variable is not set.
Define a public sample_file
function that takes a file name and returns a Result
containing the path to the file.
```rust use fetch_hash::{ctor, FetchHash, FetchHashError}; use std::path::{Path, PathBuf};
static STATICFETCHHASH: FetchHash = FetchHash::new( includestr!("../tests/registry.txt"), "https://raw.githubusercontent.com/CarlKCarlK/fetch-hash/main/tests/data/", "BARAPPDATADIR", "com", "Foo Corp", "Bar App", );
pub fn samplefile
```
You and your users can now use your sample_file
function to download your files as needed.
Here is one suggested method for creating a registry.txt
file.
tests/data
, so they uploaded to this GitHub folder. In GitHub, by looking at the raw view of a data file, I see the root URL for these files.FetchHash
instance without registry contentsgen_registry_contents
on your list of files. It will download
the files, compute their hashes, and create a string of file names and hashes.registry.txt
.```rust use fetchhash::{FetchHash, dirtofilelist};
let fetchhash = FetchHash::new( "", "https://raw.githubusercontent.com/CarlKCarlK/fetch-hash/main/tests/data/", "BARAPPDATADIR", "com", "Foo Corp", "Bar App", ); let filelist = dirtofilelist("tests/data")?; let registrycontents = fetchhash.genregistrycontents(filelist)?; println!("{registrycontents}");
```
sample_file
. Define your own sample_file
that
knows about your data files.FetchHash
instance global and static.FetchHash
instance. You can instead use utility functions such as fetch
.