Using an AI model that can interpolate frames, we can make our videos smoother.
This project is aimed to automate the process.
⚠️ You'll need ffmpeg and ffprobe added to your PATH. Windows users, you may need to add them to your path manually.
Install rust
Clone the repo and build the project:
cargo b --release
First we need to set a few environment variables (use the correct path):
- INTERPOLATION_MODEL
- The path to the model folder
- INTERPOLATION_BINARY
- The path to the interpolation binary (which is a file!)
As an example, assuming the ai model and binary are in your current directory:
export INTERPOLATION_MODEL=models/rife-v4.6
export INTERPOLATION_BINARY=rife-ncnn-vulkan
RUST_LOG=INFO cargo run --release -- -t /tmp/more_fps/ input.mkv output.mkv
TODO
```
Usage: more-fps [OPTIONS] -t
Arguments: Path to the file for which we'll increase the frame rate
[env: INTERPOLATION_BINARY=]
Options:
--fps
[default: sixty]
Possible values:
- sixty: 60 fps
-t
-m
[default: 50]
--interpolation-args <INTERPOLATION_ARGS>
Extra args you may want to pass to the interpolation binary
[default: "-g 0,-1 -j 8:8,16:32:16"]
-r
[default: everything]
Possible values:
- everything: Delete the entire temp_directory which contains a few building blocks: "ffmpeg" - used for storing extracted frames "interpolation" - used for storing interpolated frames "scene_data.txt" - holds scene timestamps
- nothing: Nothing will be deleted... meaning we try to continue from where we left off
-s
[default: .2]
--crf <CRF>
https://trac.ffmpeg.org/wiki/Encode/H.264#a1.ChooseaCRFvalue
[default: 18]
-h, --help Print help (see a summary with '-h')
```
-r
) to nothing
. With this option set to nothing
, we will simply continue extracting from where we left off last time.If two consecutive frames are different enough, the model could generate a very bad intermediate frame. To avoid this, we use ffmpeg to identify the timestamps which have large-enough scene differences. You can change this scene value with the -s
option (see help text above):
https://www.ffmpeg.org/ffmpeg-filters.html#select_002c-aselect
We then use these timestamps as start/end times when extracting frames.
⚠️ If you notice these bad frames, I recommend decreasing the scene cut threshold with the -s
option.
We want to extract the frames from our original video file, but we don't want to extract all of the frames in one shot for two reasons: 1. As mentioned above, the AI model could be generating bad intermediate frames. 1. Extracting all of frames could require hundreds of gigabytes of data which should exceed the capacity of your disk.
So instead of extracting all of the frames, we extract according to the intervals given by the timestamps.
These time intervals may still be very large... To avoid hitting the capacity of the disk, I set a default MAX_STEP_SIZE. This option is used to set a limit on the number of seconds a frame extraction will use at a time.
This is where the AI model is used. We need a model that supports the "-n" option mentioned above so we can tell the model how many frames to generate per frame extraction.
The simplest example is if we have 1 second (which is usually 23.998 aka 24 fps), the AI model will be told to generate 60 frames. Not all scene cuts are this nice, so decimals are involved...
⚠️ I assume you have one CPU and GPU you want to use... If this is not the case, feel free to change the --interpolation-args
option
Now that we have the AI generated frames, we use ffmpeg to generate the video.
After we're done extracting frames, we will copy the audio + subtitles from the original file to our output file.
If you're looking to support me, you can send any amount of Monero:
42NTT1Q91P2TcG7vzN3oi2cYEJEaJ6QbzH3pwXGDKiSQiaBVAeZkYnBX6SijCxgKpc5tTUeVW5AuwDWBNdewZia9AJ5TgLT
cargo install