A Fedora COPR repository for Audio creation
From wikipedia’s article (https://en.wikipedia.org/wiki/FFmpeg):
FFmpeg is a free and open-source software project consisting of a large suite of libraries and programs for handling video, audio, and other multimedia files and streams. At its core is the FFmpeg program itself, designed for command-line-based processing of video and audio files. It is widely used for format transcoding, basic editing (trimming and concatenation), video scaling, video post-production effects and standards compliance (SMPTE, ITU).
Fixing a file (ffmpeg is highly robust wrt faulty media files):
$ ffmpeg -i broken_song.wav -c copy happy_song.wav
Transcoding a format (here, we select h264 for video and aac for audio, as required for mp4 format):
$ ffmpeg -i master_file.mkv -c:v h264 -c:a aac access_file.mp4
To specific duration or part of a media, the following flags are quite useful: -t: specify a duration in second -ss: specify a starting instant (in the format hh:mm:ss) -to: specify and ending instant (in the format hh:mm:ss)
ffmpeg has also a huge “unreconginzed” function: complex filtering. This function allows to apply some ffmpeg plugin to various media stream (audio, video). You can apply several plugins linked together using a “graph” representation. To get some informations about ffmpeg filtering, you can reach the ffmpeg documentation website or you can get this information via the “ffmpeg-filter” man page (https://www.ffmpeg.org/ffmpeg-filters.html):
$ man ffmpeg-filters
You can also access to a specific plugin help using the following command:
$ ffmpeg -hide_banner -h filter=amerge
Information related to the inputs used:
$ ffmpeg -i 01_au_lecteur.mp3
ffmpeg version 4.3.2 Copyright (c) 2000-2021 the FFmpeg developers
...
Input #0, mp3, from '01_au_lecteur.mp3':
Duration: 00:02:39.92, start: 0.025057, bitrate: 163 kb/s
Stream #0:0: Audio: mp3, 44100 Hz, stereo, fltp, 163 kb/s
Metadata:
encoder : LAME3.100
Side data:
replaygain: track gain - -1.200000, track peak - unknown, album gain - unknown, album peak - unknown,
$ ffmpeg -i BlackRaven.jpg
ffmpeg version 4.3.2 Copyright (c) 2000-2021 the FFmpeg developers
...
Input #0, image2, from 'BlackRaven.jpg':
Duration: 00:00:00.04, start: 0.000000, bitrate: 61800 kb/s
Stream #0:0: Video: mjpeg (Baseline), yuvj420p(pc, bt470bg/unknown/unknown), 1600x1064 [SAR 300:300 DAR 200:133], 25 tbr, 25 tbn, 25 tbc
$ ffmpeg -i TheBarberOfSeville1944.ogv
ffmpeg version 4.3.2 Copyright (c) 2000-2021 the FFmpeg developers
...
Input #0, ogg, from 'TheBarberOfSeville1944.ogv':
Duration: 00:06:55.34, start: 0.000000, bitrate: 290 kb/s
Stream #0:0: Video: theora, yuv420p, 400x300 [SAR 1:1 DAR 4:3], 59.94 fps, 59.94 tbr, 59.94 tbn, 59.94 tbc
Metadata:
encoder : Lavc57.22.100 libtheora
TITLE : The Barber of Seville (1944)
LOCATION : https://archive.org/details/TheBarberOfSeville
Stream #0:1: Audio: vorbis, 44100 Hz, stereo, fltp, 128 kb/s
Metadata:
encoder : Lavc57.22.100 libvorbis
TITLE : The Barber of Seville (1944)
LOCATION : https://archive.org/details/TheBarberOfSeville
Let’s first look at the audio manipulation part. Noise reduction using afftdn!
$ ffmpeg -i 01_au_lecteur.mp3 -af "afftdn=nf=-25" output.mp3
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "afftdn=nf=-25" output.mp3
Convert merge the original and the modified file into a stereo stream to compare the effect:
$ ffmpeg -i 01_au_lecteur.mp3 -i output.mp3 -filter_complex "[0:a]pan=mono|c0=FL[left]; [1:a]pan=mono|c0=FR[right]; [left][right]join=inputs=2:channel_layout=stereo[out]" -map "[out]" output_stereo.mp3
For this command line, the effects connexions can be represented as a graph:
[0:a] +--+ pan=mono|c0=FL +--+ [left] +
\
+--+ join=inputs=2:channel_layout=stereo +--+ [out]
/
[1:a] +--+ pan=mono|c0=FR +--+ [right] +
Filters in the same linear chain are separated by commas, and distinct linear chains of filters are separated by semicolons. Here, we just have 2 separate linear chains:
[0:a]pan=mono | c0=FL[left] |
There are some examples on how to convert to various stereo format on the ffmpeg wiki: https://trac.ffmpeg.org/wiki/AudioChannelManipulation
Audio filtering (combining low and high pass filters and a volume plugin):
$ ffmpeg -i 01_au_lecteur.mp3 -af "lowpass=f=300, highpass=f=3000, volume=4" output.mp3
Here, we have only one linear chain but with 3 elemets:
[0:a] +--+ lowpass=f=300 +--+ highpass=f=3000 +--+ volume=4 +--+ output
No need to add a label between each elements of a same linear chain.
Using several parameters from the lowpass filter. The help of the lowpass filter:
$ ffmpeg -hide_banner -h filter=lowpass
There is a huge list of parameters available. To specify several parameters, you must separate the parameters by a ‘:’ or a ‘|’ character. The ‘|’ parameter separator is an old style format and is deprecated. But some plugins still require to use this separator.
$ ffmpeg -i 01_au_lecteur.mp3 -filter:a "lowpass=f=300:t=h:w=300" output.mp3
If we start working with some complex audio flow, we must switch from -af to -filter_complex.
Using -filter_complex, we must specify inputs for each filter.
The input format is [A:B] with A the file ID (with 0 corresponding to the first -i
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "[0:0]adelay=0|500[a1];[0:0]adelay=1500|0[a2];[a1][a2]amix=inputs=2" output.mp3
+--+ adelay=0|500 +--+ a1 +--+
/ \
01_au_lecteur.mp3 +--+ +--+ amix(a1, a2) +--+ output.mp3
\ /
+--+ adelaut=1500|0 +--+ a2 +--+
Example of using a chorus:
The chorus accepts the following parameters:
$ ffmpeg -hide_banner -h filter=chorus
in_gain: Set input gain. Default is 0.4. out_gain: Set output gain. Default is 0.4. delays: Set delays. A typical delay is around 40ms to 60ms. decays: Set decays. speeds: Set speeds. depths: Set depths.
A single delay:
$ ffmpeg -i 01_au_lecteur.mp3 -filter:a "chorus=in_gain=0.7:out_gain=0.9:delays=55:decays=0.4:speeds=0.25:depths=2" output.mp3
If the parameters are passed by position (you specify all the listed parameters), you can omit the parameter name:
$ ffmpeg -i 01_au_lecteur.mp3 -filter:a "chorus=0.7:0.9:55:0.4:0.25:2" output.mp3
in_gain: 0.7 out_gain: 0.9 delays: 55 decays: 0.4 speeds: 0.25 depths: 2
Two delays: You can pass a list of values for some parameters which will act as duplicating the chorus with various parameters value:
$ ffmpeg -i 01_au_lecteur.mp3 -filter:a "chorus=0.6:0.9:50|60:0.4|0.32:0.25|0.4:2|1.3" output.mp3
in_gain: 0.6 out_gain: 0.9 delays: 50 + 60 decays: 0.4 + 0.32 speeds: 0.25 + 0.4 depths: 2 + 1.3
Fuller sounding chorus with three delays:
$ ffmpeg -i 01_au_lecteur.mp3 -filter:a "chorus=0.5:0.9:50|60|40:0.4|0.32|0.3:0.25|0.4|0.3:2|2.3|1.3" output.mp3
in_gain: 0.5 out_gain: 0.9 delays: 50 + 60 + 40 decays: 0.4 + 0.32 + 0.3 speeds: 0.25 + 0.4 + 0.3 depths: 2 + 2.3 + 1.3
Cut (trim) and crossfade 2 subsamples:
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "[0:0]atrim=start=0:end=3[a1], [0:0]atrim=start=10:end=13[a2], [a1][a2]acrossfade=d=1.0" output.mp3
And add a vibrato or a tremolo to the mix:
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "[0:0]atrim=start=0:end=3[a1], [0:0]atrim=start=10:end=13[a2], [a1][a2]acrossfade=d=1.0, vibrato=f=5:d=0.75" output.mp3
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "[0:0]atrim=start=0:end=3[a1], [0:0]atrim=start=10:end=13[a2], [a1][a2]acrossfade=d=1.0, tremolo=f=1:d=0.75" output.mp3
And add a sine source to the mix:
FFmpeg has some audio sources you can use in your filtering graph. Here, we will use a sine source.
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "[0:0]atrim=start=0:end=3[a1]; [0:0]atrim=start=10:end=13[a2]; sine=f=220:d=6,volume=3[a3]; [a1][a2]acrossfade=d=1.0, tremolo=f=1:d=0.75[a4], [a3][a4]amix" output.mp3
[0:0] +--+ atrim=start=0:end=3 +--+ [a1] +--+
\ \
\ +--+ acrossfade=d=1.0 +--+ tremolo=f=1:d=0.75 +--+ [a4] +--+
\ / \
+ atrim=start=10:end=13 +--+ [a2] +--+ +--+ amix +--+ output
/
sine=f=220:d=6,volume=3 +--+ [a3] +--+
acompressor acontrast acopy acrossfade acrossover acrusher acue adeclick adeclip adelay adenorm aderivative, aintegral aecho aemphasis aeval aexciter afade afftdn afftfilt afir aformat afreqshift agate aiir alimiter allpass aloop amerge amix amultiply anequalizer anlmdn anlms anull apad aphaser aphaseshift apulsator aresample areverse arnndn asetnsamples asetrate ashowinfo asoftclip asr astats asubboost asubcut asupercut asuperpass asuperstop atempo atrim axcorrelate bandpass bandreject bass, lowshelf biquad bs2b channelmap channelsplit chorus compand compensationdelay crossfeed crystalizer dcshift deesser drmeter dynaudnorm earwax equalizer extrastereo firequalizer flanger haas hdcd headphone highpass join ladspa loudnorm lowpass lv2 mcompand pan replaygain resample rubberband sidechaincompress sidechaingate silencedetect silenceremove sofalizer speechnorm stereotools stereowiden superequalizer surround treble, highshelf tremolo vibrato volume volumedetect
abuffer aevalsrc afirsrc anullsrc flite anoisesrc hilbert sinc sine
abuffersink anullsink
To test various examples, we will use a video clip downloaded from Archive.org (https://archive.org/details/TheBarberOfSeville).
As with the audio part, ffmpeg provides some video sources. An example of such a video source: a video test pattern:
$ ffmpeg -t 5 -f lavfi -i testsrc=500x500 output.mp4
The first thing we can add to this video is a text part:
$ ffmpeg -hide_banner -h filter=drawtext
$ ffmpeg -i TheBarberOfSeville1944.ogv -t 4 -filter_complex "drawtext=text='This is a movie':fontcolor=red:fontsize=32:x=W/2:y=H/2" output.mp4
We can display the audio spectrum associated to the audio stream of the movie:
$ ffmpeg -hide_banner -h filter=showspectrum
$ ffmpeg -i TheBarberOfSeville1944.ogv -t 4 -filter_complex "showspectrum=size=400x300:mode=separate:color=intensity:slide=1:scale=log" output.mp4
We can mix the original clip with the spectrum computed by ffmpeg by changing the value of the alpha channel of the showspectrum filter:
$ ffmpeg -hide_banner -h filter=overlay
$ ffmpeg -i TheBarberOfSeville1944.ogv -t 4 -filter_complex "showspectrum=size=400x300:mode=separate:color=intensity:slide=1:scale=log,format=rgba,colorchannelmixer=aa=0.5[out1];drawtext=text='This is a movie':fontcolor=red:fontsize=32:x=W/2:y=H/2[out2];[out2][out1]overlay" output.mp4
There are several other possibilities for audio representation like showwaves, showspectrum of avectorscope. Here are some example of use:
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "[0:a]showwaves=s=1280x720:mode=line:rate=25,format=yuv420p[v]" -map "[v]" -map "0:a" output.mp4
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex "showspectrum=mode=separate:color=intensity:slide=1:scale=cbrt" -y -acodec copy output.mp4
$ ffmpeg -i 01_au_lecteur.mp3 -filter_complex avectorscope=s=320x240 -y -acodec copy output.mp4
We can also use images as a ffmpeg source. Here, with display a still image (a raven) and via a change of alpha value, we overlay a spectrum representation of the audio:
$ ffmpeg -t 5 -i 01_au_lecteur.mp3 -i BlackRaven.jpg -filter_complex "[0]showspectrum=color=channel:scale=cbrt:orientation=vertical:overlap=1:s=2048x1024[out0];[1]crop=w=1024:h=1024:[out1];[out0][out1]overlay=x=1024:alpha=0.25" output.mp4
ffmpeg has also a mandelbrot video source:
$ ffmpeg -t 10 -filter_complex "mandelbrot" output.mp4
This source can be used with ffplay. ffplay allows to read and display a video file / stream:
$ ffplay -f lavfi -i mandelbrot
And example of use of ffplay with a complex filter (here, the equivalent of -filter_complex option from ffmpeg is -vf):
$ ffplay TheBarberOfSeville1944.ogv -vf "split=2[m][v], [v]vectorscope=b=0.7:m=color3:g=green[v],[m][v]overlay=x=W-w:y=H-h"
We display a movie and a still image each of which displayed on half a screen. The hstack / vstack seem to be sensitive to fps. So, the first thing we do is to set a fps for each stream. Then, we rescale the image / video using the ‘scale’ filter and then ‘crop’ one half. Everything is then put on the screen using ‘hstack’.
$ ffmpeg -ss 0 -t 5 \
-i TheBarberOfSeville1944.ogv \
-i BlackRaven.jpg \
-filter_complex \
"[0]fps=fps=20, scale=h=1024:w=1024,crop=iw/2:ih:0:0[left]; \
[1]fps=fps=20, scale=h=1024:w=1024,crop=iw/2:ih:iw/2:0[right]; \
[left][right]hstack[out]" \
-map "[out]" \
output.mp4
$ ffmpeg -ss 0 -t 5 \
-i TheBarberOfSeville1944.ogv \
-i BlackRaven.jpg \
-filter_complex \
"[1:v]fps=fps=20,scale=400:-1[bottom];\
[0:v]fps=fps=20[top];\
[top][bottom]vstack" \
output.mp4
$ ffmpeg -hide_banner -filters
T.. = Timeline support
.S. = Slice threading
..C = Command support
A = Audio input/output
V = Video input/output
N = Dynamic number and/or type of input/output
| = Source or sink filter
... abench A->A Benchmark part of a filtergraph.
..C acompressor A->A Audio compressor.
... acontrast A->A Simple audio dynamic range compression/expansion filter.
... acopy A->A Copy the input audio unchanged to the output.
... acue A->A Delay filtering to match a cue.
... acrossfade AA->A Cross fade two input audio streams.
.S. acrossover A->N Split audio into per-bands streams.
... acrusher A->A Reduce audio bit resolution.
TS. adeclick A->A Remove impulsive noise from input audio.
TS. adeclip A->A Remove clipping from input audio.
T.. adelay A->A Delay one or more audio channels.
... aderivative A->A Compute derivative of input audio.
... aecho A->A Add echoing to the audio.
... aemphasis A->A Audio emphasis.
... aeval A->A Filter audio signal according to a specified expression.
T.. afade A->A Fade in/out input audio.
TSC afftdn A->A Denoise audio samples using FFT.
... afftfilt A->A Apply arbitrary expressions to samples in frequency domain.
.SC afir N->N Apply Finite Impulse Response filter with supplied coefficients in additional stream(s).
... aformat A->A Convert the input audio to one of the specified formats.
... agate A->A Audio gate.
.S. aiir A->N Apply Infinite Impulse Response filter with supplied coefficients.
... aintegral A->A Compute integral of input audio.
... ainterleave N->A Temporally interleave audio inputs.
... alimiter A->A Audio lookahead limiter.
TSC allpass A->A Apply a two-pole all-pass filter.
... aloop A->A Loop audio samples.
... amerge N->A Merge two or more audio streams into a single multi-channel stream.
T.. ametadata A->A Manipulate audio frame metadata.
..C amix N->A Audio mixing.
... amultiply AA->A Multiply two audio streams.
..C anequalizer A->N Apply high-order audio parametric multi band equalizer.
TSC anlmdn A->A Reduce broadband noise from stream using Non-Local Means.
.SC anlms AA->A Apply Normalized Least-Mean-Squares algorithm to first audio stream.
... anull A->A Pass the source unchanged to the output.
T.. apad A->A Pad audio with silence.
... aperms A->A Set permissions for the output audio frame.
... aphaser A->A Add a phasing effect to the audio.
... apulsator A->A Audio pulsator.
... arealtime A->A Slow down filtering to match realtime.
... aresample A->A Resample audio data.
... areverse A->A Reverse an audio clip.
.S. arnndn A->A Reduce noise from speech using Recurrent Neural Networks.
... aselect A->N Select audio frames to pass in output.
... asendcmd A->A Send commands to filters.
... asetnsamples A->A Set the number of samples for each output audio frames.
... asetpts A->A Set PTS for the output audio frame.
... asetrate A->A Change the sample rate without altering the data.
... asettb A->A Set timebase for the audio output link.
... ashowinfo A->A Show textual information for each audio frame.
T.. asidedata A->A Manipulate audio frame side data.
TSC asoftclip A->A Audio Soft Clipper.
... asplit A->N Pass on the audio input to N audio outputs.
.S. astats A->A Show time domain statistics about audio frames.
..C astreamselect N->N Select audio streams
..C asubboost A->A Boost subwoofer frequencies.
..C atempo A->A Adjust audio tempo.
... atrim A->A Pick one continuous section from the input, drop the rest.
... axcorrelate AA->A Cross-correlate two audio streams.
TSC bandpass A->A Apply a two-pole Butterworth band-pass filter.
TSC bandreject A->A Apply a two-pole Butterworth band-reject filter.
TSC bass A->A Boost or cut lower frequencies.
TSC biquad A->A Apply a biquad IIR filter with the given coefficients.
... chromaber_vulkan V->V Offset chroma of input video (chromatic aberration)
... channelmap A->A Remap audio channels.
... channelsplit A->N Split audio into per-channel streams.
... chorus A->A Add a chorus effect to the audio.
... compand A->A Compress or expand audio dynamic range.
... compensationdelay A->A Audio Compensation Delay Line.
T.C crossfeed A->A Apply headphone crossfeed filter.
TSC crystalizer A->A Simple expand audio dynamic range filter.
T.. dcshift A->A Apply a DC shift to the audio.
T.. deesser A->A Apply de-essing to the audio.
... drmeter A->A Measure audio dynamic range.
T.C dynaudnorm A->A Dynamic Audio Normalizer.
... earwax A->A Widen the stereo image.
... ebur128 A->N EBU R128 scanner.
TSC equalizer A->A Apply two-pole peaking equalization (EQ) filter.
T.C extrastereo A->A Increase difference between stereo audio channels.
..C firequalizer A->A Finite Impulse Response Equalizer.
... flanger A->A Apply a flanging effect to the audio.
... haas A->A Apply Haas Stereo Enhancer.
... hdcd A->A Apply High Definition Compatible Digital (HDCD) decoding.
.S. headphone N->A Apply headphone binaural spatialization with HRTFs in additional streams.
TSC highpass A->A Apply a high-pass filter with 3dB point frequency.
TSC highshelf A->A Apply a high shelf filter.
... join N->A Join multiple audio streams into multi-channel output.
..C ladspa N->A Apply LADSPA effect.
... loudnorm A->A EBU R128 loudness normalization
TSC lowpass A->A Apply a low-pass filter with 3dB point frequency.
TSC lowshelf A->A Apply a low shelf filter.
... lv2 N->A Apply LV2 effect.
... mcompand A->A Multiband Compress or expand audio dynamic range.
... pan A->A Remix channels with coefficients (panning).
... replaygain A->A ReplayGain scanner.
... resample A->A Audio resampling and conversion.
..C sidechaincompress AA->A Sidechain compressor.
... sidechaingate AA->A Audio sidechain gate.
... silencedetect A->A Detect silence.
... silenceremove A->A Remove silence.
.S. sofalizer A->A SOFAlizer (Spatially Oriented Format for Acoustics).
... stereotools A->A Apply various stereo tools.
T.C stereowiden A->A Apply stereo widening effect.
... superequalizer A->A Apply 18 band equalization filter.
.S. surround A->A Apply audio surround upmix filter.
TSC treble A->A Boost or cut upper frequencies.
... tremolo A->A Apply tremolo effect.
... vibrato A->A Apply vibrato effect.
T.C volume A->A Change input volume.
... volumedetect A->A Detect audio volume.
... aevalsrc |->A Generate an audio signal generated by an expression.
... afirsrc |->A Generate a FIR coefficients audio stream.
... anoisesrc |->A Generate a noise audio signal.
... anullsrc |->A Null audio source, return empty audio frames.
... hilbert |->A Generate a Hilbert transform FIR coefficients.
... sinc |->A Generate a sinc kaiser-windowed low-pass, high-pass, band-pass, or band-reject FIR coefficients.
... sine |->A Generate sine wave audio signal.
... anullsink A->| Do absolutely nothing with the input audio.
... addroi V->V Add region of interest to frame.
... alphaextract V->N Extract an alpha channel as a grayscale image component.
T.. alphamerge VV->V Copy the luma value of the second input into the alpha channel of the first input.
TSC amplify V->V Amplify changes between successive video frames.
... ass V->V Render ASS subtitles onto input video using the libass library.
TSC atadenoise V->V Apply an Adaptive Temporal Averaging Denoiser.
TSC avgblur V->V Apply Average Blur filter.
... avgblur_opencl V->V Apply average blur filter
... avgblur_vulkan V->V Apply avgblur mask to input video
T.. bbox V->V Compute bounding box for each frame.
... bench V->V Benchmark part of a filtergraph.
T.. bilateral V->V Apply Bilateral filter.
T.. bitplanenoise V->V Measure bit plane noise.
... blackdetect V->V Detect video intervals that are (almost) black.
... blackframe V->V Detect frames that are (almost) black.
TS. blend VV->V Blend two video frames into each other.
TS. bm3d N->V Block-Matching 3D denoiser.
T.. boxblur V->V Blur the input.
... boxblur_opencl V->V Apply boxblur filter to input video
TS. bwdif V->V Deinterlace the input image.
TSC cas V->V Contrast Adaptive Sharpen.
TSC chromahold V->V Turns a certain color range into gray.
TSC chromakey V->V Turns a certain color into transparency. Operates on YUV colors.
TSC chromashift V->V Shift chroma.
... ciescope V->V Video CIE scope.
T.. codecview V->V Visualize information about some codecs.
TSC colorbalance V->V Adjust the color balance.
TSC colorchannelmixer V->V Adjust colors by mixing color channels.
TSC colorkey V->V Turns a certain color into transparency. Operates on RGB colors.
... colorkey_opencl V->V Turns a certain color into transparency. Operates on RGB colors.
TSC colorhold V->V Turns a certain color range into gray. Operates on RGB colors.
TSC colorlevels V->V Adjust the color levels.
TS. colormatrix V->V Convert color matrix.
TS. colorspace V->V Convert between colorspaces.
TS. convolution V->V Apply convolution filter.
... convolution_opencl V->V Apply convolution mask to input video
TS. convolve VV->V Convolve first video stream with second video stream.
... copy V->V Copy the input video unchanged to the output.
... cover_rect V->V Find and cover a user specified object.
..C crop V->V Crop the input video.
T.. cropdetect V->V Auto-detect crop size.
... cue V->V Delay filtering to match a cue.
TS. curves V->V Adjust components curves.
.S. datascope V->V Video data analysis.
T.C dblur V->V Apply Directional Blur filter.
TS. dctdnoiz V->V Denoise frames using 2D DCT.
TS. deband V->V Debands video.
T.. deblock V->V Deblock video.
... decimate N->V Decimate frames (post field matching filter).
TS. deconvolve VV->V Deconvolve first video stream with second video stream.
TS. dedot V->V Reduce cross-luminance and cross-color.
TSC deflate V->V Apply deflate effect.
... deflicker V->V Remove temporal frame luminance variations.
... deinterlace_qsv V->V QuickSync video deinterlacing
... deinterlace_vaapi V->V Deinterlacing of VAAPI surfaces
... dejudder V->V Remove judder produced by pullup.
T.. delogo V->V Remove logo from input video.
... denoise_vaapi V->V VAAPI VPP for de-noise
T.. derain V->V Apply derain filter to the input.
... deshake V->V Stabilize shaky video.
... deshake_opencl V->V Feature-point based video stabilization filter
TS. despill V->V Despill video.
... detelecine V->V Apply an inverse telecine pattern.
TSC dilation V->V Apply dilation effect.
... dilation_opencl V->V Apply dilation effect
T.. displace VVV->V Displace pixels.
... dnn_processing V->V Apply DNN processing filter to the input.
... doubleweave V->V Weave input video fields into double number of frames.
T.C drawbox V->V Draw a colored box on the input video.
... drawgraph V->V Draw a graph using input video metadata.
T.C drawgrid V->V Draw a colored grid on the input video.
T.C drawtext V->V Draw text on top of video frames using libfreetype library.
T.. edgedetect V->V Detect and draw edge.
... elbg V->V Apply posterize effect, using the ELBG algorithm.
T.. entropy V->V Measure video frames entropy.
T.C eq V->V Adjust brightness, contrast, gamma, and saturation.
TSC erosion V->V Apply erosion effect.
... erosion_opencl V->V Apply erosion effect
... extractplanes V->N Extract planes as grayscale frames.
.S. fade V->V Fade in/out input video.
T.. fftdnoiz V->V Denoise frames using 3D FFT.
T.. fftfilt V->V Apply arbitrary expressions to pixels in frequency domain.
... field V->V Extract a field from the input video.
... fieldhint V->V Field matching using hints.
... fieldmatch N->V Field matching for inverse telecine.
T.. fieldorder V->V Set the field order.
T.C fillborders V->V Fill borders of the input video.
... find_rect V->V Find a user specified object.
T.. floodfill V->V Fill area with same color with another color.
... format V->V Convert the input video to one of the specified pixel formats.
... fps V->V Force constant framerate.
... framepack VV->V Generate a frame packed stereoscopic video.
.S. framerate V->V Upsamples or downsamples progressive source between specified frame rates.
T.. framestep V->V Select one frame every N frames.
... freezedetect V->V Detects frozen video input.
... freezeframes VV->V Freeze video frames.
... frei0r V->V Apply a frei0r effect.
T.. fspp V->V Apply Fast Simple Post-processing filter.
TSC gblur V->V Apply Gaussian Blur filter.
TS. geq V->V Apply generic equation to each pixel.
T.. gradfun V->V Debands video quickly using gradients.
... graphmonitor V->V Show various filtergraph stats.
TS. greyedge V->V Estimates scene illumination by grey edge assumption.
TS. haldclut VV->V Adjust colors using a Hald CLUT.
TS. hflip V->V Horizontally flip the input video.
T.. histeq V->V Apply global color histogram equalization.
... histogram V->V Compute and draw a histogram.
TSC hqdn3d V->V Apply a High Quality 3D Denoiser.
.S. hqx V->V Scale the input by 2, 3 or 4 using the hq*x magnification algorithm.
.S. hstack N->V Stack video inputs horizontally.
T.C hue V->V Adjust the hue and saturation of the input video.
... hwdownload V->V Download a hardware frame to a normal frame
... hwmap V->V Map hardware frames
... hwupload V->V Upload a normal frame to a hardware frame
... hwupload_cuda V->V Upload a system memory frame to a CUDA device.
T.. hysteresis VV->V Grow first stream into second stream by connecting components.
... idet V->V Interlace detect Filter.
T.C il V->V Deinterleave or interleave fields.
TSC inflate V->V Apply inflate effect.
... interlace V->V Convert progressive video into interlaced.
... interleave N->V Temporally interleave video inputs.
... kerndeint V->V Apply kernel deinterlacing to the input.
.S. lagfun V->V Slowly update darker pixels.
.S. lenscorrection V->V Rectify the image by correcting for lens distortion.
TS. lensfun V->V Apply correction to an image based on info derived from the lensfun database.
... libvmaf VV->V Calculate the VMAF between two video streams.
TS. limiter V->V Limit pixels components to the specified range.
... loop V->V Loop video frames.
TSC lumakey V->V Turns a certain luma into transparency.
TS. lut V->V Compute and apply a lookup table to the RGB/YUV input video.
TS. lut1d V->V Adjust colors using a 1D LUT.
TS. lut2 VV->V Compute and apply a lookup table from two video inputs.
TS. lut3d V->V Adjust colors using a 3D LUT.
TS. lutrgb V->V Compute and apply a lookup table to the RGB input video.
TS. lutyuv V->V Compute and apply a lookup table to the YUV input video.
TS. maskedclamp VVV->V Clamp first stream with second stream and third stream.
TS. maskedmax VVV->V Apply filtering with maximum difference of two streams.
TS. maskedmerge VVV->V Merge first stream with second stream using third stream as mask.
TS. maskedmin VVV->V Apply filtering with minimum difference of two streams.
TS. maskedthreshold VV->V Pick pixels comparing absolute difference of two streams with threshold.
TS. maskfun V->V Create Mask.
... mcdeint V->V Apply motion compensating deinterlacing.
TSC median V->V Apply Median filter.
... mergeplanes N->V Merge planes.
... mestimate V->V Generate motion vectors.
T.. metadata V->V Manipulate video frame metadata.
T.. midequalizer VV->V Apply Midway Equalization.
... minterpolate V->V Frame rate conversion using Motion Interpolation.
.S. mix N->V Mix video inputs.
... mpdecimate V->V Remove near-duplicate frames.
TS. negate V->V Negate input video.
TS. nlmeans V->V Non-local means denoiser.
... nlmeans_opencl V->V Non-local means denoiser through OpenCL
T.. nnedi V->V Apply neural network edge directed interpolation intra-only deinterlacer.
... noformat V->V Force libavfilter not to use any of the specified pixel formats for the input to the next filter.
TS. noise V->V Add noise.
T.C normalize V->V Normalize RGB video.
... null V->V Pass the source unchanged to the output.
T.C oscilloscope V->V 2D Video Oscilloscope.
TSC overlay VV->V Overlay a video source on top of the input.
... overlay_opencl VV->V Overlay one video on top of another
... overlay_qsv VV->V Quick Sync Video overlay.
... overlay_vulkan VV->V Overlay a source on top of another
T.. owdenoise V->V Denoise using wavelets.
... pad V->V Pad the input video.
... pad_opencl V->V Pad the input video.
... palettegen V->V Find the optimal palette for a given stream.
... paletteuse VV->V Use a palette to downsample an input video stream.
... perms V->V Set permissions for the output video frame.
TS. perspective V->V Correct the perspective of video.
T.. phase V->V Phase shift fields.
... photosensitivity V->V Filter out photosensitive epilepsy seizure-inducing flashes.
... pixdesctest V->V Test pixel format definitions.
T.. pixscope V->V Pixel data analysis.
T.C pp V->V Filter video using libpostproc.
T.. pp7 V->V Apply Postprocessing 7 filter.
TS. premultiply N->V PreMultiply first stream with first plane of second stream.
TS. prewitt V->V Apply prewitt operator.
... prewitt_opencl V->V Apply prewitt operator
... procamp_vaapi V->V ProcAmp (color balance) adjustments for hue, saturation, brightness, contrast
... program_opencl |->V Filter video using an OpenCL program
T.. pseudocolor V->V Make pseudocolored video frames.
... psnr VV->V Calculate the PSNR between two video streams.
... pullup V->V Pullup from field sequence to frames.
T.. qp V->V Change video quantization parameters.
... random V->V Return random frames.
T.. readeia608 V->V Read EIA-608 Closed Caption codes from input video and write them to frame metadata.
... readvitc V->V Read vertical interval timecode and write it to frame metadata.
... realtime V->V Slow down filtering to match realtime.
TS. remap VVV->V Remap pixels.
TS. removegrain V->V Remove grain.
T.. removelogo V->V Remove a TV logo based on a mask image.
... repeatfields V->V Hard repeat fields based on MPEG repeat field flag.
... reverse V->V Reverse a clip.
TSC rgbashift V->V Shift RGBA.
TS. roberts V->V Apply roberts cross operator.
... roberts_opencl V->V Apply roberts operator
TSC rotate V->V Rotate the input image.
T.. sab V->V Apply shape adaptive blur.
..C scale V->V Scale the input video size and/or convert the image format.
... scale_qsv V->V QuickSync video scaling and format conversion
... scale_vaapi V->V Scale to/from VAAPI surfaces.
... scale_vulkan V->V Scale Vulkan frames
..C scale2ref VV->VV Scale the input video size and/or convert the image format to the given reference.
... scdet V->V Detect video scene change
TSC scroll V->V Scroll input video.
... select V->N Select video frames to pass in output.
TS. selectivecolor V->V Apply CMYK adjustments to specific color ranges.
... sendcmd V->V Send commands to filters.
... separatefields V->V Split input video frames into fields.
... setdar V->V Set the frame display aspect ratio.
... setfield V->V Force field for the output video frame.
... setparams V->V Force field, or color property for the output video frame.
... setpts V->V Set PTS for the output video frame.
... setrange V->V Force color range for the output video frame.
... setsar V->V Set the pixel sample aspect ratio.
... settb V->V Set timebase for the video output link.
... sharpness_vaapi V->V VAAPI VPP for sharpness
... showinfo V->V Show textual information for each video frame.
... showpalette V->V Display frame palette.
T.. shuffleframes V->V Shuffle video frames.
T.. shuffleplanes V->V Shuffle video planes.
T.. sidedata V->V Manipulate video frame side data.
.S. signalstats V->V Generate statistics from video analysis.
... signature N->V Calculate the MPEG-7 video signature
T.. smartblur V->V Blur the input video without impacting the outlines.
TS. sobel V->V Apply sobel operator.
... sobel_opencl V->V Apply sobel operator
... split V->N Pass on the input to N video outputs.
T.C spp V->V Apply a simple post processing filter.
... sr V->V Apply DNN-based image super resolution to the input.
... ssim VV->V Calculate the SSIM between two video streams.
.S. stereo3d V->V Convert video stereoscopic 3D view.
..C streamselect N->N Select video streams
... subtitles V->V Render text subtitles onto input video using the libass library.
... super2xsai V->V Scale the input by 2x using the Super2xSaI pixel art algorithm.
T.. swaprect V->V Swap 2 rectangular objects in video.
T.. swapuv V->V Swap U and V components.
TS. tblend V->V Blend successive frames.
... telecine V->V Apply a telecine pattern.
... thistogram V->V Compute and draw a temporal histogram.
TS. threshold VVVV->V Threshold first video stream using other video streams.
T.. thumbnail V->V Select the most representative frame in a given sequence of consecutive frames.
... tile V->V Tile several successive frames together.
... tinterlace V->V Perform temporal field interlacing.
TS. tlut2 V->V Compute and apply a lookup table from two successive frames.
TS. tmedian V->V Pick median pixels from successive frames.
TS. tmix V->V Mix successive video frames.
.S. tonemap V->V Conversion to/from different dynamic ranges.
... tonemap_opencl V->V Perform HDR to SDR conversion with tonemapping.
... tonemap_vaapi V->V VAAPI VPP for tone-mapping
... tpad V->V Temporarily pad video frames.
.S. transpose V->V Transpose input video.
... transpose_opencl V->V Transpose input video
... transpose_vaapi V->V VAAPI VPP for transpose
... trim V->V Pick one continuous section from the input, drop the rest.
TS. unpremultiply N->V UnPreMultiply first stream with first plane of second stream.
TS. unsharp V->V Sharpen or blur the input video.
... unsharp_opencl V->V Apply unsharp mask to input video
... untile V->V Untile a frame into a sequence of frames.
T.. uspp V->V Apply Ultra Simple / Slow Post-processing filter.
.SC v360 V->V Convert 360 projection of video.
T.. vaguedenoiser V->V Apply a Wavelet based Denoiser.
... vectorscope V->V Video vectorscope.
T.. vflip V->V Flip the input video vertically.
... vfrdet V->V Variable frame rate detect filter.
TSC vibrance V->V Boost or alter saturation.
... vidstabdetect V->V Extract relative transformations, pass 1 of 2 for stabilization (see vidstabtransform for pass 2).
... vidstabtransform V->V Transform the frames, pass 2 of 2 for stabilization (see vidstabdetect for pass 1).
T.. vignette V->V Make or reverse a vignette effect.
... vmafmotion V->V Calculate the VMAF Motion score.
... vpp_qsv V->V Quick Sync Video VPP.
.S. vstack N->V Stack video inputs vertically.
TS. w3fdif V->V Apply Martin Weston three field deinterlace.
.S. waveform V->V Video waveform monitor.
... weave V->V Weave input video fields into frames.
.S. xbr V->V Scale the input using xBR algorithm.
.S. xfade VV->V Cross fade one video with another video.
... xfade_opencl VV->V Cross fade one video with another video.
.S. xmedian N->V Pick median pixels from several video inputs.
.S. xstack N->V Stack video inputs into custom layout.
TS. yadif V->V Deinterlace the input image.
TSC yaepblur V->V Yet another edge preserving blur filter.
... zoompan V->V Apply Zoom & Pan effect.
..C zscale V->V Apply resizing, colorspace and bit depth conversion.
... allrgb |->V Generate all RGB colors.
... allyuv |->V Generate all yuv colors.
... cellauto |->V Create pattern generated by an elementary cellular automaton.
..C color |->V Provide an uniformly colored input.
... frei0r_src |->V Generate a frei0r source.
.S. gradients |->V Draw a gradients.
... haldclutsrc |->V Provide an identity Hald CLUT.
... life |->V Create life.
... mandelbrot |->V Render a Mandelbrot fractal.
... mptestsrc |->V Generate various test pattern.
... nullsrc |->V Null video source, return unprocessed video frames.
... openclsrc |->V Generate video using an OpenCL program
... pal75bars |->V Generate PAL 75% color bars.
... pal100bars |->V Generate PAL 100% color bars.
... rgbtestsrc |->V Generate RGB test pattern.
.S. sierpinski |->V Render a Sierpinski fractal.
... smptebars |->V Generate SMPTE color bars.
... smptehdbars |->V Generate SMPTE HD color bars.
... testsrc |->V Generate test pattern.
... testsrc2 |->V Generate another test pattern.
... yuvtestsrc |->V Generate YUV test pattern.
... nullsink V->| Do absolutely nothing with the input video.
... abitscope A->V Convert input audio to audio bit scope video output.
... adrawgraph A->V Draw a graph using input audio metadata.
... agraphmonitor A->V Show various filtergraph stats.
... ahistogram A->V Convert input audio to histogram video output.
... aphasemeter A->N Convert input audio to phase meter video output.
... avectorscope A->V Convert input audio to vectorscope video output.
..C concat N->N Concatenate audio and video streams.
... showcqt A->V Convert input audio to a CQT (Constant/Clamped Q Transform) spectrum video output.
... showfreqs A->V Convert input audio to a frequencies video output.
.S. showspatial A->V Convert input audio to a spatial video output.
.S. showspectrum A->V Convert input audio to a spectrum video output.
.S. showspectrumpic A->V Convert input audio to a spectrum video output single picture.
... showvolume A->V Convert input audio volume to video output.
... showwaves A->V Convert input audio to a video output.
... showwavespic A->V Convert input audio to a video output single picture.
... spectrumsynth VV->A Convert input spectrum videos to audio output.
..C amovie |->N Read audio from a movie source.
..C movie |->N Read from a movie source.
... afifo A->A Buffer input frames and send them when they are requested.
... fifo V->V Buffer input images and send them when they are requested.
... abuffer |->A Buffer audio frames, and make them accessible to the filterchain.
... buffer |->V Buffer video frames, and make them accessible to the filterchain.
... abuffersink A->| Buffer audio frames, and make them available to the end of the filter graph.
... buffersink V->| Buffer video frames, and make them available to the end of the filter graph.
overlay: we can use special variables like W for the width of the video or H. Check the man page ffmpeg-filters
ffmprovisr: https://amiaopensource.github.io/ffmprovisr/ Filtering Examples: https://trac.ffmpeg.org/wiki/FancyFilteringExamples FFmpeg docs: https://ffmpeg.org/ffmpeg.html
In FFmpeg help:
ffmpeg -hide_banner -h filter=
FFmpeg: