Title: | Working with Audio and Video in R |
---|---|
Description: | Bindings to 'FFmpeg' <http://www.ffmpeg.org/> AV library for working with audio and video in R. Generates high quality video from images or R graphics with custom audio. Also offers high performance tools for reading raw audio, creating 'spectrograms', and converting between countless audio / video formats. This package interfaces directly to the C API and does not require any command line utilities. |
Authors: | Jeroen Ooms [aut, cre] |
Maintainer: | Jeroen Ooms <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.9.3 |
Built: | 2024-12-11 06:44:22 UTC |
Source: | CRAN |
Splits a video file in a set of image files. Default image format is
jpeg which has good speed and compression. Use format = "png"
for
losless images.
av_video_images(video, destdir = tempfile(), format = "jpg", fps = NULL)
av_video_images(video, destdir = tempfile(), format = "jpg", fps = NULL)
video |
an input video |
destdir |
directory where to save the png files |
format |
image format such as |
fps |
sample rate of images. Use |
For large input videos you can set fps to sample only a limited number
of images per second. This also works with fractions, for example fps = 0.2
will output one image for every 5 sec of video.
Runs the expression and captures all plots into a video. The av_spectrogram_video function is a wrapper that plots data from read_audio_fft with a moving bar and background audio.
av_capture_graphics( expr, output = "output.mp4", width = 720, height = 480, framerate = 1, vfilter = "null", audio = NULL, verbose = TRUE, ... ) av_spectrogram_video( audio, output = "output.mp4", framerate = 25, verbose = TRUE, ... )
av_capture_graphics( expr, output = "output.mp4", width = 720, height = 480, framerate = 1, vfilter = "null", audio = NULL, verbose = TRUE, ... ) av_spectrogram_video( audio, output = "output.mp4", framerate = 25, verbose = TRUE, ... )
expr |
an R expression that generates the graphics to capture |
output |
name of the output file. File extension must correspond to a known
container format such as |
width |
width in pixels of the graphics device |
height |
height in pixels of the graphics device |
framerate |
video framerate in frames per seconds. This is the input fps, the output fps may be different if you specify a filter that modifies speed or interpolates frames. |
vfilter |
a string defining an ffmpeg filter graph. This is the same parameter
as the |
audio |
path to media file with audio stream |
verbose |
emit some output and a progress meter counting processed images. Must
be |
... |
extra graphics parameters passed to |
Other av:
demo()
,
encoding
,
formats
,
info
,
logging
,
read_audio_fft()
library(gapminder) library(ggplot2) makeplot <- function(){ datalist <- split(gapminder, gapminder$year) lapply(datalist, function(data){ p <- ggplot(data, aes(gdpPercap, lifeExp, size = pop, color = continent)) + scale_size("population", limits = range(gapminder$pop)) + geom_point() + ylim(20, 90) + scale_x_log10(limits = range(gapminder$gdpPercap)) + ggtitle(data$year) + theme_classic() print(p) }) } # Play 1 plot per sec, and use an interpolation filter to convert into 10 fps video_file <- file.path(tempdir(), 'output.mp4') av_capture_graphics(makeplot(), video_file, 1280, 720, res = 144, vfilter = 'framerate=fps=10') av::av_media_info(video_file) # utils::browseURL(video_file)
library(gapminder) library(ggplot2) makeplot <- function(){ datalist <- split(gapminder, gapminder$year) lapply(datalist, function(data){ p <- ggplot(data, aes(gdpPercap, lifeExp, size = pop, color = continent)) + scale_size("population", limits = range(gapminder$pop)) + geom_point() + ylim(20, 90) + scale_x_log10(limits = range(gapminder$gdpPercap)) + ggtitle(data$year) + theme_classic() print(p) }) } # Play 1 plot per sec, and use an interpolation filter to convert into 10 fps video_file <- file.path(tempdir(), 'output.mp4') av_capture_graphics(makeplot(), video_file, 1280, 720, res = 144, vfilter = 'framerate=fps=10') av::av_media_info(video_file) # utils::browseURL(video_file)
Generates random video for testing purposes.
av_demo( output = "demo.mp4", width = 960, height = 720, framerate = 5, verbose = TRUE, ... )
av_demo( output = "demo.mp4", width = 960, height = 720, framerate = 5, verbose = TRUE, ... )
output |
name of the output file. File extension must correspond to a known
container format such as |
width |
width in pixels of the graphics device |
height |
height in pixels of the graphics device |
framerate |
video framerate in frames per seconds. This is the input fps, the output fps may be different if you specify a filter that modifies speed or interpolates frames. |
verbose |
emit some output and a progress meter counting processed images. Must
be |
... |
other parameters passed to av_capture_graphics. |
Other av:
capturing
,
encoding
,
formats
,
info
,
logging
,
read_audio_fft()
Encodes a set of images into a video, using custom container format, codec, fps, video filters, and audio track. If input contains video files, this effectively combines and converts them to the specified output format.
av_encode_video( input, output = "output.mp4", framerate = 24, vfilter = "null", codec = NULL, audio = NULL, verbose = TRUE ) av_video_convert(video, output = "output.mp4", verbose = TRUE) av_audio_convert( audio, output = "output.mp3", format = NULL, channels = NULL, sample_rate = NULL, bit_rate = NULL, start_time = NULL, total_time = NULL, verbose = TRUE )
av_encode_video( input, output = "output.mp4", framerate = 24, vfilter = "null", codec = NULL, audio = NULL, verbose = TRUE ) av_video_convert(video, output = "output.mp4", verbose = TRUE) av_audio_convert( audio, output = "output.mp3", format = NULL, channels = NULL, sample_rate = NULL, bit_rate = NULL, start_time = NULL, total_time = NULL, verbose = TRUE )
input |
a vector with image or video files. A video input file is treated as a series of images. All input files should have the same width and height. |
output |
name of the output file. File extension must correspond to a known
container format such as |
framerate |
video framerate in frames per seconds. This is the input fps, the output fps may be different if you specify a filter that modifies speed or interpolates frames. |
vfilter |
a string defining an ffmpeg filter graph. This is the same parameter
as the |
codec |
name of the video codec as listed in av_encoders. The
default is |
audio |
audio or video input file with sound for the output video |
verbose |
emit some output and a progress meter counting processed images. Must
be |
video |
input video file with optionally also an audio track |
format |
a valid output format name from the list of |
channels |
number of output channels. Default |
sample_rate |
output sampling rate. Default |
bit_rate |
output bitrate (quality). A common value is 192000. Default
|
start_time |
number greater than 0, seeks in the input file to position. |
total_time |
approximate number of seconds at which to limit the duration of the output file. |
The target container format and audio/video codes are automatically determined from
the file extension of the output file, for example mp4
, mkv
, mov
, or flv
.
For video output, most systems also support gif
output, but the compression~quality
for gif is really bad. The gifski package
is better suited for generating animated gif files. Still using a proper video format
is results in much better quality.
It is recommended to use let ffmpeg choose the suitable codec for a given container
format. Most video formats default to the libx264
video codec which has excellent
compression and works on all modern browsers, operating systems, and digital TVs.
To convert from/to raw PCM audio, use file extensions ".ub"
or ".sb"
for 8bit
unsigned or signed respectively, or ".uw"
or ".sw"
for 16-bit, see extensions
in av_muxers()
. Alternatively can also convert to other raw audio PCM by setting
for example format = "u16le"
(i.e. unsigned 16-bit little-endian) or another option
from the name
column in av_muxers()
.
It is safe to interrupt the encoding process by pressing CTRL+C, or via setTimeLimit. When the encoding is interrupted, the output stream is properly finalized and all open files and resources are properly closed.
Other av:
capturing
,
demo()
,
formats
,
info
,
logging
,
read_audio_fft()
List supported filters, codecs and container formats.
av_encoders() av_decoders() av_filters() av_muxers() av_demuxers()
av_encoders() av_decoders() av_filters() av_muxers() av_demuxers()
Encoders and decoders convert between raw video/audio frames and compressed stream data for storage or transfer. However such a compressed data stream by itself does not constitute a valid video format yet. Muxers are needed to interleave one or more audio/video/subtitle streams, along with timestamps, metadata, etc, into a proper file format, such as mp4 or mkv.
Conversely, demuxers are needed to read a file format into the separate data streams for subsequent decoding into raw audio/video frames. Most operating systems natively support demuxing and decoding common formats and codecs, needed to play those videos. However for encoding and muxing such videos, ffmpeg must have been configured with specific external libraries for a given codec or format.
Other av:
capturing
,
demo()
,
encoding
,
info
,
logging
,
read_audio_fft()
Get video info such as width, height, format, duration and framerate. This may also be used for audio input files.
av_media_info(file)
av_media_info(file)
file |
path to an existing file |
Other av:
capturing
,
demo()
,
encoding
,
formats
,
logging
,
read_audio_fft()
Reads raw audio data from any common audio or video format. Use read_audio_bin to get raw PCM audio samples, or read_audio_fft to stream-convert directly into frequency domain (spectrum) data using FFmpeg built-in FFT.
read_audio_fft( audio, window = hanning(1024), overlap = 0.75, sample_rate = NULL, start_time = NULL, end_time = NULL ) read_audio_bin( audio, channels = NULL, sample_rate = NULL, start_time = NULL, end_time = NULL ) write_audio_bin( pcm_data, pcm_channels = 1L, pcm_format = "s32le", output = "output.mp3", ... )
read_audio_fft( audio, window = hanning(1024), overlap = 0.75, sample_rate = NULL, start_time = NULL, end_time = NULL ) read_audio_bin( audio, channels = NULL, sample_rate = NULL, start_time = NULL, end_time = NULL ) write_audio_bin( pcm_data, pcm_channels = 1L, pcm_format = "s32le", output = "output.mp3", ... )
audio |
path to the input sound or video file containing the audio stream |
window |
vector with weights defining the moving fft window function. The length of this vector is the size of the window and hence determines the output frequency range. |
overlap |
value between 0 and 1 of overlap proportion between moving fft windows |
sample_rate |
downsample audio to reduce FFT output size. Default keeps sample rate from the input file. |
start_time , end_time
|
position (in seconds) to cut input stream to be processed. |
channels |
number of output channels, set to 1 to convert to mono sound |
pcm_data |
integer vector as returned by read_audio_bin |
pcm_channels |
number of channels in the data. Use the same value as you entered in read_audio_bin. |
pcm_format |
this is always |
output |
passed to av_audio_convert |
... |
other paramters for av_audio_convert |
Currently read_audio_fft automatically converts input audio to mono channel such
that we get a single matrix. Use the plot()
method on data returned by read_audio_fft
to show the spectrogram. The av_spectrogram_video generates a video that plays
the audio while showing an animated spectrogram with moving status bar, which is
very cool.
Other av:
capturing
,
demo()
,
encoding
,
formats
,
info
,
logging
# Use a 5 sec fragment wonderland <- system.file('samples/Synapsis-Wonderland.mp3', package='av') # Read initial 5 sec as as frequency spectrum fft_data <- read_audio_fft(wonderland, end_time = 5.0) dim(fft_data) # Plot the spectrogram plot(fft_data) # Show other parameters dim(read_audio_fft(wonderland, end_time = 5.0, hamming(2048))) dim(read_audio_fft(wonderland, end_time = 5.0, hamming(4096)))
# Use a 5 sec fragment wonderland <- system.file('samples/Synapsis-Wonderland.mp3', package='av') # Read initial 5 sec as as frequency spectrum fft_data <- read_audio_fft(wonderland, end_time = 5.0) dim(fft_data) # Plot the spectrogram plot(fft_data) # Show other parameters dim(read_audio_fft(wonderland, end_time = 5.0, hamming(2048))) dim(read_audio_fft(wonderland, end_time = 5.0, hamming(4096)))
Several common windows function generators. The functions return a vector of weights to use in read_audio_fft.
hanning(n) hamming(n) blackman(n) bartlett(n) welch(n) flattop(n) bharris(n) bnuttall(n) sine(n) nuttall(n) bhann(n) lanczos(n) gauss(n) tukey(n) dolph(n) cauchy(n) parzen(n) bohman(n)
hanning(n) hamming(n) blackman(n) bartlett(n) welch(n) flattop(n) bharris(n) bnuttall(n) sine(n) nuttall(n) bhann(n) lanczos(n) gauss(n) tukey(n) dolph(n) cauchy(n) parzen(n) bohman(n)
n |
size of the window (number of weights to generate) |
# Window functions plot(hanning(1024), type = 'l', xlab = 'window', ylab = 'weight') lines(hamming(1024), type = 'l', col = 'red') lines(bartlett(1024), type = 'l', col = 'blue') lines(welch(1024), type = 'l', col = 'purple') lines(flattop(1024), type = 'l', col = 'darkgreen')
# Window functions plot(hanning(1024), type = 'l', xlab = 'window', ylab = 'weight') lines(hamming(1024), type = 'l', col = 'red') lines(bartlett(1024), type = 'l', col = 'blue') lines(welch(1024), type = 'l', col = 'purple') lines(flattop(1024), type = 'l', col = 'darkgreen')