Diffusers documentation
Video Processor
You are viewing main version, which requires installation from source. If you'd like
regular pip install, checkout the latest stable version (v0.32.2).
Video Processor
The VideoProcessor
provides a unified API for video pipelines to prepare inputs for VAE encoding and post-processing outputs once they’re decoded. The class inherits VaeImageProcessor
so it includes transformations such as resizing, normalization, and conversion between PIL Image, PyTorch, and NumPy arrays.
VideoProcessor
diffusers.video_processor.VideoProcessor.preprocess_video
< source >( video height: typing.Optional[int] = None width: typing.Optional[int] = None )
Parameters
- video (
List[PIL.Image]
,List[List[PIL.Image]]
,torch.Tensor
,np.array
,List[torch.Tensor]
,List[np.array]
) — The input video. It can be one of the following:- List of the PIL images.
- List of list of PIL images.
- 4D Torch tensors (expected shape for each tensor
(num_frames, num_channels, height, width)
). - 4D NumPy arrays (expected shape for each array
(num_frames, height, width, num_channels)
). - List of 4D Torch tensors (expected shape for each tensor
(num_frames, num_channels, height, width)
). - List of 4D NumPy arrays (expected shape for each array
(num_frames, height, width, num_channels)
). - 5D NumPy arrays: expected shape for each array
(batch_size, num_frames, height, width, num_channels)
. - 5D Torch tensors: expected shape for each array
(batch_size, num_frames, num_channels, height, width)
.
- height (
int
, optional, defaults toNone
) — The height in preprocessed frames of the video. IfNone
, will use theget_default_height_width()
to get default height. - width (
int
, optional, defaults to
None) -- The width in preprocessed frames of the video. If
None, will use get_default_height_width()
to get the default width.
Preprocesses input video(s).
diffusers.video_processor.VideoProcessor.postprocess_video
< source >( video: Tensor output_type: str = 'np' )
Converts a video tensor to a list of frames for export.