Official PyTorch implementation of "Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)"

HyelinNAM/MotionPrompt

Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)

This repository is the official implementation of Optical-Flow Guided Prompt Optimization for Coherent Video Generation,
by Hyelin Nam*, Jaemin Kim*, Dohun Lee, and Jong Chul Ye.

Project Website

[Video comparison: Baseline vs. Ours]

Abstract

While text-to-video diffusion models have made significant strides, many still face challenges in generating videos with temporal consistency. Within diffusion frameworks, guidance techniques have proven effective in enhancing output quality during inference; however, applying these methods to video diffusion models introduces additional complexity of handling computations across entire sequences. To address this, we propose a novel framework called MotionPrompt that guides the video generation process via optical flow. Specifically, we train a discriminator to distinguish optical flow between random pairs of frames from real videos and generated ones. Given that prompts can influence the entire video, we optimize learnable token embeddings during reverse sampling steps by using gradients from a trained discriminator applied to random frame pairs. This approach allows our method to generate visually coherent video sequences that closely reflect natural motion dynamics, without compromising the fidelity of the generated content.
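As a rough illustration of the data side of discriminator training described above, the sketch below draws random frame pairs from real and generated videos and labels them. It is not taken from this repository: the function names, the record layout, and the choice to order each pair so that `i < j` are all assumptions made for the example, and the optical flow computation itself is left out.

```python
import random


def sample_frame_pairs(num_frames, num_pairs, rng):
    """Draw `num_pairs` random index pairs (i, j) with i < j from one video."""
    if num_frames < 2:
        raise ValueError("need at least two frames to form a pair")
    pairs = []
    for _ in range(num_pairs):
        # Two distinct frame indices, sorted so the earlier frame comes first
        # (ordering is an assumption of this sketch).
        i, j = sorted(rng.sample(range(num_frames), 2))
        pairs.append((i, j))
    return pairs


def build_discriminator_batch(real_lengths, fake_lengths, pairs_per_video, seed=0):
    """Return (source, video_index, i, j, label) records.

    label 1 marks a pair from a real video, label 0 a pair from a generated one.
    """
    rng = random.Random(seed)
    batch = []
    for label, lengths in ((1, real_lengths), (0, fake_lengths)):
        source = "real" if label == 1 else "generated"
        for video_index, num_frames in enumerate(lengths):
            for i, j in sample_frame_pairs(num_frames, pairs_per_video, rng):
                batch.append((source, video_index, i, j, label))
    rng.shuffle(batch)  # mix real and generated pairs within the batch
    return batch


# Hypothetical example: two real videos and one generated video, 16 frames each.
batch = build_discriminator_batch(
    real_lengths=[16, 16], fake_lengths=[16], pairs_per_video=3
)
```

In a full pipeline, each record would be turned into an optical flow field between frames `i` and `j`, and the discriminator would be trained to predict `label` from that flow.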

Overview


MotionPrompt enhances temporal consistency in text-to-video diffusion models by combining prompt optimization with an optical flow-based discriminator. Leveraging gradients from a subset of frames and aligning optical flow with real-world motion patterns, MotionPrompt efficiently generates videos with smooth, realistic motion and strong contextual coherence.
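The prompt optimization loop outlined above can be sketched in PyTorch as follows. This is a minimal toy under stated assumptions, not the code in this repository: `ToyDenoiser`, `ToyFlowDiscriminator`, and `toy_flow` are hypothetical stand-ins for the video diffusion model, the trained discriminator, and the optical flow estimator; the frame difference used as "flow" is only a placeholder; and the non-saturating loss, the Adam optimizer, and the latent update rule are choices made for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

T, C, H, W = 8, 4, 16, 16   # frames, latent channels, height, width
N, K, D = 6, 2, 32          # prompt tokens, learnable tokens, embedding dim


class ToyDenoiser(nn.Module):
    """Hypothetical stand-in for a text-conditioned video diffusion model."""

    def __init__(self):
        super().__init__()
        self.to_scale = nn.Linear(D, C)

    def forward(self, latents, text_emb):
        # Condition every frame on the mean text embedding via a channel scale.
        scale = self.to_scale(text_emb.mean(dim=0)).view(1, C, 1, 1)
        return latents * (1.0 + scale)  # "predicted clean" frames [T, C, H, W]


class ToyFlowDiscriminator(nn.Module):
    """Hypothetical stand-in for the trained optical flow discriminator."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(C, 8, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 1),
        )

    def forward(self, flow):
        return self.net(flow)  # logit; higher means "looks like real motion"


def toy_flow(frame_a, frame_b):
    # Placeholder only: a frame difference, NOT a real optical flow estimate.
    return (frame_b - frame_a).unsqueeze(0)  # [1, C, H, W]


# Both networks are frozen; only the extra token embeddings are optimized.
denoiser = ToyDenoiser().requires_grad_(False)
discriminator = ToyFlowDiscriminator().requires_grad_(False)

prompt_emb = torch.randn(N, D)               # frozen prompt embedding
learnable = nn.Parameter(torch.zeros(K, D))  # learnable token embeddings
optimizer = torch.optim.Adam([learnable], lr=1e-2)

latents = torch.randn(T, C, H, W)
losses = []
for step in range(5):  # stands in for the reverse sampling steps
    text_emb = torch.cat([prompt_emb, learnable], dim=0)
    pred = denoiser(latents, text_emb)

    # Use one random frame pair rather than the whole sequence.
    i, j = torch.randperm(T)[:2].tolist()
    logit = discriminator(toy_flow(pred[i], pred[j]))

    # Non-saturating loss: push the discriminator toward "real".
    loss = F.softplus(-logit).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

    # Toy latent update with the refreshed embedding (no gradient tracking).
    with torch.no_grad():
        text_emb = torch.cat([prompt_emb, learnable], dim=0)
        latents = 0.9 * latents + 0.1 * denoiser(latents, text_emb)
```

Because the gradient is taken through a single frame pair per step, the backward pass touches only two frames, while the updated token embeddings still condition every frame of the video.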
