Motion Prior Distillation in Time Reversal Sampling for Generative Inbetweening

1Yonsei University   2GIST
🌈 ICLR 2026

Teaser: Generative inbetweening results on the DAVIS dataset using our MPD sampler.

Abstract

Recent progress in image-to-video (I2V) diffusion models has significantly advanced the field of generative inbetweening, which aims to generate semantically plausible frames between two keyframes. In particular, inference-time sampling strategies, which leverage the generative priors of large-scale pre-trained I2V models without additional training, have become increasingly popular. However, existing inference-time sampling, either fusing forward and backward paths in parallel or alternating them sequentially, often suffers from temporal discontinuities and undesired visual artifacts due to the misalignment between two generated paths. This is because each path follows the motion prior induced by its own conditioning frame. We thus propose Motion Prior Distillation (MPD), a simple yet effective inference-time distillation technique that suppresses bidirectional mismatch by distilling the motion residual of the forward path into the backward path. MPD alleviates the misalignment by reconstructing the denoised estimate of the backward path from distilled forward motion residual. With our method, we can deliberately avoid denoising end-conditioned path which causes the ambiguity of the path, and yield more temporally coherent inbetweening results with the forward motion prior. Our method can be applied to off-the-shelf inbetweening works without any modification of model parameters. We not only perform quantitative evaluations on standard benchmarks, but also conduct extensive user studies to demonstrate the effectiveness of our approach in practical scenarios.

Method

Existing time-reversal sampling without motion prior alignment
Existing time reversal sampling methods simply connect the two temporal paths either by (a) linearly fusing them or (b) alternatively denoising each path. However, incompatible motion priors induced by two frame conditions could introduce the ambiguity between the two denoising paths.

Our MPD: distill forward motion residual into backward path
Motion Prior Distillation (MPD) aligns the backward path with the forward one by distilling the forward motion residual.
Unified motion prior: Reducing motion conflicts between forward and backward paths
Computationally efficient: Inference-time prior distillation without any model modification
More coherent videos: Improving temporal consistency and perceptual quality

Baseline Comparisons

We are preparing side-by-side videos for MPD and baselines!

BibTeX

@inproceedings{anonymous2026motion,
  title={Motion Prior Distillation in Time Reversal Sampling for Generative Inbetweening},
  author={Anonymous},
  booktitle={The Fourteenth International Conference on Learning Representations},
  year={2026},
  url={https://openreview.net/forum?id=GRElsj9W2t}
}