Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy Paper • 2511.21579 • Published Nov 26, 2025 • 23
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 45
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 45
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 45 • 5
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation Paper • 2511.19365 • Published Nov 24, 2025 • 63
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation Paper • 2303.00440 • Published Mar 1, 2023
DPL: Decoupled Prompt Learning for Vision-Language Models Paper • 2308.10061 • Published Aug 19, 2023 • 1
MGMAE: Motion Guided Masking for Video Masked Autoencoding Paper • 2308.10794 • Published Aug 21, 2023
StableDrag: Stable Dragging for Point-based Image Editing Paper • 2403.04437 • Published Mar 7, 2024 • 27
VFIMamba: Video Frame Interpolation with State Space Models Paper • 2407.02315 • Published Jul 2, 2024
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published Nov 5, 2025 • 52
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published Nov 5, 2025 • 52
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE Paper • 2507.21802 • Published Jul 29, 2025 • 17
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Paper • 2410.01699 • Published Oct 2, 2024 • 18