- Qwen Image Edit Next Scene
  🎥 151 • Fast 4 step inference with Qwen Image Edit 2509
- FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation
  Paper • 2506.18899 • Published • 6
- MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
  Paper • 2403.01422 • Published • 30
- Shakker-Labs/FilmPortrait
  Text-to-Image • Updated • 163 • 223

Collections
Collections including paper arxiv:2401.01256
- SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
  Paper • 2401.01647 • Published • 13
- Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
  Paper • 2401.01827 • Published • 18
- VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM
  Paper • 2401.01256 • Published • 21
- TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
  Paper • 2401.00896 • Published • 15

- StarVector: Generating Scalable Vector Graphics Code from Images
  Paper • 2312.11556 • Published • 36
- Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
  Paper • 2312.12423 • Published • 13
- SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
  Paper • 2312.11392 • Published • 20
- stabilityai/stable-video-diffusion-img2vid-xt
  Image-to-Video • Updated • 168k • 3.21k

- FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
  Paper • 2311.13073 • Published • 58
- MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture
  Paper • 2311.10123 • Published • 18
- GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
  Paper • 2311.12631 • Published • 14
- VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
  Paper • 2312.00845 • Published • 38

- One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
  Paper • 2306.07967 • Published • 25
- Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
  Paper • 2306.07954 • Published • 111
- TryOnDiffusion: A Tale of Two UNets
  Paper • 2306.08276 • Published • 74
- Seeing the World through Your Eyes
  Paper • 2306.09348 • Published • 33

- UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
  Paper • 2311.09257 • Published • 47
- VideoPoet: A Large Language Model for Zero-Shot Video Generation
  Paper • 2312.14125 • Published • 46
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
  Paper • 2312.16862 • Published • 31
- VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM
  Paper • 2401.01256 • Published • 21