Wan: Open and Advanced Large-Scale Video Generative Models
Find click coordinates on images based on instructions
Generate text from an image and question
Upgraded to v1.0!