This is a simplified guide to an AI model called Wan-2.1-T2v-720p maintained by Wavespeedai. If you like these kinds of analysis, you should join AImodels.fyi or follow us on Twitter.
wan-2.1-t2v-720p
specializes in high-resolution text-to-video generation, building on comprehensive video foundation models that generate fluid motion and detailed visuals. The model shares architecture with several related implementations including wan-2.1-t2v-480p and wan-2.1-i2v-720p, developed by wavespeedai.
Model inputs and outputs
The model takes text prompts and generates high-quality 720p video output. It uses advanced prompt extension techniques to enrich details and enhance video quality through either Dashscope API or local model processing.
Inputs
- Text prompt - Detailed description of desired video content
- Aspect ratio - 16:9 (1280x720), 9:16 (720x1280), or 1:1 (1024x1024)
- Frame count - 81-100 frames per video
- FPS - 5-24 frames per second
- Generation parameters - Sample steps, guide scale, and sample shift for fine control
Outputs
- Video file - High-resolution MP4 at specified dimensions and frame rate
Capabilities
The system excels at generating dynamic...
Top comments (0)