图生图

Nano Banana 2 Reference to Image

Google's advanced AI-powered video-to-image generation model, designed to generate high-quality static images from video clips combined with text instructions.

Seedance 2.0 Reference-to-Video

Grok Imagine Video v1.5 Image-to-Video

Wan-2.7 Text-to-video

Kling v3.0 Pro Image-to-Video

Video Booth

探索更多

最近添加

NEW

图生视频

Midjourney V8.1 Image-to-Video

Midjourney V8.1 animates an input image into four 5-second videos at 480p or 720p.

Grok Imagine Video v1.5 Image-to-Video

xAI Grok Imagine Video v1.5 animates a starting frame image with natural-language motion prompts at 480p or 720p.

Gemini Omni Flash Image-to-Video Developer

Gemini Omni Flash is Google's multimodal video generation model. This image-to-video variant creates subject-consistent videos from up to 7 reference images combined with a text prompt, preserving visual identity across the full generated video.

Gemini Omni Flash Text-to-Video Developer

Gemini Omni Flash is Google's multimodal video generation model. This text-to-video variant generates high-quality cinematic videos from text prompts with support for multiple resolutions, aspect ratios, and controllable duration.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

Seedance 2.0 Text-to-Video

Generate videos from text prompts with native audio and optional web search.

Seedance 2.0 Image-to-Video

Generate videos from a first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Reference-to-Video

Multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Seedance 2.0 Fast Text-to-Video

Fast video generation from text prompts with native audio.

Seedance 2.0 Fast Image-to-Video

Fast video generation from first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Fast Reference-to-Video

Fast multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

Veo 3.1 Lite Text-to-video

High-efficiency Veo 3.1 Lite text-to-video: create video with synchronized audio from text prompts. Targets high-volume applications with strong price efficiency; 720p/1080p and flexible duration options. Does not support 4K outputs or Extension.

Veo 3.1 Lite Start-End Frame to Video

Veo 3.1 Lite start-end frame to video: generate motion between a first and last frame with audio. Lightweight, developer-oriented option with 8s duration and 720p/1080p. Does not support 4K outputs or Extension.

From

$0.05/秒

无限模型

NEW

HOT

文生视频

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

From

$0.1/秒

图生视频

Wan-2.2-spicy Image-to-video

Open and Advanced Large-Scale Video Generative Models.

From

$0.03/秒

图生视频

Wan-2.2-spicy Image-to-video Lora

Open and Advanced Large-Scale Video Generative Models.

From

$0.04/秒

视频转视频

Wan-2.2-spicy Video Extend

Open and Advanced Large-Scale Video Generative Models.

From

$0.032/秒

视频转视频

Wan-2.2-spicy Video Extend Lora

Open and Advanced Large-Scale Video Generative Models.

Wan 2.2 Turbo Spicy Infinite Image-to-Video

Image-to-video model for segmented prompt video generation with stable motion and 30fps workflow post-processing.

Wan 2.2 Turbo Spicy Infinite Image-to-Video LoRA

Image-to-video LoRA variant for segmented prompt video generation with stable motion and 30fps workflow post-processing.

Wan-2.2-turbo-spicy Image-to-video

Fast image-to-video generation powered by Wan 2.2 with rCM turbo acceleration. Supports 480p, 720p, and 1080p (via VSR upscaling) output with 5s or 8s duration.

Wan-2.2-turbo-spicy Image-to-video Lora

Fast image-to-video generation with custom LoRA support. Powered by Wan 2.2 rCM turbo with high/low noise LoRA injection. Supports 480p, 720p, and 1080p output.

Seedance v1.5 Pro Image-to-Video Spicy

Seedance V1.5 Pro Spicy transforms images into high-quality cinematic video with smooth motion and expressive animations, optimized for creative content at scale.

Wan-2.6 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Wan-2.6 Image-to-video

A speed-optimized image-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Wan-2.5 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Wan-2.5 Text-to-video Fast

Convert prompts into cinematic video clips with synchronized sound. Wan 2.5 generates 480p/720p/1080p outputs with stable motion, native audio sync, and prompt-faithful visual storytelling.

Van-2.6 Text-to-video

A speed-optimized text-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Van-2.6 Image-to-video

A speed-optimized image-to-video option that prioritizes lower latency while retaining strong visual fidelity. Ideal for iteration, batch generation, and prompt testing.

Van-2.5 Image-to-video

Get animated visuals from your images faster without major quality sacrifice. Perfect for preview workflows, previews at scale, or mass production of animated assets.

Van-2.5 Text-to-video

Convert prompts into cinematic video clips with synchronized sound. Van 2.5 generates 720p/1080p outputs with stable motion, native audio sync, and prompt-faithful visual storytelling.

Wan 2.6 Spicy Image-to-Video

AtlasCloud Wan 2.6 Spicy Image-to-Video turns a reference image into a short motion clip with expressive character movement and stable temporal detail.

Vidu Q3-Mix Reference to Video

Vidu Q3-Mix Reference-to-Video generates videos from 1-4 reference images with consistent subjects. Offers strong visual quality with intelligent scene transitions, smooth dynamic effects, and audio support up to 1080p.

Vidu Q3 Reference to Video

Vidu Q3 Reference-to-Video generates videos from 1-4 reference images with consistent subjects. Features intelligent camera switching with better consistency across multiple camera positions, audio support, and resolutions up to 1080p.

Vidu Q3-Pro Start-end-to-video

Vidu Q3-Pro Start-end-to-Video is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Vidu Q3-Turbo Image-to-video

Vidu Q3-Turbo Image-to-Video is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Vidu Q3-Turbo Start-end-to-video

Vidu Q3-Turbo Start-end-to-Video is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Vidu Q3-Turbo Text-to-video

Vidu Q3-Turbo Text-to-Video is an advanced AI video generation model that creates high-quality videos directly from text descriptions. With support for multiple styles, resolutions up to 1080p, and optional audio generation, it delivers cinematic results with smooth motion and rich detail.

Vidu Q3-Pro Image-to-video

Vidu Q3-Pro Image-to-Video is an advanced AI video generation model that brings static images to life. Upload a reference image and describe the motion you want — the model generates high-quality video with smooth animation, optional audio, and cinematic quality up to 1080p.

Vidu Q3-Pro Text-to-video

Vidu Q3-Pro Text-to-Video is an advanced AI video generation model that creates high-quality videos directly from text descriptions. With support for multiple styles, resolutions up to 1080p, and optional audio generation, it delivers cinematic results with smooth motion and rich detail.

From$0.05/秒

$0.042/秒

-15%

Sora 替代方案 2026

NEW

图生视频

Kling v3.0 Std Image-to-Video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Pro Image-to-Video

Kling v3.0 Professional Image-to-Video model by Kuaishou. Premium quality video generation from images with advanced features.

Kling v3.0 Pro Text-to-Video

Kling v3.0 Professional Text-to-Video model by Kuaishou. Premium quality video generation from text prompts with advanced features.

Kling v3.0 Std Text-to-Video

Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling Video O3 Pro Video-Edit

Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Professional quality with object removal/replacement, background changes, and effects.

Kling Video O3 Pro Reference-to-Video

Kling Omni Video O3 Reference-to-Video generates creative videos using character, prop, or scene references. Professional quality with up to 7 reference images and optional video input.

Kling Video O3 Pro Image-to-Video

Kling Omni Video O3 Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Professional quality with first/last frame control and audio generation.

Kling Video O3 Pro Text-to-Video

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Professional quality with enhanced motion and detail.

Kling Video O3 Std Video-Edit

Kling Omni Video O3 Video-Edit (Standard) enables natural-language video edits: remove or replace objects, change backgrounds, add effects, and more. Video duration limited to 10s.

Kling Video O3 Std Reference-to-Video

Kling Omni Video O3 (Standard) Reference-to-Video generates creative videos using character, prop, or scene references. Supports up to 7 reference images and optional video input.

Kling Video O3 Std Image-to-Video

Kling Omni Video O3 (Standard) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 Std Text-to-Video

Kling Omni Video O3 (Standard) is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Generates high-quality videos from text prompts with natural motion and audio generation support.

Veo3.1 Text-to-video

Generate high-fidelity videos from text prompts with Google’s most advanced generative video model. Veo 3.1 delivers cinematic quality, dynamic camera motion, and lifelike detail for storytelling and creative production.

Veo3.1 Reference-to-video

Create richly detailed videos guided by visual references. Veo 3.1 Reference-to-Video preserves characters, style, and composition across scenes for consistent, visually coherent storytelling.

Veo3.1 Image-to-video

Quickly animate static images into motion-rich, high-quality clips. Veo 3.1 Fast Image-to-Video accelerates rendering for fast previews and iterative visual storytelling.

Veo3.1 Fast Text-to-video

Generate visually compelling videos from text in record time. Veo 3.1 Fast Text-to-Video prioritizes speed and responsiveness while maintaining impressive fidelity for rapid creative iteration.

Seedance 2.0 Text-to-Video

Generate videos from text prompts with native audio and optional web search.

Seedance 2.0 Image-to-Video

Generate videos from a first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Reference-to-Video

Multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Seedance 2.0 Fast Text-to-Video

Fast video generation from text prompts with native audio.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

From

$0.14/秒

最佳图像模型

NEW

文生图

PROULTRA

Nano Banana Pro Text-to-image Ultra

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

Nano Banana Pro Text-to-image

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.

Nano Banana 2 Text-to-Image

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Edit

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana Pro Edit Ultra

Nano Banana Pro Edit is an image editing tool built on the Nano Banana model family, designed for precise, AI-powered visual adjustments.

Nano Banana Pro Edit

Nano Banana Pro Edit is an image editing tool built on the Nano Banana model family, designed for precise, AI-powered visual adjustments.

Seedream v5.0 Lite

ByteDance next-generation image model with enhanced quality, typography, and poster design. Supports PNG output and fast prompt optimization mode.

Seedream v5.0 Lite Edit

ByteDance next-generation image editing model that preserves facial features, lighting, and color tones while enabling professional-quality modifications.

Seedream v5.0 Lite Sequential

ByteDance next-generation image model with batch generation support. Generate up to 15 related images in a single request.

Seedream v5.0 Lite Edit Sequential

ByteDance next-generation image editing model with batch generation support. Edit multiple images while preserving facial features and details.

Wan-2.7 Text-to-image

Generates images from text prompts with Wan 2.7 image, supporting fast iteration and strong prompt fidelity for illustration and photorealistic outputs.

Wan-2.7 Image-to-image

Edits and recomposes images with Wan 2.7 image using text instructions, multi-image references, and optional interaction boxes.

Wan-2.7 Pro Text-to-image

Generates images from text prompts with Wan 2.7 image pro, supporting higher fidelity outputs and 4K-ready workflows.

Wan-2.7 Pro Image-to-image

Edits and recomposes images with Wan 2.7 image pro using text instructions and multi-image references for higher quality outputs.

Openai GPT Image 2 Text-to-Image

GPT Image 2 text to image is OpenAI's fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Openai GPT Image 2 Edit

GPT Image 2 Edit is OpenAI's image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Text-to-image

Qwen Image 2.0 is an advanced text-to-image model with enhanced image quality and improved prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Edit

Qwen Image 2.0 Edit is an advanced image-editing model with improved quality and better understanding of instructions. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Pro Edit

Qwen Image 2.0 Pro Edit is a professional-grade image editing model with superior quality and advanced instruction understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Qwen Image 2.0 Pro Text-to-image

Qwen Image 2.0 Pro is a professional-grade text-to-image model with superior quality and advanced prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

From$0.075/张

$0.06/张

-20%

最佳图像编辑模型

NEW

HOT

图生图

Wan-2.6 Image-to-image

Supports image editing and mixed text and image output to meet diverse generation and integration needs.

Seedream v4.5 Edit

ByteDance advanced image editing model that preserves facial features, lighting, and color tones while enabling professional-quality modifications.

Seedream v4.5 Edit Sequential

ByteDance advanced image editing model with batch generation support. Edit multiple images while preserving facial features and details.

Nano Banana Pro Edit Ultra

Nano Banana Pro Edit is an image editing tool built on the Nano Banana model family, designed for precise, AI-powered visual adjustments.

Nano Banana Pro Edit

Nano Banana Pro Edit is an image editing tool built on the Nano Banana model family, designed for precise, AI-powered visual adjustments.

Qwen-Image Edit

Supports multiple image inputs and outputs, allowing for precise modification of text within images, addition, deletion, or movement of objects, alteration of subject actions, transfer of image styles, and enhancement of image details.

Qwen-Image Edit Plus

Nano Banana Pro Edit Developer

Open and Advanced Large-Scale Image Generative Models.

Nano Banana 2 Edit

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Seedream v5.0 Lite Edit Sequential

ByteDance next-generation image editing model with batch generation support. Edit multiple images while preserving facial features and details.

Seedream v5.0 Lite Edit

ByteDance next-generation image editing model that preserves facial features, lighting, and color tones while enabling professional-quality modifications.

Seedream v4 Edit

Open and Advanced Large-Scale Image Generative Models.

Seedream v4 Edit Sequential

Open and Advanced Large-Scale Image Generative Models.

Qwen-Image Edit Plus 20251215

Qwen Image Edit

Qwen-Image-Edit — a 20B MMDiT model for next-gen image edit generation.

Openai GPT Image 2 Edit

Wan-2.7 Image-to-image

Edits and recomposes images with Wan 2.7 image using text instructions, multi-image references, and optional interaction boxes.

Wan-2.7 Pro Image-to-image

Edits and recomposes images with Wan 2.7 image pro using text instructions and multi-image references for higher quality outputs.

Nano Banana 2 Edit Developer

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Qwen Image 2.0 Edit

Qwen Image 2.0 Pro Edit

Openai GPT Image-1.5 Edit

GPT Image 1.5 Edit is OpenAI’s image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

From$0.009/张

$0.008/张

-15%

旗舰视频模型

NEW

HOT

文生视频

Veo3.1 Text-to-video

Veo3.1 Reference-to-video

Create richly detailed videos guided by visual references. Veo 3.1 Reference-to-Video preserves characters, style, and composition across scenes for consistent, visually coherent storytelling.

Veo3.1 Image-to-video

Quickly animate static images into motion-rich, high-quality clips. Veo 3.1 Fast Image-to-Video accelerates rendering for fast previews and iterative visual storytelling.

Kling v3.0 Std Image-to-Video

Kling v3.0 Standard Image-to-Video model by Kuaishou. High-quality video generation from images.

Kling v3.0 Pro Image-to-Video

Kling v3.0 Professional Image-to-Video model by Kuaishou. Premium quality video generation from images with advanced features.

Kling v3.0 Pro Text-to-Video

Kling v3.0 Professional Text-to-Video model by Kuaishou. Premium quality video generation from text prompts with advanced features.

Kling v3.0 Std Text-to-Video

Kling v3.0 Standard Text-to-Video model by Kuaishou. High-quality video generation from text prompts.

Kling Video O3 Pro Video-Edit

Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Professional quality with object removal/replacement, background changes, and effects.

Kling Video O3 Pro Reference-to-Video

Kling Omni Video O3 Reference-to-Video generates creative videos using character, prop, or scene references. Professional quality with up to 7 reference images and optional video input.

Kling Video O3 Pro Image-to-Video

Kling Omni Video O3 Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Professional quality with first/last frame control and audio generation.

Kling Video O3 Pro Text-to-Video

Kling Omni Video O3 is Kuaishou's advanced unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Professional quality with enhanced motion and detail.

Kling Video O3 Std Video-Edit

Kling Omni Video O3 Video-Edit (Standard) enables natural-language video edits: remove or replace objects, change backgrounds, add effects, and more. Video duration limited to 10s.

Kling Video O3 Std Reference-to-Video

Kling Omni Video O3 (Standard) Reference-to-Video generates creative videos using character, prop, or scene references. Supports up to 7 reference images and optional video input.

Kling Video O3 Std Image-to-Video

Kling Omni Video O3 (Standard) Image-to-Video transforms static images into dynamic cinematic videos using MVL technology. Supports first/last frame control and audio generation.

Kling Video O3 Std Text-to-Video

Vidu Q3-Pro Start-end-to-video

Vidu Q3-Turbo Image-to-video

Vidu Q3-Turbo Start-end-to-video

Vidu Q3-Turbo Text-to-video

Vidu Q3-Pro Image-to-video

Vidu Q3-Pro Text-to-video

Veo3.1 Fast Text-to-video

Generate visually compelling videos from text in record time. Veo 3.1 Fast Text-to-Video prioritizes speed and responsiveness while maintaining impressive fidelity for rapid creative iteration.

Veo3.1 Fast Image-to-video

Bring still images to life with smooth, expressive motion. Veo 3.1 Image-to-Video transforms photos or keyframes into cinematic video sequences with realistic continuity and sound.

Seedance 2.0 Text-to-Video

Generate videos from text prompts with native audio and optional web search.

Seedance 2.0 Image-to-Video

Generate videos from a first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Reference-to-Video

Multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Seedance 2.0 Fast Text-to-Video

Fast video generation from text prompts with native audio.

Seedance 2.0 Fast Image-to-Video

Fast video generation from first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Fast Reference-to-Video

Fast multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Wan-2.7 Text-to-video

Generates videos from text prompts with multi-shot narrative, audio generation, and sound-image synchronization.

Wan-2.7 Image-to-video

Animates images into videos with first-frame, first-and-last-frame, video continuation, and audio-driven modes.

Wan-2.7 Reference-to-video

Generates character-driven videos from reference images and videos, with multi-subject and voice-cloning support.

Wan-2.7 Video-edit

Edits videos using text instructions, reference images, and style transfer with multi-modal input support.

HappyHorse-1.0 Text-to-video

Generates videos from text prompts with HappyHorse 1.0, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Image-to-video

Animates a first-frame image into video with optional prompt guidance, 720P or 1080P output, and durations from 3 to 15 seconds.

HappyHorse-1.0 Reference-to-video

Generates videos from one to nine reference images and a text prompt, supporting 720P or 1080P output, flexible aspect ratios, and durations from 3 to 15 seconds.

HappyHorse-1.0 Video-edit

Edits an input video with text instructions and optional reference images, supporting 720P or 1080P output.

From

$0.14/秒

最佳虚拟化身模型

音频转视频

InfiniteTalk

InfiniteTalk turns a reference portrait and audio into a realistic talking-head video with lip-sync, supporting up to 10-minute audio in 480p or 720p.

Veed-fabric-1.0 Image-to-Video

暂无描述

Veed-fabric-1.0-fast Image-to-Video

暂无描述

Kling v2.6 Pro Avatar

Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content.

Kling v2.6 Std Avatar

Kling AI Avatar generates high-quality AI avatar videos for profiles, intros, and social content, delivering clean detail and cinematic motion with reliable prompt adherence.

Kling v2.6 Pro Motion Control

Kling 2.6 Pro Motion Control turns reference motion clips (dance, action, gesture) into smooth, realistic animations. Upload a character image (or source video) and a motion video; the model transfers the movement while preserving identity and temporal consistency.

Kling v2.6 Std Motion Control

Kling 2.6 Standard Motion Control transfers motion from reference videos to animate still images. Upload a character image and a motion clip (dance, action, gesture), and the model extracts the movement to generate smooth, realistic video.

From$0.07/秒

$0.06/秒

-15%

Seedance 2.0 & Nano Banana Pro/2 全球最低价

NEW

文生视频

Seedance 2.0 Text-to-Video

Generate videos from text prompts with native audio and optional web search.

Seedance 2.0 Image-to-Video

Generate videos from a first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Reference-to-Video

Multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Seedance 2.0 Fast Text-to-Video

Fast video generation from text prompts with native audio.

Seedance 2.0 Fast Image-to-Video

Fast video generation from first-frame image (and optional last-frame) with native audio.

Seedance 2.0 Fast Reference-to-Video

Fast multimodal video generation from reference images, videos, and audio. Supports video editing and extension.

Nano Banana 2 Text-to-Image

Google's lightweight yet powerful AI image generation model, built for creators who need fast, high-quality visuals from simple text prompts.

Nano Banana 2 Edit

Google's advanced AI-powered image editing and generation model, designed to make visual transformation as intuitive as describing it in words.

Nano Banana Pro Text-to-image

Nano Banana Pro is the next-generation Nano Banana image model, delivering sharper detail, richer color control, and faster diffusion for production-ready visuals.