kwaivgi/kling-lipsync/audio-to-video

Synchronizes facial motion with real audio input for expressive, speech-driven video avatars.

Detailed Specifications

Overview:

Model provider: KWAIVGI
Model type: audio-to-video
Deployment: Inference API; Playground
Pricing: $0.1275/second
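
As a rough illustration of the per-second pricing, the sketch below estimates the cost of a clip from its duration. It assumes billing is strictly linear in output duration, with no minimum charge and no rounding of seconds; the listing does not confirm this. The function name `estimate_cost` and the cent-level rounding are hypothetical choices made for the example, not part of any Atlas Cloud API.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed price for kwaivgi/kling-lipsync/audio-to-video, in USD per second.
PRICE_PER_SECOND_USD = Decimal("0.1275")


def estimate_cost(duration_seconds):
    """Estimate the cost in USD of a clip of the given duration.

    Assumption: cost = duration * price, rounded to the nearest cent.
    Actual billing rules (minimums, rounding) may differ.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Convert through str so float inputs such as 7.5 stay exact.
    cost = Decimal(str(duration_seconds)) * PRICE_PER_SECOND_USD
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for seconds in (5, 10, 60):
        print(f"{seconds:>3} s -> ${estimate_cost(seconds)}")
```

Under these assumptions, a 60-second clip comes to about $7.65.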

Key specifications:

Size limit: Max width × height (user-defined)
LoRA support: No
Seed option: N/A

Explore Similar Models

Kling Video O1 Text-to-video (NEW)
Model type: text-to-video

Kling Omni Video O1 is Kuaishou's first unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

Pricing: $0.0952/second

Kling Video O1 Reference-to-video (NEW)
Model type: video-to-video

Kling Omni Video O1 Reference-to-Video generates creative videos using character, prop, or scene references from multiple viewpoints. It extracts subject features and creates new video content while maintaining identity consistency across frames. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

Pricing: $0.0952/second

Kling Video O1 Image-to-video (NEW)
Model type: image-to-video

Kling Omni Video O1 Image-to-Video transforms static images into dynamic cinematic videos using MVL (Multi-modal Visual Language) technology. It maintains subject consistency while adding natural motion, physics simulation, and seamless scene dynamics. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

Pricing: $0.0952/second

Kling v2.5 Turbo Pro Text-to-video
Model type: text-to-video

Delivers high-speed text-to-video generation with cinematic motion precision and enhanced temporal stability.

Pricing: $0.0595/second

Start from 300+ models, only on Atlas Cloud.