Produce long-form avatar videos with Kling Avatar
Turn a single image and a voice track into up to 60 seconds of expressive, broadcast-ready avatar video — powered by Kuaishou's Kling model.
What Kling Avatar can do
Up to 60-second clips
Generate long-form avatar videos in a single render — ideal for full monologues and ad scripts.
One-image avatar
Provide a reference image and Kling animates it into a coherent speaking character that keeps a consistent identity throughout the clip.
Audio-driven performance
Lip-sync, expression, and head motion are driven directly by the prosody of your voice track.
Multi-format aspect ratios
Render 9:16 for Shorts and Reels, 1:1 for feed, and 16:9 for YouTube and web placements.
Standard and Pro modes
Pick the faster Standard tier, or switch to the Pro tier on Popcraft when you need maximum visual quality.
Multilingual input
Drive the performance with any voice track. Popcraft pairs Kling Avatar with ElevenLabs TTS in 70+ languages.
Built for real creators
Long-form explainer and UGC ads
Create a 60-second avatar spokesperson pitch for performance ads, landing pages, and course trailers without a physical shoot.
Virtual presenters and news reads
Produce branded anchor reads, internal announcements, and weekly update videos from one reference image and a script.
Localized creator content
Clone a creator's look and relaunch videos in multiple languages — pair Kling Avatar with multilingual TTS for instant market expansion.
E-learning and training modules
Build full-lesson talking-head segments where the instructor keeps visual consistency across an entire course library.
Specifications
- Provider: Kuaishou (Kling)
- Input types: Portrait image + audio
- Supported image formats: JPG / JPEG / PNG / WebP
- Supported audio formats: MP3 / WAV / M4A / AAC
- Aspect ratios: 9:16, 16:9, 1:1
- Min duration: 2 seconds
- Max duration: 60 seconds
- Quality modes: Standard, Pro
- Output format: MP4 with synchronized audio
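If you prepare uploads with a script, the limits above can be checked before you submit anything. The following is a minimal sketch, not Popcraft's or Kling's actual API: `AvatarRequest` and `validate_request` are hypothetical names, and only the format, aspect ratio, duration, and mode values come from the specification list.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List

# Limits copied from the specification list; every name below is hypothetical.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}
AUDIO_EXTS = {".mp3", ".wav", ".m4a", ".aac"}
ASPECT_RATIOS = {"9:16", "16:9", "1:1"}
MODES = {"standard", "pro"}
MIN_SECONDS, MAX_SECONDS = 2.0, 60.0


@dataclass
class AvatarRequest:
    image_path: str
    audio_path: str
    audio_seconds: float
    aspect_ratio: str = "9:16"
    mode: str = "standard"


def validate_request(req: AvatarRequest) -> List[str]:
    """Return a list of problems; an empty list means the request fits the specs."""
    problems = []
    if Path(req.image_path).suffix.lower() not in IMAGE_EXTS:
        problems.append("unsupported image format")
    if Path(req.audio_path).suffix.lower() not in AUDIO_EXTS:
        problems.append("unsupported audio format")
    if not MIN_SECONDS <= req.audio_seconds <= MAX_SECONDS:
        problems.append("audio must be between 2 and 60 seconds")
    if req.aspect_ratio not in ASPECT_RATIOS:
        problems.append("unsupported aspect ratio")
    if req.mode.lower() not in MODES:
        problems.append("unknown quality mode")
    return problems
```

The check relies on file extensions only, so it catches obvious mismatches but does not inspect the file contents.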
How Popcraft uses Kling Avatar
Kling Avatar is Popcraft's long-form talking-video option in the Character Studio, sitting alongside OmniHuman 1.5. Upload a portrait, attach a voice track of up to 60 seconds, choose 9:16, 16:9, or 1:1, and pick Standard or Pro mode. Popcraft routes the job through the Batch AI gateway, streams progress via SSE, and drops the rendered MP4 into your Avatar project. From there you can export the clip, share it to the project gallery, or splice it into a Remotion timeline next to SFX and BGM.
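For readers unfamiliar with SSE (Server-Sent Events), the sketch below shows how a client could decode progress updates from such a stream. The `data:` framing and the blank line that ends each event come from the SSE standard. The JSON payload fields `status` and `progress` are assumptions made for illustration, not Popcraft's documented schema.

```python
import json
import re
from typing import Dict, List


def parse_sse(stream_text: str) -> List[Dict]:
    """Decode the JSON payload of each complete event in a block of SSE text."""
    events = []
    data_lines = []
    # SSE allows CRLF, LF, or CR as line terminators.
    for line in re.split(r"\r\n|\n|\r", stream_text):
        if line == "":
            # A blank line dispatches the event collected so far.
            if data_lines:
                events.append(json.loads("\n".join(data_lines)))
                data_lines = []
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # A single leading space after the colon is not part of the value.
            if value.startswith(" "):
                value = value[1:]
            data_lines.append(value)
        # Other fields (event:, id:, retry:) and comment lines are ignored here.
    # Data not followed by a blank line is an incomplete event and is dropped.
    return events


# Hypothetical payloads; the real field names may differ.
sample = (
    'data: {"status": "rendering", "progress": 40}\n\n'
    'data: {"status": "done", "progress": 100}\n\n'
)
progress_values = [event["progress"] for event in parse_sse(sample)]
```

A production client would read the stream incrementally over HTTP rather than parse one finished string, but the framing rules are the same.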
Frequently asked questions
What is Kling Avatar?
Kling Avatar is Kuaishou's long-form talking-head video model. It animates a reference image into a speaking character driven by an audio track, producing clips of up to 60 seconds with synchronized lip movement and expressive performance.
How does Kling Avatar work?
You supply a portrait and a voice recording. The model reads the audio for timing, prosody, and emotion, then generates video where the reference character speaks in sync. Popcraft handles routing, polling, and delivery of the finished MP4.
Can I use Kling Avatar for free?
Popcraft's free tier includes credits you can spend on Kling Avatar. Longer clips and Pro-mode renders cost more credits, and paid plans or top-ups extend your monthly allowance.
Can I use the videos commercially?
Popcraft paid plans grant commercial rights to the avatar videos you render. You still need permission to use the likeness of any real person in your reference image. Do not generate talking-head videos of people without their consent.
How long can a video be?
Between 2 and 60 seconds per render on Popcraft. For anything longer, chain multiple clips on the Remotion timeline or stitch scenes together in the AI Agent pipeline.
How is Kling Avatar different from OmniHuman 1.5?
Both generate talking-head video from one portrait and audio. Kling Avatar is tuned for longer takes up to 60 seconds and offers Standard and Pro quality tiers. OmniHuman 1.5 tops out at 30 seconds at 1080p and adds mask support for multi-face scenes.
How do I get started?
Open the Character page, pick Kling Avatar, upload a portrait, attach your audio, choose aspect ratio and quality mode, and submit. Progress streams live and the rendered video saves to your gallery and asset library.
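For scripts longer than the 60-second limit, the audio has to be divided into renders that each fall between 2 and 60 seconds before the clips are chained. The sketch below shows one way to plan those cuts by splitting the total length evenly, which avoids a leftover final segment shorter than 2 seconds. `plan_segments` is a hypothetical helper, not a Popcraft feature.

```python
import math
from typing import List, Tuple

MIN_SECONDS = 2.0   # shortest render allowed
MAX_SECONDS = 60.0  # longest render allowed


def plan_segments(total_seconds: float) -> List[Tuple[float, float]]:
    """Split a voice track into equal (start, end) ranges of at most 60 seconds."""
    if total_seconds < MIN_SECONDS:
        raise ValueError("audio is shorter than the 2-second minimum")
    # Use the fewest renders that keeps every segment at or under the maximum.
    count = math.ceil(total_seconds / MAX_SECONDS)
    length = total_seconds / count
    return [(i * length, (i + 1) * length) for i in range(count)]
```

For example, `plan_segments(150)` yields three 50-second ranges instead of 60 + 60 + 30. Even cuts can land mid-word, so in practice you would nudge each boundary to the nearest pause in the speech.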
Ready to try Kling Avatar?
Start creating in seconds with 100 free credits — no card required.
Try avatar video free