Seedance 1.5 Pro

dialogue-scenes · commercial · multilingual

ByteDance · Dual-Branch Diffusion Transformer · v1.5 · Verified

Starting from —/sec on FAL.ai

Resolution

1080p

Duration

4–12s

Providers

Text-to-Video · Image-to-Video · Audio · Camera · Lipsync

API Pricing

FAL.ai · V1.5 Pro · Cheapest

Text-to-Video with Audio: $0.26
Text-to-Video: $0.13
Image-to-Video with Audio: $0.26
Image-to-Video: $0.13

Verified 2026-04-10

Replicate · V1.5 Pro

Text-to-Video with Audio: —

Verified 2026-04-10
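To make the FAL.ai figures above easier to budget with, here is a minimal Python sketch of a cost estimate. It rests on two assumptions that this page does not confirm: that each listed price is per 5-second clip (as the "~$0.26 per 5-second clip with audio" figure quoted for FAL.ai suggests), and that cost scales linearly with duration. The function name and the price table are hypothetical helpers, not part of any provider SDK.

```python
# Assumption: listed FAL.ai prices are per 5-second clip and scale linearly
# with duration. Check the provider's pricing page before relying on this.
PRICE_PER_5S_USD = {
    ("text-to-video", True): 0.26,
    ("text-to-video", False): 0.13,
    ("image-to-video", True): 0.26,
    ("image-to-video", False): 0.13,
}


def estimate_cost_usd(mode: str, with_audio: bool, seconds: float) -> float:
    """Estimate the cost of one clip under the assumptions stated above."""
    if not 4 <= seconds <= 12:
        raise ValueError("duration must be within the listed 4-12 s range")
    per_second = PRICE_PER_5S_USD[(mode, with_audio)] / 5
    return round(per_second * seconds, 4)


if __name__ == "__main__":
    # A 10-second text-to-video clip with audio.
    print(estimate_cost_usd("text-to-video", True, 10))
```

If the provider bills by resolution or by token rather than by duration, the table and the linear scaling would need to change accordingly.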

Why Seedance 1.5 Pro?

Strengths

Native audio-visual joint generation with millisecond-precision lip-sync across 8+ languages
Dual-branch diffusion transformer processes video and audio in parallel for tight synchronization
Strong cinematic camera control — pan, tilt, zoom, truck, orbit all respond well to prompts
Multilingual dialogue support including Chinese dialects (Cantonese, Sichuanese, Shanghainese)
Competitive API pricing at ~$0.26 per 5-second clip with audio on FAL.ai

Limitations

Not open source — no self-deployment option available
Maximum 12-second duration is shorter than competitors like Kling v3 (15s) or Wan 2.7 (15s)
480p/720p output resolution on most endpoints — 1080p limited to certain configurations
No multi-shot generation in a single call — must stitch clips manually
Superseded by Seedance 2.0 which adds longer duration and higher quality
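Because clips are capped at 12 seconds and longer pieces have to be stitched manually, it can help to plan clip lengths up front. The following Python sketch splits a target running time into clips that each fall within the listed 4–12 s range. It assumes whole-second durations, and `plan_clips` is a hypothetical helper rather than anything offered by the model or its providers.

```python
import math

MIN_CLIP_S = 4   # shortest duration listed on this page
MAX_CLIP_S = 12  # longest duration listed on this page


def plan_clips(total_seconds: int) -> list[int]:
    """Split a total running time into evenly sized clips of 4-12 seconds."""
    if total_seconds < MIN_CLIP_S:
        raise ValueError("total duration is shorter than the minimum clip length")
    # Use the fewest clips that fit under the per-clip maximum.
    clip_count = math.ceil(total_seconds / MAX_CLIP_S)
    base, extra = divmod(total_seconds, clip_count)
    # The first `extra` clips get one additional second each.
    return [base + 1] * extra + [base] * (clip_count - extra)


if __name__ == "__main__":
    print(plan_clips(30))
    print(plan_clips(13))
```

Even splits keep every clip comfortably above the 4-second minimum; a greedy "12 seconds until the remainder" split could leave a final clip that is too short to generate.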

Prompt Guide

1. Structure prompts across four layers: subject definition, dialogue or key sound events, environmental audio cues, and visual style. Seedance 1.5 Pro processes all four simultaneously.
2. Include quoted dialogue with speaker descriptions for audio sync, e.g., [Woman, warm tone]: 'Welcome back.' This signals the model to prioritize lip-sync and voice generation.
3. Use explicit camera language (pan, tilt, zoom, truck, orbit, handheld), as cinematic lens response is a core strength of this model.
4. Use adverbs of degree to control motion intensity: 'quickly,' 'violently,' 'gently,' and 'with large amplitude' directly influence animation dynamics.
5. For multi-shot sequences, connect shots with 'camera switch' and describe the new scenario after each cut to maintain narrative coherence.
6. Specify ambient audio alongside visual cues, such as 'rain on cobblestones, distant thunder', since the model generates foley and environment sound natively.
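The four-layer structure and the bracketed dialogue tag from the tips above can be sketched as a small prompt builder. This is an illustrative Python helper only: the function names and the choice to join layers with spaces are assumptions, and the model accepts free-form text, so nothing here is a required format.

```python
def dialogue_line(speaker: str, tone: str, text: str) -> str:
    """Format dialogue in the [Speaker, tone]: 'line' style shown in tip 2."""
    return f"[{speaker}, {tone}]: '{text}'"


def build_prompt(subject: str, dialogue: str = "",
                 ambient_audio: str = "", visual_style: str = "") -> str:
    """Join the four layers in order, skipping any that are left empty."""
    layers = [subject, dialogue, ambient_audio, visual_style]
    return " ".join(layer.strip() for layer in layers if layer.strip())


if __name__ == "__main__":
    prompt = build_prompt(
        subject="A woman opens the front door of a small house at dusk.",
        dialogue=dialogue_line("Woman", "warm tone", "Welcome back."),
        ambient_audio="Ambient sounds: rain on cobblestones, distant thunder.",
        visual_style="Camera slowly zooms in, soft warm lighting.",
    )
    print(prompt)
```

Keeping each layer as a separate argument makes it easy to see which of the four is missing from a prompt before it is sent.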

✓ Do this

Follow the pattern: Subject + Motion + Scene + Shot/Style for each prompt segment
Keep prompts clear and concise — the model handles complex multi-element prompts but clarity beats length
For image-to-video, describe how the scene evolves FROM the input image, not a new scene
Describe both what viewers see AND what they hear in a unified instruction set
Specify lighting, weather, and time of day to anchor the visual atmosphere
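As a rough illustration of the Subject + Motion + Scene + Shot/Style pattern, the Python sketch below renders each segment from those four fields and joins segments with the 'camera switch' phrasing mentioned in the prompt guide. The `Shot` class and `join_shots` function are hypothetical, and the sentence template is only one of many reasonable ways to order the fields.

```python
from dataclasses import dataclass


@dataclass
class Shot:
    subject: str
    motion: str
    scene: str
    style: str

    def render(self) -> str:
        """Render one segment as Subject + Motion + Scene, then Shot/Style."""
        return f"{self.subject} {self.motion} {self.scene}. {self.style}."


def join_shots(shots: list[Shot]) -> str:
    """Connect segments with the 'Camera switch:' phrasing from the guide."""
    if not shots:
        raise ValueError("at least one shot is required")
    return " Camera switch: ".join(shot.render() for shot in shots)


if __name__ == "__main__":
    shots = [
        Shot("A koi", "glides slowly", "through a misty garden pond at dawn",
             "Slow aerial descent, soft morning light"),
        Shot("A cherry blossom petal", "falls gently", "onto the water",
             "Close-up, shallow depth of field"),
    ]
    print(join_shots(shots))
```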

✗ Avoid this

Expecting 4K output: maximum resolution is 1080p, unlike Seedance 2.0
Planning single clips longer than 12 seconds: duration is capped at 12 seconds, shorter than some competing models
Assuming animated or stylized content will work as easily as photorealism: it may require more prompt engineering
Relying on text rendered within the video: this is not reliably supported
Packing rapid camera movements into a single shot: this can introduce artifacts

Example Prompts

Corporate / Dialogue

“A middle-aged woman in business attire walks through a modern office lobby, morning sunlight streaming through glass windows, her heels clicking on marble floors. [Woman, confident tone]: 'The quarterly results exceeded expectations.' Camera follows her in a smooth tracking shot, shallow depth of field.”

Product / Commercial

“Close-up of a barista pouring latte art in a cozy cafe. Ambient sounds: espresso machine hissing, quiet jazz in the background. Camera holds steady, warm golden-hour lighting through the window, shallow depth of field on the milk swirl.”

Nature / Multi-shot

“Aerial shot slowly descending over a misty Japanese garden at dawn. A koi pond reflects cherry blossoms. Birds chirping, water trickling over stones. Camera switch: close-up of a single cherry blossom petal falling into the pond, creating gentle ripples.”

Based on the official prompt guide.

FAQ

Where can I use Seedance 1.5 Pro?

Via API on FAL.ai and Replicate.

How do I get good results with Seedance 1.5 Pro?

Structure prompts across four layers: subject definition, dialogue or key sound events, environmental audio cues, and visual style. Seedance 1.5 Pro processes all four simultaneously. See the Prompt Guide above.