Seedance 1.5 Pro

dialogue-scenescommercialmultilingual

ByteDance · Dual-Branch Diffusion Transformer · v1.5verifiedVerified

/sec

starting from, on FAL.ai

Resolution

1080p

Duration

4–12s

Providers

2

Text-to-VideoImage-to-VideoAudioCameraLipsync

API Pricing

FAL.aiV1.5 ProCheapest
Try it →
Text-to-VideoAudio
$0.26
Text-to-Video
$0.13
Image-to-VideoAudio
$0.26
Image-to-Video
$0.13
Verified 2026-04-10
ReplicateV1.5 Pro
Try it →
Text-to-VideoAudio
Verified 2026-04-10

Why Seedance 1.5 Pro?

thumb_upStrengths

  • Native audio-visual joint generation with millisecond-precision lip-sync across 8+ languages
  • Dual-branch diffusion transformer processes video and audio in parallel for tight synchronization
  • Strong cinematic camera control — pan, tilt, zoom, truck, orbit all respond well to prompts
  • Multilingual dialogue support including Chinese dialects (Cantonese, Sichuanese, Shanghainese)
  • Competitive API pricing at ~$0.26 per 5-second clip with audio on FAL.ai

infoLimitations

  • Not open source — no self-deployment option available
  • Maximum 12-second duration is shorter than competitors like Kling v3 (15s) or Wan 2.7 (15s)
  • 480p/720p output resolution on most endpoints — 1080p limited to certain configurations
  • No multi-shot generation in a single call — must stitch clips manually
  • Superseded by Seedance 2.0 which adds longer duration and higher quality

auto_fix_highPrompt Guide

  1. 1Structure prompts across four layers: subject definition, dialogue or key sound events, environmental audio cues, and visual style — Seedance 1.5 Pro processes all four simultaneously.
  2. 2Include quoted dialogue with speaker descriptions for audio sync — e.g., [Woman, warm tone]: 'Welcome back.' This signals the model to prioritize lip-sync and voice generation.
  3. 3Use explicit camera language — pan, tilt, zoom, truck, orbit, handheld — as cinematic lens response is a core strength of this model.
  4. 4Use adverbs of degree to control motion intensity: 'quickly,' 'violently,' 'gently,' 'with large amplitude' directly influence animation dynamics.
  5. 5For multi-shot sequences, connect shots with 'camera switch' and describe the new scenario after each cut to maintain narrative coherence.
  6. 6Specify ambient audio alongside visual cues — 'rain on cobblestones, distant thunder' — since the model generates foley and environment sound natively.

✓ Do this

  • Follow the pattern: Subject + Motion + Scene + Shot/Style for each prompt segment
  • Keep prompts clear and concise — the model handles complex multi-element prompts but clarity beats length
  • For image-to-video, describe how the scene evolves FROM the input image, not a new scene
  • Describe both what viewers see AND what they hear in a unified instruction set
  • Specify lighting, weather, and time of day to anchor the visual atmosphere

✗ Avoid this

  • Maximum resolution is 1080p — no 4K output unlike Seedance 2.0
  • Duration capped at 12 seconds — shorter than some competing models
  • Animated or stylized content may require more prompt engineering than photorealism
  • Text rendering within video is not reliably supported
  • Rapid camera movements in a single shot can introduce artifacts

Example Prompts

Corporate / Dialogue

A middle-aged woman in business attire walks through a modern office lobby, morning sunlight streaming through glass windows, her heels clicking on marble floors. [Woman, confident tone]: 'The quarterly results exceeded expectations.' Camera follows her in a smooth tracking shot, shallow depth of field.

Product / Commercial

Close-up of a barista pouring latte art in a cozy cafe. Ambient sounds: espresso machine hissing, quiet jazz in the background. Camera holds steady, warm golden-hour lighting through the window, shallow depth of field on the milk swirl.

Nature / Multi-shot

Aerial shot slowly descending over a misty Japanese garden at dawn. A koi pond reflects cherry blossoms. Birds chirping, water trickling over stones. Camera switch: close-up of a single cherry blossom petal falling into the pond, creating gentle ripples.

Based on the official prompt guide →

FAQexpand_more

Where can I use Seedance 1.5 Pro?

Via API on FAL.ai and Replicate.

How do I get good results with Seedance 1.5 Pro?

Structure prompts across four layers: subject definition, dialogue or key sound events, environmental audio cues, and visual style — Seedance 1.5 Pro processes all four simultaneously. See the prompt guide below.