Seedance 1.5 Pro
Tags: dialogue-scenes, commercial, multilingual
ByteDance · Dual-Branch Diffusion Transformer · v1.5 · Verified
Starting price on FAL.ai: not shown (per second)
Resolution: 1080p
Duration: 4–12s
Providers: 2
Why Seedance 1.5 Pro?
Strengths
- Native audio-visual joint generation with millisecond-precision lip-sync across 8+ languages
- Dual-branch diffusion transformer processes video and audio in parallel for tight synchronization
- Strong cinematic camera control — pan, tilt, zoom, truck, orbit all respond well to prompts
- Multilingual dialogue support including Chinese dialects (Cantonese, Sichuanese, Shanghainese)
- Competitive API pricing at ~$0.26 per 5-second clip with audio on FAL.ai
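As a rough illustration of the pricing figure above, the sketch below derives a per-second rate from the quoted ~$0.26 per 5-second clip and extrapolates to other clip lengths. It assumes cost scales linearly with duration, which this page does not state. `estimate_cost` and the constants are hypothetical names for illustration, and actual FAL.ai billing may differ.

```python
# Figures quoted above: ~$0.26 for a 5-second clip with audio on FAL.ai.
QUOTED_PRICE_USD = 0.26
QUOTED_CLIP_SECONDS = 5

# Assumption: price scales linearly with clip length (not confirmed by the source).
PRICE_PER_SECOND_USD = QUOTED_PRICE_USD / QUOTED_CLIP_SECONDS


def estimate_cost(duration_s: float, clips: int = 1) -> float:
    """Rough cost estimate in USD for `clips` clips of `duration_s` seconds each."""
    if duration_s <= 0 or clips < 1:
        raise ValueError("duration_s must be positive and clips at least 1")
    return round(PRICE_PER_SECOND_USD * duration_s * clips, 4)


if __name__ == "__main__":
    for seconds in (4, 5, 12):
        print(f"{seconds:>2}s clip: ~${estimate_cost(seconds):.3f}")
```

Under the linear assumption, the quoted figure works out to about $0.052 per second, so a maximum-length 12-second clip would come to roughly $0.62.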
Limitations
- Not open source — no self-deployment option available
- Maximum 12-second duration is shorter than competitors like Kling v3 (15s) or Wan 2.7 (15s)
- 480p/720p output resolution on most endpoints — 1080p limited to certain configurations
- No multi-shot generation in a single call — must stitch clips manually
- Superseded by Seedance 2.0 which adds longer duration and higher quality
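Because clips top out at 12 seconds and longer pieces have to be stitched manually, it can help to plan clip lengths up front. The sketch below splits a target runtime into the fewest clips that each fit the 4–12 second range listed on this page. `plan_clips` is a hypothetical helper, and the use of whole-second durations is an assumption rather than a documented API constraint.

```python
import math

MIN_CLIP_S = 4   # shortest clip length listed on this page
MAX_CLIP_S = 12  # longest clip length listed on this page


def plan_clips(total_seconds: int) -> list[int]:
    """Split a target runtime into the fewest clips of 4-12 whole seconds each."""
    if total_seconds < MIN_CLIP_S:
        raise ValueError(f"target must be at least {MIN_CLIP_S} seconds")
    count = math.ceil(total_seconds / MAX_CLIP_S)
    base, extra = divmod(total_seconds, count)
    # The first `extra` clips get one additional second so lengths stay balanced.
    return [base + 1 if i < extra else base for i in range(count)]


if __name__ == "__main__":
    print(plan_clips(30))  # three clips of 10 seconds each
```

Balancing the lengths, rather than filling 12-second clips and leaving a short remainder, avoids ending up with a final clip below the 4-second minimum.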
Prompt Guide
1. Structure prompts across four layers: subject definition, dialogue or key sound events, environmental audio cues, and visual style. Seedance 1.5 Pro processes all four simultaneously.
2. Include quoted dialogue with speaker descriptions for audio sync, e.g., [Woman, warm tone]: 'Welcome back.' This signals the model to prioritize lip-sync and voice generation.
3. Use explicit camera language (pan, tilt, zoom, truck, orbit, handheld), as cinematic lens response is a core strength of this model.
4. Use adverbs of degree to control motion intensity: 'quickly,' 'violently,' 'gently,' 'with large amplitude' directly influence animation dynamics.
5. For multi-shot sequences, connect shots with 'camera switch' and describe the new scenario after each cut to maintain narrative coherence.
6. Specify ambient audio alongside visual cues, such as 'rain on cobblestones, distant thunder', since the model generates foley and environment sound natively.
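The four-layer structure described above can be sketched as a small prompt builder. This is a minimal illustration, not an official SDK: `LayeredPrompt` and its field names are hypothetical, and the only formats taken from this page are the `[Speaker, tone]: 'Line.'` dialogue tag and the "Ambient sounds:" phrasing used in the example prompts.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LayeredPrompt:
    subject: str                         # who or what is on screen, and what they do
    visual_style: str                    # camera language, lighting, overall look
    speaker: Optional[str] = None        # e.g. "Woman, warm tone"
    line: Optional[str] = None           # spoken dialogue, without surrounding quotes
    ambient_audio: Optional[str] = None  # e.g. "rain on cobblestones, distant thunder"

    def render(self) -> str:
        """Assemble the layers into one prompt string."""
        parts = [self.subject.strip().rstrip(".") + "."]
        if self.speaker and self.line:
            # Dialogue tag format taken from the guide: [Speaker, tone]: 'Line.'
            parts.append(f"[{self.speaker}]: '{self.line}'")
        if self.ambient_audio:
            parts.append(f"Ambient sounds: {self.ambient_audio}.")
        parts.append(self.visual_style.strip().rstrip(".") + ".")
        return " ".join(parts)


if __name__ == "__main__":
    prompt = LayeredPrompt(
        subject="A woman opens the front door of a small house",
        visual_style="Slow zoom in, warm evening light",
        speaker="Woman, warm tone",
        line="Welcome back.",
        ambient_audio="rain on cobblestones, distant thunder",
    )
    print(prompt.render())
```

Keeping the dialogue and ambient layers optional means the same builder also covers silent or dialogue-free shots.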
✓ Do this
- Follow the pattern: Subject + Motion + Scene + Shot/Style for each prompt segment
- Keep prompts clear and concise — the model handles complex multi-element prompts but clarity beats length
- For image-to-video, describe how the scene evolves FROM the input image, not a new scene
- Describe both what viewers see AND what they hear in a unified instruction set
- Specify lighting, weather, and time of day to anchor the visual atmosphere
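One way to apply the Subject + Motion + Scene + Shot/Style pattern is to model each prompt segment as a small record and join segments with the 'camera switch' cue from the prompt guide. The `Shot` type and `join_shots` function below are hypothetical names for illustration. Since this page also lists the lack of multi-shot generation in a single call as a limitation, results for joined segments may vary.

```python
from typing import NamedTuple


class Shot(NamedTuple):
    subject: str
    motion: str
    scene: str
    style: str  # shot type and visual style

    def render(self) -> str:
        """Render one segment as: Subject Motion Scene. Shot/Style."""
        return f"{self.subject} {self.motion} {self.scene}. {self.style}."


def join_shots(shots: list[Shot]) -> str:
    """Join segments with the 'Camera switch:' cue described in the prompt guide."""
    if not shots:
        raise ValueError("at least one shot is required")
    return " Camera switch: ".join(shot.render() for shot in shots)


if __name__ == "__main__":
    print(join_shots([
        Shot("A koi", "glides slowly", "through a misty garden pond at dawn",
             "Aerial shot, soft diffuse light"),
        Shot("A cherry blossom petal", "falls gently", "onto the water",
             "Close-up, shallow depth of field"),
    ]))
```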
✗ Avoid this
- Requesting 4K output: the maximum resolution is 1080p, unlike Seedance 2.0
- Asking for clips longer than 12 seconds: duration is capped at 12 seconds, shorter than some competing models
- Expecting animated or stylized content to work as easily as photorealism: it may require more prompt engineering
- Relying on legible on-screen text: text rendering within video is not reliably supported
- Packing rapid camera movements into a single shot: they can introduce artifacts
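The hard limits above (1080p maximum, 4–12 second duration) can be checked before a request is sent. The sketch below is a hypothetical pre-flight check. The parameter names are illustrative and are not the FAL.ai or Replicate request schema, and the resolution set is simply the values mentioned on this page.

```python
ALLOWED_RESOLUTIONS = {"480p", "720p", "1080p"}  # resolutions mentioned on this page
MIN_DURATION_S, MAX_DURATION_S = 4, 12           # duration range listed on this page


def check_request(resolution: str, duration_s: int) -> list[str]:
    """Return human-readable problems; an empty list means the request looks fine."""
    problems = []
    if resolution not in ALLOWED_RESOLUTIONS:
        problems.append(f"resolution {resolution!r} is not supported (max 1080p)")
    if not MIN_DURATION_S <= duration_s <= MAX_DURATION_S:
        problems.append(
            f"duration {duration_s}s is outside {MIN_DURATION_S}-{MAX_DURATION_S}s"
        )
    return problems


if __name__ == "__main__":
    for issue in check_request("4k", 15):
        print("warning:", issue)
```

Returning a list of problems rather than raising on the first one lets a caller report every issue with a request at once.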
Example Prompts
“A middle-aged woman in business attire walks through a modern office lobby, morning sunlight streaming through glass windows, her heels clicking on marble floors. [Woman, confident tone]: 'The quarterly results exceeded expectations.' Camera follows her in a smooth tracking shot, shallow depth of field.”
“Close-up of a barista pouring latte art in a cozy cafe. Ambient sounds: espresso machine hissing, quiet jazz in the background. Camera holds steady, warm golden-hour lighting through the window, shallow depth of field on the milk swirl.”
“Aerial shot slowly descending over a misty Japanese garden at dawn. A koi pond reflects cherry blossoms. Birds chirping, water trickling over stones. Camera switch: close-up of a single cherry blossom petal falling into the pond, creating gentle ripples.”
Based on the official prompt guide.
FAQ
Where can I use Seedance 1.5 Pro?
Via API on FAL.ai and Replicate.
How do I get good results with Seedance 1.5 Pro?
Structure prompts across four layers: subject definition, dialogue or key sound events, environmental audio cues, and visual style. Seedance 1.5 Pro processes all four simultaneously. See the prompt guide above.