Hailuo 02 Pro
physics-simulationsaction-sequencescommercialMiniMax · Noise-aware Compute Redistribution (NCR) · v02 ProverifiedVerified
$0.08/sec
starting from, on FAL.ai
Resolution
1080p
Duration
6–10s
Providers
2
API Pricing
Why Hailuo 02 Pro?
thumb_upStrengths
- Industry-leading physics simulation — accurately renders fluid dynamics, cloth, collisions, debris, and acrobatics
- Affordable Pro tier at $0.08/sec on FAL.ai — a 6-second 1080p video costs just $0.48
- Director-level camera control with support for 10+ camera movements via bracketed keywords
- Strong temporal coherence with reduced flicker and warping compared to competitors
- Flexible image-to-video with both first-frame and end-frame anchoring for precise control
infoLimitations
- No native audio generation — output is video-only
- Maximum 10 seconds per generation — shorter than many competing models
- No lip-sync or digital human mode
- Not self-deployable (closed source, no model weights available)
- Fixed 25 fps output with no higher frame rate option
auto_fix_highPrompt Guide
- 1Structure prompts as a director's script, not a checklist — MiniMax's LLM backbone thrives on narrative flow and temporal relationships rather than comma-separated adjectives.
- 2Use the formula: [Camera Shot + Motion] + [Subject + Description] + [Action] + [Scene + Description] + [Lighting] + [Style/Mood] for consistent, high-quality output.
- 3Choose precise verbs and adverbs for motion — 'The sports car aggressively accelerates, careening around the corner' produces dramatically better motion than 'The sports car drives fast.'
- 4Use director-level camera keywords in square brackets — supports pan, truck, push, pedestal, tilt, zoom, shake, tracking, and static shots with up to 3 movements per prompt.
- 5For image-to-video workflows, generate a key image first (e.g., with Hailuo 2.3), then animate it with Hailuo 02 Pro — this image-first approach reduces visual drift and improves consistency.
✓ Do this
- Enable the prompt optimizer for automatic refinement and safety checking — it consistently improves output quality over raw prompts
- Include detailed facial expressions and body language — from subtle smiles to dramatic reactions, these details significantly improve character quality
- Avoid excessive storytelling in one sentence — break complex narratives into modular prompts for better output control
- Use high-resolution input images (min 300px shorter side) for image-to-video to ensure smoother animation and better detail
- Keep prompts clear and restrained — overloading with excessive visual detail reduces frame-to-frame consistency
✗ Avoid this
- Maximum duration is 10 seconds per generation — longer sequences require multiple clips
- No native audio generation — video output is silent
- Text rendering within video frames is unreliable
- Fixed 25 fps output — no frame rate selection available
- Very fast camera movements may cause temporal artifacts in single-shot clips
Example Prompts
“[Tracking shot] A parkour athlete in a black hoodie sprints across rain-slicked rooftops at dusk. He vaults over an AC unit, rolls on impact, and leaps across a gap between buildings. Camera follows from behind. Neon city lights reflect off wet surfaces. Cinematic, gritty urban aesthetic.”
“[Push in] A ceramic vase shatters in slow motion against a concrete wall. Shards spiral outward, catching golden hour sunlight. Camera slowly pushes toward the impact point. Dust particles hang in the air. Studio lighting, shallow depth of field.”
“[Static shot] An elderly painter sits before a canvas in a sunlit atelier. She mixes cobalt blue on her palette, lifts her brush, and applies a confident stroke. Paint glistens wet on the canvas. Warm natural window light, medium close-up, shallow depth of field.”
Based on the official prompt guide →
FAQexpand_more
How much does Hailuo 02 Pro cost?
From $0.08/sec on FAL.ai. A 5-second video ≈ $0.40.
Where can I use Hailuo 02 Pro?
Via API on FAL.ai and WaveSpeed.
How do I get good results with Hailuo 02 Pro?
Structure prompts as a director's script, not a checklist — MiniMax's LLM backbone thrives on narrative flow and temporal relationships rather than comma-separated adjectives. See the prompt guide below.