VidScore
ModelsLeaderboardCompareBest ForCalculatorBlog
View Rankings
VidScore

The source of truth for AI video. Objective benchmarks, transparent data.

Platform
  • Leaderboard
  • Models
  • Compare
  • Tools
  • Cost Calculator
Resources
  • Best For Guides
  • Blog
  • Methodology
  • Editorial Policy
About

Objective benchmarks and transparent data for AI video generation. Rankings refreshed weekly.

Read our methodologyRead our editorial policy

© 2026 VidScore. Data updated April 2026.

  1. Home
  2. Models
  3. Text-to-Video

Text-to-video model selection

Best text-to-video models right now

Users searching for text to video ai usually need a shortlist first: which popular models are worth trying, and which one fits the scene. Start with the recommendations, then use the table for trusted price, specs, audio, and provider evidence.

Built for production model selection and text to video leaderboard research. Use this page when the prompt creates the scene; use image-to-video when a reference frame already exists.

See recommendationsCompare specsAPI pricing

Short answer

Pick by the scene brief, not by the cheapest row.

Start with a script or prompt

Kling v3 Pro or Seedance 2.0

You need the model to invent shots, motion, and pacing before you have a source frame.

Need realism or synced dialogue

Veo 3.1

You are paying for polished lighting, believable people, and audiovisual quality.

Need longer API clips

Sora 2

Use it when duration and native audio matter, then check availability before building around it.

Recommended shortlist

Four picks cover most serious text-to-video jobs.

This is the high-signal part of the page. Most users should start with scene fit, audio, duration, and trusted provider evidence; the complete table is there when constraints force a wider search.

01Best for creators
Kling v3 Pro

Use for creator clips that need motion, consistency, and audio.

People, music videos, social clips

Price
$0.11/sec
FAL.ai
Limit
15s
1080p
Audio
Yes
02Best for cinematic realism
Veo 3.1

Use when realism and synced sound matter more than clip length.

Premium ads, presenters, cinematic B-roll

Price
$0.10/sec
FAL.ai
Limit
8s
1080p
Audio
Yes
03Best for audio stories
Seedance 2.0

Use for 15-second dialogue, music, and beat-sync scenes.

Dialogue, music, multi-shot clips

Price
$0.10/sec
WaveSpeed
Limit
15s
1080p
Audio
Yes
04Best for long-form API
Sora 2

Use for clips up to 20 seconds while the API remains available.

Longer clips, native audio, OpenAI workflows

Price
$0.10/sec
OpenAI API
Limit
20s
1080p
Audio
Yes

Full comparison table

Check the complete text-to-video list after the shortlist.

This table is the supporting layer: trusted price, specs, audio, provider evidence, and links to full model profiles. It is intentionally below the recommendations so the page stays decision-first.

ModelTrusted priceSpecsAudioProvider evidence
HunyuanVideo 1.5

Tencent

$0.02/sec

WaveSpeed

720p / 10s

16:9, 9:16

No native audioWaveSpeedVerified / Trust: MEDIUM
Pika 2.0

Pika Labs

$0.04/sec

FAL.ai

1080p / 10s

16:9, 9:16, 1:1

No native audioFAL.aiVerified / Trust: HIGH
Kling 2.5 Turbo

Kuaishou

$0.04/sec

WaveSpeed

1080p / 10s

16:9, 9:16, 1:1

No native audioWaveSpeedVerified / Trust: MEDIUM
LTX-2 Pro

Lightricks

$0.06/sec

FAL.ai

4K / 10s

16:9

Native audioFAL.aiVerified / Trust: HIGH
Kling v3 Standard

Kuaishou

$0.08/sec

FAL.ai

1080p / 15s

16:9, 9:16, 1:1

Native audioFAL.aiVerified / Trust: HIGH
Seedance 2.0

ByteDance

$0.10/sec

WaveSpeed

1080p / 15s

21:9, 16:9, 4:3

Native audioWaveSpeedVerified / Trust: MEDIUM
Sora 2

OpenAI

$0.10/sec

OpenAI API

1080p / 20s

16:9, 9:16, 1:1

Native audioOpenAI APIVerified / Trust: HIGH
Veo 3.1

Google DeepMind

$0.10/sec

FAL.ai

1080p / 8s

16:9, 9:16, 1:1

Native audioFAL.aiVerified / Trust: HIGH
Veo 3.1 Fast

Google DeepMind

$0.10/sec

FAL.ai

1080p / 8s

16:9, 9:16

Native audioFAL.aiVerified / Trust: HIGH
Kling v3 Pro

Kuaishou

$0.11/sec

FAL.ai

1080p / 15s

16:9, 9:16, 1:1

Native audioFAL.aiVerified / Trust: HIGH
Runway Gen-4.5

Runway

$0.12/sec

Runway API

720p / 10s

16:9, 9:16, 1:1

No native audioRunway APIVerified / Trust: HIGH
CogVideoX-5B

Zhipu AI / THUDM

No T2V price

No trusted T2V price

480p / 10s

16:9

No native audioCheck model profile
Grok Imagine Video

xAI

No T2V price

No trusted T2V price

720p / 15s

16:9, 9:16, 1:1

Native audioCheck model profile
Hailuo 02 Pro

MiniMax

No T2V price

No trusted T2V price

1080p / 10s

16:9, 9:16, 1:1

No native audioCheck model profile
HappyHorse 1.0

Unverified / attributed in public reporting to Alibaba ATH

No T2V price

No trusted T2V price

1080p / 10s

16:9, 9:16, 4:3

Native audioCheck model profile
Luma Ray 3

Luma AI

No T2V price

No trusted T2V price

4K (upscaled) / 10s

16:9, 9:16, 1:1

No native audioCheck model profile
Luma Ray2

Luma AI

No T2V price

No trusted T2V price

1080p / 10s

16:9, 9:16, 4:3

Native audioCheck model profile
Minimax Hailuo

MiniMax

No T2V price

No trusted T2V price

1080p / 10s

16:9, 9:16, 1:1

No native audioCheck model profile
Mochi 1

Genmo

No T2V price

No trusted T2V price

480p / 5s

16:9

No native audioCheck model profile
Pika 2.5

Pika Labs

No T2V price

No trusted T2V price

1080p / 25s

16:9, 9:16, 1:1

No native audioCheck model profile
PixVerse V6

PixVerse

No T2V price

No trusted T2V price

1080p / 15s

16:9, 9:16, 1:1

Native audioCheck model profile
Runway Gen-4

Runway

No T2V price

No trusted T2V price

4K (upscale) / 10s

16:9, 9:16, 1:1

No native audioCheck model profile
Seedance 1.5 Pro

ByteDance

No T2V price

No trusted T2V price

1080p / 12s

21:9, 16:9, 4:3

Native audioCheck model profile
SkyReels V4

Skywork AI (Kunlun)

No T2V price

No trusted T2V price

1080p / 15s

16:9, 9:16, 1:1

Native audioCheck model profile
Vidu Q3 Pro

Shengshu Technology

No T2V price

No trusted T2V price

1080p / 16s

16:9, 9:16, 1:1

Native audioCheck model profile
Wan 2.1

Alibaba

No T2V price

No trusted T2V price

720p / 5s

16:9, 9:16, 1:1

No native audioCheck model profile
Wan 2.7

Alibaba

No T2V price

No trusted T2V price

1080p / 15s

16:9, 9:16, 1:1

Native audioCheck model profile