VidScore

The source of truth for AI video. Objective benchmarks, transparent data.

Objective benchmarks and transparent data for AI video generation. Rankings refreshed weekly.

© 2026 VidScore. Data updated April 2026.

Image-to-video model selection

Best image-to-video models right now

People searching for image-to-video AI usually need two things: which popular models are worth trying, and which one fits their job. Start with the recommendations, then use the table for specs, provider pricing, and evidence.

This page is built for production model selection and for researching safe, free AI image-to-video generators. VidScore does not rank NSFW or no-restrictions generators here.

Short answer

Pick by the source image, not by the cheapest row.

Start with a face or character

Kling v3 Pro or Seedance 2.0

You need identity to survive camera movement, expression changes, and audio timing.

Start with a product image

Runway Gen-4

You care about preserving the object, framing, and brand composition more than native audio.

Start with a cinematic frame

Veo 3.1

You are paying for realism, lighting, and polished audiovisual output rather than cheap iteration.

Recommended shortlist

Four picks cover most serious image-to-video jobs.

This is the high-signal part of the page. The full model list is useful, but most users should begin with these four options and widen the search only when trusted price, provider, or workflow constraints force it.

01. Best for creators: Kling v3 Pro

Use for creator clips that need motion, consistency, and audio.

  • Use cases: People, music videos, social clips
  • Price: $0.11/sec (FAL.ai)
  • Limit: 15s, 1080p
  • Audio: Yes

02. Best for cinematic realism: Veo 3.1

Use when realism and synced sound matter more than clip length.

  • Use cases: Premium ads, presenters, cinematic B-roll
  • Price: $0.40/sec (FAL.ai)
  • Limit: 8s, 1080p
  • Audio: Yes

03. Best for product control: Runway Gen-4

Use when the source image must stay controlled and stable.

  • Use cases: Products, ecommerce, visual edits
  • Price: $0.05/sec (Runway API)
  • Limit: 10s, 4K (upscale)
  • Audio: No

04. Best for audio stories: Seedance 2.0

Use for 15-second dialogue, music, and beat-sync scenes.

  • Use cases: Dialogue, music, multi-shot clips
  • Price: $0.24/sec (WaveSpeed)
  • Limit: 15s, 1080p
  • Audio: Yes
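
Per-second prices and clip limits interact, so it can help to turn the shortlist figures into a rough cost per maximum-length clip. The sketch below is illustrative only: it copies the prices and limits from the four cards above and assumes billing is strictly linear per second, with no minimum charge, rounding rule, or resolution surcharge (the page does not state how providers bill). The names `SHORTLIST`, `max_clip_cost`, and `cost_table` are hypothetical.

```python
# Shortlist figures copied from the cards above:
# model name -> (price in USD per second, maximum clip length in seconds).
SHORTLIST = {
    "Kling v3 Pro": (0.11, 15),
    "Veo 3.1": (0.40, 8),
    "Runway Gen-4": (0.05, 10),
    "Seedance 2.0": (0.24, 15),
}


def max_clip_cost(price_per_sec, limit_sec):
    """Cost of one clip at the maximum length.

    Assumption: billing is linear per second with no minimum charge.
    """
    return round(price_per_sec * limit_sec, 2)


def cost_table(models=SHORTLIST):
    """Map each model name to the cost of one maximum-length clip."""
    return {name: max_clip_cost(p, s) for name, (p, s) in models.items()}


if __name__ == "__main__":
    # Print the models from cheapest to most expensive per full-length clip.
    for name, cost in sorted(cost_table().items(), key=lambda kv: kv[1]):
        print(f"{name}: ${cost:.2f} per max-length clip")
```

Under these assumptions, a full-length Seedance 2.0 clip (15s) costs more than a full-length Veo 3.1 clip (8s) even though its per-second price is lower, so compare per-clip cost at the length you actually need.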

Full comparison table

Check the complete image-to-video list after the shortlist.

This table is the supporting layer: trusted price, specs, audio, provider evidence, and links to full model profiles. It is intentionally below the recommendations so the page stays decision-first.

| Model | Developer | Trusted price | Provider | Resolution / max length | Aspect ratios | Audio | Provider evidence |
|---|---|---|---|---|---|---|---|
| HunyuanVideo 1.5 | Tencent | $0.02/sec | WaveSpeed | 720p / 10s | 16:9, 9:16 | No native audio | WaveSpeed, Verified / Trust: MEDIUM |
| FramePack | HuggingFace Community (lllyasviel) | $0.03/sec | FAL.ai | 720p / 60s | 16:9, 4:3, 1:1 | No native audio | FAL.ai, Verified / Trust: HIGH |
| Pika 2.0 | Pika Labs | $0.04/sec | FAL.ai | 1080p / 10s | 16:9, 9:16, 1:1 | No native audio | FAL.ai, Verified / Trust: HIGH |
| Kling 2.5 Turbo | Kuaishou | $0.04/sec | WaveSpeed | 1080p / 10s | 16:9, 9:16, 1:1 | No native audio | WaveSpeed, Verified / Trust: MEDIUM |
| Runway Gen-4 | Runway | $0.05/sec | Runway API | 4K (upscale) / 10s | 16:9, 9:16, 1:1 | No native audio | Runway API, Verified / Trust: HIGH |
| LTX-2 Pro | Lightricks | $0.06/sec | FAL.ai | 4K / 10s | 16:9 | Native audio | FAL.ai, Verified / Trust: HIGH |
| Kling v3 Pro | Kuaishou | $0.11/sec | FAL.ai | 1080p / 15s | 16:9, 9:16, 1:1 | Native audio | FAL.ai, Verified / Trust: HIGH |
| Runway Gen-4.5 | Runway | $0.12/sec | Runway API | 720p / 10s | 16:9, 9:16, 1:1 | No native audio | Runway API, Verified / Trust: HIGH |
| Seedance 2.0 | ByteDance | $0.24/sec | WaveSpeed | 1080p / 15s | 21:9, 16:9, 4:3 | Native audio | WaveSpeed, Verified / Trust: MEDIUM |
| Veo 3.1 | Google DeepMind | $0.40/sec | FAL.ai | 1080p / 8s | 16:9, 9:16, 1:1 | Native audio | FAL.ai, Verified / Trust: HIGH |
| CogVideoX-5B | Zhipu AI / THUDM | No I2V price | No trusted I2V price | 480p / 10s | 16:9 | No native audio | Check model profile |
| Grok Imagine Video | xAI | No I2V price | No trusted I2V price | 720p / 15s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
| Hailuo 02 Pro | MiniMax | No I2V price | No trusted I2V price | 1080p / 10s | 16:9, 9:16, 1:1 | No native audio | Check model profile |
| HappyHorse 1.0 | Unverified / attributed in public reporting to Alibaba ATH | No I2V price | No trusted I2V price | 1080p / 10s | 16:9, 9:16, 4:3 | Native audio | Check model profile |
| Kling v3 Standard | Kuaishou | No I2V price | No trusted I2V price | 1080p / 15s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
| Luma Ray 3 | Luma AI | No I2V price | No trusted I2V price | 4K (upscaled) / 10s | 16:9, 9:16, 1:1 | No native audio | Check model profile |
| Luma Ray2 | Luma AI | No I2V price | No trusted I2V price | 1080p / 10s | 16:9, 9:16, 4:3 | Native audio | Check model profile |
| Minimax Hailuo | MiniMax | No I2V price | No trusted I2V price | 1080p / 10s | 16:9, 9:16, 1:1 | No native audio | Check model profile |
| Pika 2.5 | Pika Labs | No I2V price | No trusted I2V price | 1080p / 25s | 16:9, 9:16, 1:1 | No native audio | Check model profile |
| PixVerse V6 | PixVerse | No I2V price | No trusted I2V price | 1080p / 15s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
| Seedance 1.5 Pro | ByteDance | No I2V price | No trusted I2V price | 1080p / 12s | 21:9, 16:9, 4:3 | Native audio | Check model profile |
| SkyReels V4 | Skywork AI (Kunlun) | No I2V price | No trusted I2V price | 1080p / 15s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
| Sora 2 | OpenAI | No I2V price | No trusted I2V price | 1080p / 20s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
| Veo 3.1 Fast | Google DeepMind | No I2V price | No trusted I2V price | 1080p / 8s | 16:9, 9:16 | Native audio | Check model profile |
| Vidu Q3 Pro | Shengshu Technology | No I2V price | No trusted I2V price | 1080p / 16s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
| Wan 2.1 | Alibaba | No I2V price | No trusted I2V price | 720p / 5s | 16:9, 9:16, 1:1 | No native audio | Check model profile |
| Wan 2.7 | Alibaba | No I2V price | No trusted I2V price | 1080p / 15s | 16:9, 9:16, 1:1 | Native audio | Check model profile |
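
When the shortlist does not fit, the priced rows of the comparison table can be filtered by hard requirements before price is considered. The following is a minimal sketch, not a VidScore tool: it hard-codes the ten rows that carry a trusted price, and the names `Row`, `PRICED_ROWS`, and `filter_rows` are hypothetical. Rows without a trusted I2V price are left out because they cannot be ranked by cost.

```python
from typing import NamedTuple, Optional


class Row(NamedTuple):
    model: str
    price_per_sec: float  # USD per second, from the "Trusted price" column
    resolution: str
    max_sec: int          # maximum clip length in seconds
    native_audio: bool


# The ten rows with a trusted price, copied from the comparison table.
PRICED_ROWS = [
    Row("HunyuanVideo 1.5", 0.02, "720p", 10, False),
    Row("FramePack", 0.03, "720p", 60, False),
    Row("Pika 2.0", 0.04, "1080p", 10, False),
    Row("Kling 2.5 Turbo", 0.04, "1080p", 10, False),
    Row("Runway Gen-4", 0.05, "4K (upscale)", 10, False),
    Row("LTX-2 Pro", 0.06, "4K", 10, True),
    Row("Kling v3 Pro", 0.11, "1080p", 15, True),
    Row("Runway Gen-4.5", 0.12, "720p", 10, False),
    Row("Seedance 2.0", 0.24, "1080p", 15, True),
    Row("Veo 3.1", 0.40, "1080p", 8, True),
]


def filter_rows(rows, need_audio=False, min_sec=0,
                max_price: Optional[float] = None):
    """Keep rows that meet every requirement, cheapest first."""
    kept = []
    for r in rows:
        if need_audio and not r.native_audio:
            continue  # job needs native audio, model has none
        if r.max_sec < min_sec:
            continue  # clip limit is shorter than the job requires
        if max_price is not None and r.price_per_sec > max_price:
            continue  # per-second price is over budget
        kept.append(r)
    return sorted(kept, key=lambda r: r.price_per_sec)


if __name__ == "__main__":
    # Example: a 15-second clip that needs native audio.
    for r in filter_rows(PRICED_ROWS, need_audio=True, min_sec=15):
        print(f"{r.model}: ${r.price_per_sec:.2f}/sec, {r.resolution}, {r.max_sec}s")
```

Filtering first and sorting by price second keeps the choice driven by the job, in line with the advice to pick by the source image rather than by the cheapest row.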