
Seedance 2.0

New multi‑shot video model with strong consistency.

Overview

Next‑gen video model focused on multi‑shot storytelling with multimodal reference inputs, audio‑visual sync, and strong character consistency.

Highlights

  • Multi‑shot narrative generation with consistent characters.
  • Multimodal references (image, video, text, and audio).
  • Audio‑visual beat matching for timing and rhythm.
  • High‑resolution outputs up to 2K.

Quick Specifications

Image inputs: Up to 9 images
Video inputs: Up to 3 videos (max 15s total)
Audio inputs: Up to 3 MP3 files (max 15s total)
Text input: Natural‑language prompts
Output duration: 4–15 seconds (selectable)
Audio output: Native sound effects + music
Total files per run: 12 uploads per generation
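
The upload limits above can be checked before submitting a generation. Below is a minimal, hypothetical pre‑flight validator (the `Asset` class and `validate_uploads` helper are illustrative, not part of any official SDK) that encodes the published limits: up to 9 images, up to 3 videos totaling 15s, up to 3 MP3s totaling 15s, and 12 files per run overall.

```python
from dataclasses import dataclass

@dataclass
class Asset:
    kind: str                # "image", "video", or "audio"
    duration_s: float = 0.0  # only meaningful for video/audio

def validate_uploads(assets: list[Asset]) -> list[str]:
    """Return a list of limit violations (empty list means the set is valid)."""
    errors = []
    images = [a for a in assets if a.kind == "image"]
    videos = [a for a in assets if a.kind == "video"]
    audios = [a for a in assets if a.kind == "audio"]

    if len(assets) > 12:
        errors.append("more than 12 files in one generation")
    if len(images) > 9:
        errors.append("more than 9 images")
    if len(videos) > 3:
        errors.append("more than 3 videos")
    if sum(v.duration_s for v in videos) > 15:
        errors.append("video references exceed 15s total")
    if len(audios) > 3:
        errors.append("more than 3 audio files")
    if sum(a.duration_s for a in audios) > 15:
        errors.append("audio references exceed 15s total")
    return errors
```

Running the check on, say, two videos totaling 18 seconds would flag only the total‑duration limit, while a valid set returns an empty list.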

Best for

  • Cinematic previews
  • Multi‑scene storytelling

Inputs & Outputs

Inputs: Text, image, video, and audio references
Outputs: Video

Multi‑shot teaser

Generate a cinematic multi‑scene preview.

[Example pair: concept reference vs. generated multi‑shot teaser]

Reference Guide

Seedance 2.0 uses an @‑mention system to assign each uploaded asset a role in the generation.

Modes

  • First / Last Frame: Use a starting (or ending) frame plus a prompt.
  • Universal Reference: Mix images, video, audio, and text in one prompt.

Syntax Example

@Image1 as the first frame, reference @Video1 for camera movement, use @Audio1 for music.

Examples

  • Set first frame: @Image1 as the first frame
  • Reference motion: Reference @Video1 for choreography
  • Copy camera work: Follow @Video1's camera movements and transitions
  • Add music / rhythm: Use @Audio1 for background music
  • Extend a video: Extend @Video1 by 5 seconds
  • Replace character: Replace the woman in @Video1 with @Image1
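
Prompts like these can also be assembled programmatically once each upload is mapped to its @‑mention label. The sketch below is hypothetical (the `mention_labels` helper and the extension‑based type detection are assumptions, not a documented API); it only illustrates the @Image1 / @Video1 / @Audio1 numbering convention shown above.

```python
def mention_labels(files: list[str]) -> dict[str, str]:
    """Map each uploaded file to its @-mention label, numbered per media type."""
    counts = {"image": 0, "video": 0, "audio": 0}
    labels = {}
    for name in files:
        # Naive type detection by extension, for illustration only.
        kind = ("video" if name.endswith(".mp4")
                else "audio" if name.endswith(".mp3")
                else "image")
        counts[kind] += 1
        labels[name] = f"@{kind.capitalize()}{counts[kind]}"
    return labels

files = ["hero.png", "dance.mp4", "track.mp3"]
labels = mention_labels(files)
prompt = (f"{labels['hero.png']} as the first frame, "
          f"reference {labels['dance.mp4']} for camera movement, "
          f"use {labels['track.mp3']} for music.")
# prompt == "@Image1 as the first frame, reference @Video1 for camera movement, use @Audio1 for music."
```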

Capabilities

Enhanced base quality

  • Improved physics realism for objects and motion.
  • Smoother temporal continuity across frames.
  • Stronger instruction following for complex prompts.
Example prompt: "A girl hangs laundry, reaching into the basket and shaking out the next piece."

Multimodal reference system

  • Extract motion patterns from reference video.
  • Lock visual style with reference images.
  • Drive rhythm and mood from audio tracks.

Character & object consistency

  • Stable facial identity across shots.
  • Preserves logos, text, and product details.
  • Maintains scene coherence without style drift.

Motion & camera replication

  • Replicate choreography, action, or dance.
  • Match camera moves (dolly, tracking, handheld).
  • Copy editing rhythm and transitions.

Video extension & editing

  • Extend existing videos while preserving narrative flow.
  • Replace characters or props without re‑rendering.
  • Re‑style scenes while keeping motion.

Audio‑synchronized generation

  • Sync dialogue lip‑motion to audio.
  • Match sound effects to on‑screen actions.
  • Follow musical beats for pacing.

Use Cases

Cinematic previews

Showcase multi‑shot narratives.

Brand storytelling

Create multi‑scene launch teasers.

Concept videos

Prototype full sequences fast.

Applications

Advertising & e‑commerce

Build product demos that mirror brand assets while adding new scenes.

Content localization

Generate multi‑language versions with native lip sync.

Storyboards → video

Animate storyboard panels into short sequences.

Template‑based creation

Reference an existing style and rebuild it with new content.

Best Practices

  1. Be explicit about what each reference controls (style, motion, camera, character).
  2. Prioritize the most important assets within the 12‑file limit.
  3. Double‑check @‑mentions to avoid swapping files.
  4. Specify edit vs. reference when using an existing video.
  5. Align generation duration with intended extension length.
  6. Write prompts like you are briefing a human editor.

Frequently Asked Questions

What makes it different from single‑shot models?

Seedance 2.0 focuses on multi‑shot storytelling with consistent characters across scenes.

What reference inputs are supported?

It supports multimodal references such as text, images, video, and audio.

What resolution does it target?

High‑resolution outputs up to 2K are supported.