AiHubMix Documentation Hub

Seedance 是字节豆包推出的多模态视频生成模型系列,支持基于文本生成 480p / 720p / 1080p 视频,并可接受图像、视频、音频作为参考输入。其多模态参考能力(首尾帧锁定、镜头语言迁移、音频驱动)是当前业内最完整的视频生成方案之一。通过 AIHubMix 调用 Seedance,无需注册其他账号,使用统一的 API Key 即可。

未注册请先创建 AIHubMix 账号,在 API Keys 页面生成 Key。

接口说明

Seedance 视频生成为长耗时任务,采用异步调用:提交任务后立即返回任务标识,生成完成后通过任务接口取回视频地址。

POST https://aihubmix.com/v1/videos

生成耗时通常在数十秒至数分钟之间,具体取决于时长、分辨率和参考媒体数量。建议在 AIHubMix 控制台查看任务状态与历史视频。

提交任务

最简单的视频生成请求,仅需 model、prompt、duration 三个字段。

curl -X POST "https://aihubmix.com/v1/videos" \
  -H "Authorization: Bearer $AIHUBMIX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seedance-2-0-fast-260128",
    "prompt": "东京公寓窗户上,雨水滑落的特写镜头,慢动作,霓虹倒影,电影感",
    "ratio": "16:9",
    "duration": 8,
    "watermark": false
  }'

视频文件有效期为 24 小时,生成完成后请及时下载至自有存储。

多模态参考输入

Seedance 的核心差异化在于 5 种参考媒体角色,可在单次请求中组合使用:

Role	说明	典型用途
`first_frame`	首帧图(锁定开场画面)	图生视频,控制起点
`last_frame`	尾帧图(锁定结尾画面)	配合 `first_frame` 锁定首尾
`reference_image`	通用参考图	角色一致性、产品 lookalike、风格参考
`reference_video`	参考视频	镜头运动 / 运镜风格迁移
`reference_audio`	参考音频	背景音乐节奏同步、音频驱动剪辑

多模态参考上限

单次请求最多支持:

9 张图像(first_frame / last_frame / reference_image 合计)
3 段视频(reference_video)
3 段音频(reference_audio)

完整示例:商业级广告生成

下例为一支 11 秒的第一人称水果茶产品广告:沿用参考视频的镜头语言、以参考音频作为背景音乐、首尾帧锁定到指定参考图。

curl -X POST "https://aihubmix.com/v1/videos" \
  -H "Authorization: Bearer $AIHUBMIX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seedance-2-0-260128",
    "prompt": "Use the first-person POV framing from Video 1 throughout, and use Audio 1 as the background music. Create a first-person fruit tea commercial.\n\nOpening frame: Image 1. From a first-person perspective, your hand picks a dew-covered Aksu red apple.\n\nSeconds 2-4: Fast-paced cuts. Your hand drops freshly cut apple chunks into a shaker, adds ice and tea base, then shakes vigorously. Voiceover: \"Freshly cut, freshly shaken.\"\n\nSeconds 4-6: First-person close-up. The layered fruit tea is poured into a clear cup. A pink label is placed on the cup.\n\nSeconds 6-8: First-person hand raises the drink toward the camera. Voiceover: \"Take a refreshing sip.\"\n\nFinal frame: Freeze on Image 2.",
    "content": [
      {
        "type": "image_url",
        "image_url": { "url": "https://example.com/apple-pick.jpg" },
        "role": "first_frame"
      },
      {
        "type": "image_url",
        "image_url": { "url": "https://example.com/final-product.jpg" },
        "role": "last_frame"
      },
      {
        "type": "video_url",
        "video_url": { "url": "https://example.com/pov-style.mp4" },
        "role": "reference_video"
      },
      {
        "type": "audio_url",
        "audio_url": { "url": "https://example.com/bg-music.mp3" },
        "role": "reference_audio"
      }
    ],
    "ratio": "16:9",
    "duration": 11,
    "generate_audio": true,
    "watermark": false
  }'

请求参数

字段	类型	必填	默认	说明
`model`	string	是	—	Seedance 模型 ID,见下方可用模型表
`prompt`	string	是	—	文本提示词,上限 4000 字符,推荐 500 字以内。建议使用分镜式描述
`content`	array	否	`[]`	参考媒体数组,详见上文
`ratio`	string	否	`"16:9"`	画面比例:`16:9` / `9:16` / `1:1` / `4:3` / `3:4` / `21:9`
`resolution`	string	否	`"720p"`	输出分辨率:`480p` / `720p` / `1080p`(取决于模型)
`duration`	int	否	`5`	视频时长(秒),范围 2-15
`generate_audio`	bool	否	`false`	是否生成原生音频(口型同步等)
`return_last_frame`	bool	否	`false`	返回末帧图像,用于续接生成
`seed`	int	否	随机	复现性种子,相同种子 + 相同输入可复现结果
`watermark`	bool	否	因模型而异	是否添加水印

可用模型

模型 ID	分辨率	单价(美元/秒)	适用场景
`doubao-seedance-2-0-260128`	480p / 720p	0.071 / 0.154	旗舰多模态参考,商业级广告
`doubao-seedance-2-0-fast-260128`	480p / 720p	0.055 / 0.113	同上,生成更快,适合迭代
`doubao-seedance-1-5-pro-251215`	480p / 720p / 1080p	0.0225 / 0.0487 / 0.11	多角色对白 + 环境音同步
`doubao-seedance-1-0-pro-250528`	480p / 720p / 1080p	0.021 / 0.044 / 0.102	影视级 1080p 叙事
`doubao-seedance-1-0-pro-fast-251015`	480p / 720p / 1080p	0.0062 / 0.0123 / 0.0298	当前业内最低成本 1080p 视频 API

含参考视频的请求计费更高(Seedance 2.0 系列):480p

0.158/秒、720p

0.341/秒。

Prompt 编写建议

使用分镜式描述:Seconds 0-2: ... / Seconds 2-4: ...,Seedance 在结构化脚本上表现最佳
明确镜头语言:first-person POV、close-up push-in、tracking shot from left
首尾帧锁定:用 first_frame + last_frame 控制起止画面,中间运动由模型补帧
音频驱动节奏:传入 reference_audio 并在 prompt 中说明「sync cuts to the music」
商业用途:watermark: false
续接长视频:开启 return_last_frame: true,将返回的末帧作为下一次请求的 first_frame

Seedance 2.0 API 完整使用指南(2026 版)

接口说明

提交任务

多模态参考输入

多模态参考上限

完整示例:商业级广告生成

请求参数

可用模型

Prompt 编写建议

通过第三方客户端使用

相关页面

Documentation Index

​接口说明

​提交任务

​多模态参考输入

​多模态参考上限

​完整示例:商业级广告生成

​请求参数

​可用模型

​Prompt 编写建议

​通过第三方客户端使用

​相关页面

接口说明

提交任务

多模态参考输入

多模态参考上限

完整示例:商业级广告生成

请求参数

可用模型

Prompt 编写建议

通过第三方客户端使用

相关页面