ai-video-generation

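Generate AI videos with Google Veo, Seedance, Wan, Grok, and 40+ other models via the inference.sh CLI. Models include: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities cover: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, fole…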

SKILL.md
--- frontmatter
name: ai-video-generation
description: |-
  Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, fole...
allowed-tools: Bash(infsh *)
risk: safe
source: community
license: MIT

AI Video Generation

When to Use

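Use this skill when a task calls for generating or post-processing video from the command line: text-to-video, image-to-video, avatar animation and lipsync, upscaling, adding sound effects, or merging clips.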

Generate videos with 40+ AI models via inference.sh CLI.

Quick Start

bash
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login

# Generate a video with Veo
infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
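
Prompts that contain double quotes or backslashes will break the inline JSON used above. The following is a minimal sketch of one way to build the `--input` payload in plain shell. `json_escape` and `prompt_input` are hypothetical helper names, not part of the infsh CLI, and the escaping covers only backslashes and double quotes (not newlines or other control characters).

```bash
#!/usr/bin/env bash
# Hypothetical helpers (not part of infsh): build a {"prompt": ...} payload.

# Escape backslashes first, then double quotes, so the text can sit inside a JSON string.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Wrap an escaped prompt in the single-field JSON object used in the Quick Start.
prompt_input() {
  printf '{"prompt": "%s"}' "$(json_escape "$1")"
}

# Build and print the payload instead of calling the CLI.
payload=$(prompt_input 'a sign that reads "OPEN" at dusk')
printf '%s\n' "$payload"
```

The resulting string can then be passed along as `infsh app run google/veo-3-1-fast --input "$payload"`.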

Available Models

Text-to-Video

| Model | App ID | Best For |
|-------|--------|----------|
| Veo 3.1 Fast | google/veo-3-1-fast | Fast, with optional audio |
| Veo 3.1 | google/veo-3-1 | Best quality, frame interpolation |
| Veo 3 | google/veo-3 | High quality with audio |
| Veo 3 Fast | google/veo-3-fast | Fast with audio |
| Veo 2 | google/veo-2 | Realistic videos |
| Grok Video | xai/grok-imagine-video | xAI, configurable duration |
| Seedance 1.5 Pro | bytedance/seedance-1-5-pro | With first-frame control |
| Seedance 1.0 Pro | bytedance/seedance-1-0-pro | Up to 1080p |

Image-to-Video

| Model | App ID | Best For |
|-------|--------|----------|
| Wan 2.5 | falai/wan-2-5 | Animate any image |
| Wan 2.5 I2V | falai/wan-2-5-i2v | High quality i2v |
| Seedance Lite | bytedance/seedance-1-0-lite | Lightweight 720p |

Avatar / Lipsync

| Model | App ID | Best For |
|-------|--------|----------|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 | Multi-character |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 | Single character |
| Fabric 1.0 | falai/fabric-1-0 | Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync | Realistic lipsync |

Utilities

| Tool | App ID | Description |
|------|--------|-------------|
| HunyuanVideo Foley | infsh/hunyuanvideo-foley | Add sound effects to video |
| Topaz Upscaler | falai/topaz-video-upscaler | Upscale video quality |
| Media Merger | infsh/media-merger | Merge videos with transitions |

Browse All Video Apps

bash
infsh app list --category video
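
When scripting against several of the apps listed under Available Models, it can help to keep the task-to-app mapping in one place. The sketch below is one possible way to do that. `pick_app` and its task keywords are hypothetical names chosen for illustration, and the script only prints the command rather than calling the CLI.

```bash
#!/usr/bin/env bash
# Hypothetical helper: map a task keyword to an app ID taken from the model tables.
pick_app() {
  case "$1" in
    text-fast)    echo "google/veo-3-1-fast" ;;
    text-quality) echo "google/veo-3-1" ;;
    image)        echo "falai/wan-2-5" ;;
    avatar)       echo "bytedance/omnihuman-1-5" ;;
    lipsync)      echo "falai/pixverse-lipsync" ;;
    upscale)      echo "falai/topaz-video-upscaler" ;;
    foley)        echo "infsh/hunyuanvideo-foley" ;;
    merge)        echo "infsh/media-merger" ;;
    *)            echo "unknown task: $1" >&2; return 1 ;;
  esac
}

# Dry run: show the command that would be executed for the chosen task.
app=$(pick_app text-fast)
echo "infsh app run $app --input '{\"prompt\": \"drone shot flying over a forest\"}'"
```

An unknown keyword returns a non-zero status, so a calling script can stop before invoking the CLI with an empty app ID.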

Examples

Text-to-Video with Veo

bash
infsh app run google/veo-3-1-fast --input '{
  "prompt": "A timelapse of a flower blooming in a garden"
}'

Grok Video

bash
infsh app run xai/grok-imagine-video --input '{
  "prompt": "Waves crashing on a beach at sunset",
  "duration": 5
}'

Image-to-Video with Wan 2.5

bash
infsh app run falai/wan-2-5 --input '{
  "image_url": "https://your-image.jpg"
}'

AI Avatar / Talking Head

bash
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Fabric Lipsync

bash
infsh app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

PixVerse Lipsync

bash
infsh app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'
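
The three avatar and lipsync examples above take the same two fields, so one payload can be reused to compare the apps on identical inputs. A minimal sketch, assuming the URLs contain no characters that need JSON escaping; `talking_head_input` is a hypothetical helper name, and the loop prints the commands instead of running them.

```bash
#!/usr/bin/env bash
# Hypothetical helper: build the image + audio payload shared by the examples above.
talking_head_input() {
  printf '{"image_url": "%s", "audio_url": "%s"}' "$1" "$2"
}

image="https://portrait.jpg"   # placeholder URL from the examples
audio="https://speech.mp3"     # placeholder URL from the examples
input=$(talking_head_input "$image" "$audio")

# Dry run: print one command per app instead of calling the CLI.
for app in bytedance/omnihuman-1-5 falai/fabric-1-0 falai/pixverse-lipsync; do
  printf "infsh app run %s --input '%s'\n" "$app" "$input"
done
```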

Video Upscaling

bash
infsh app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'

Add Sound Effects (Foley)

bash
infsh app run infsh/hunyuanvideo-foley --input '{
  "video_url": "https://silent-video.mp4",
  "prompt": "footsteps on gravel, birds chirping"
}'

Merge Videos

bash
infsh app run infsh/media-merger --input '{
  "videos": ["https://clip1.mp4", "https://clip2.mp4"],
  "transition": "fade"
}'
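
When the clip list is produced by a script, the `videos` array has to be assembled rather than typed by hand. The sketch below shows one way to do that in plain shell, assuming the URLs need no JSON escaping. `merger_input` is a hypothetical helper name, and the field names are the ones used in the Merge Videos example.

```bash
#!/usr/bin/env bash
# Hypothetical helper: build the media-merger payload from a transition and a list of clip URLs.
merger_input() {
  local transition=$1
  shift
  local list="" url
  for url in "$@"; do
    # Join the quoted URLs with ", " to form the JSON array body.
    list="${list:+$list, }\"$url\""
  done
  printf '{"videos": [%s], "transition": "%s"}' "$list" "$transition"
}

# Build and print the payload instead of calling the CLI.
input=$(merger_input fade "https://clip1.mp4" "https://clip2.mp4")
printf '%s\n' "$input"
```

The result can be passed along as `infsh app run infsh/media-merger --input "$input"`.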

Related Skills

bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh

# Google Veo specific
npx skills add inference-sh/skills@google-veo

# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video

# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech

# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation

# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation

Browse all apps: infsh app list

Documentation