Advanced Video Downloader
Overview
This skill provides comprehensive video downloading and transcription capabilities from 1000+ platforms including YouTube, Bilibili, TikTok, Twitter, Instagram, and more. It combines:
- •yt-dlp: Powerful video downloading tool
- •SiliconFlow API: Free AI-powered transcription to convert videos to Markdown
When to Use This Skill
Activate this skill when the user:
- •Explicitly requests to download a video ("download this video", "下载视频")
- •Provides video URLs from any platform
- •Mentions saving videos for offline viewing
- •Wants to extract audio from videos
- •Needs to download multiple videos or playlists
- •Asks about video quality options
- •Requests video transcription ("转录视频", "提取字幕", "视频转文字")
- •Wants to convert video/audio to text or Markdown
- •Asks to download AND transcribe a video in one workflow
Core Capabilities
1. Single Video Download
Download individual videos from any supported platform with automatic quality selection.
Example usage:
User: "Download this YouTube video: https://youtube.com/watch?v=abc123" User: "下载这个B站视频: https://bilibili.com/video/BV1xxx"
2. Batch & Playlist Download
Download multiple videos or entire playlists at once.
Example usage:
User: "Download all videos from this playlist" User: "Download these 3 videos: [URL1], [URL2], [URL3]"
3. Audio Extraction
Extract audio only from videos, saving as MP3 or M4A.
Example usage:
User: "Download only the audio from this video" User: "Convert this video to MP3"
4. Quality Selection
Choose specific video quality (4K, 1080p, 720p, etc.).
Example usage:
User: "Download in 4K quality" User: "Get the 720p version to save space"
5. Video/Audio Transcription
Convert video or audio files to Markdown text using SiliconFlow's free AI transcription API.
Example usage:
User: "Transcribe this video to text" / "转录这个视频" User: "Download and transcribe this YouTube video" User: "将这个音频转成文字" User: "Extract transcript from this MP4 file"
Supported formats:
- •Audio: MP3, WAV, M4A, FLAC, AAC, OGG, OPUS, WMA
- •Video: MP4, AVI, MOV, MKV, FLV, WMV, WEBM, M4V
Response Pattern
When a user requests video download:
Step 1: Identify the Platform and URL(s)
# Extract video URL(s) from user message # Identify platform: YouTube, Bilibili, TikTok, etc.
Step 2: Check Tool Availability
# Check if yt-dlp is installed yt-dlp --version
Step 3: Select Appropriate yt-dlp Command
Based on platform and requirements:
- •YouTube, Twitter, Instagram, TikTok: Basic command works
- •Bilibili: Basic command works for most videos
- •Quality selection: Use
-fwith height filter - •Audio only: Use
-x --audio-format mp3 - •Playlists: Use playlist-specific output template
Step 4: Execute Download
Use yt-dlp directly with appropriate options:
# Basic download (best quality MP4) yt-dlp -f "bestvideo[ext=mp4]+bestaudio[ext=m4a]/best[ext=mp4]/best" --merge-output-format mp4 -o "%(title)s.%(ext)s" "VIDEO_URL" # Specific quality (1080p) yt-dlp -f "bestvideo[height<=1080]+bestaudio/best[height<=1080]" --merge-output-format mp4 -o "%(title)s.%(ext)s" "VIDEO_URL" # Audio only (MP3) yt-dlp -x --audio-format mp3 -o "%(title)s.%(ext)s" "VIDEO_URL" # With cookies file (for protected content) yt-dlp --cookies cookies.txt -o "%(title)s.%(ext)s" "VIDEO_URL" # Playlist download yt-dlp -o "%(playlist)s/%(playlist_index)s - %(title)s.%(ext)s" "PLAYLIST_URL"
Step 5: Report Results
After download completes, report:
- •✅ Video title and duration
- •✅ File size and format
- •✅ Save location
- •✅ Download speed and time taken
- •⚠️ Any warnings or quality limitations
Example output:
✅ Downloaded: "Video Title Here" Duration: 15:30 Quality: 1080p MP4 Size: 234 MB Location: ./Video Title Here.mp4 Time: 45 seconds at 5.2 MB/s
Transcription Response Pattern
When a user requests video/audio transcription:
Step 1: Check Prerequisites
# Verify SiliconFlow API key is available echo $SILICONFLOW_API_KEY # Or user must provide via --api-key parameter
API Key Setup:
- •Get free API key from: https://cloud.siliconflow.cn/account/ak
- •Copy
.env.exampleto.envand add your API key - •Or set environment variable:
SILICONFLOW_API_KEY=sk-xxx
Step 2: Validate File
Ensure the file exists and is a supported format (audio or video).
Step 3: Execute Transcription
Use the bundled script scripts/transcribe_siliconflow.py:
# Basic transcription python scripts/transcribe_siliconflow.py --file video.mp4 --api-key sk-xxx # With custom output path python scripts/transcribe_siliconflow.py --file audio.mp3 --output transcript.md --api-key sk-xxx # Using environment variable for API key python scripts/transcribe_siliconflow.py --file video.mp4
Step 4: Report Transcription Results
✅ Transcription complete! File: video.mp4 Output: 2025-01-15-video.md Size: 12.5 KB Preview: -------------------------------------------------- [First 200 characters of transcription...] --------------------------------------------------
Combined Workflow: Download + Transcribe
For requests like "Download and transcribe this video":
# Step 1: Download video yt-dlp -f "bestvideo[ext=mp4]+bestaudio[ext=m4a]/best" --merge-output-format mp4 -o "%(title)s.%(ext)s" "VIDEO_URL" # Step 2: Transcribe the downloaded file python scripts/transcribe_siliconflow.py --file "Downloaded Video Title.mp4" --api-key sk-xxx
Platform-Specific Notes
YouTube
- •Fully supported by yt-dlp
- •No authentication needed for public videos
- •Supports all quality levels including 4K/8K
Bilibili
- •Supported by yt-dlp
- •High-quality downloads may require login cookies
- •Use
--cookieswith cookies.txt for member-only content
Other Platforms
- •Most platforms work well with yt-dlp
- •Check
references/supported_platforms.mdfor full list
Handling Cookies for Protected Content
For platforms requiring authentication (Bilibili VIP, member-only content, etc.):
Method 1: Export Cookies File (Recommended)
# Use browser extension "Get cookies.txt LOCALLY" # Export cookies.txt, then: yt-dlp --cookies cookies.txt "VIDEO_URL"
Method 2: Manual Cookies File
# Create cookies.txt in Netscape format # Use browser extension "Get cookies.txt LOCALLY" # Then use with yt-dlp yt-dlp --cookies cookies.txt "VIDEO_URL"
Troubleshooting
Issue: Video quality lower than expected
Solution:
- •Check if platform requires login for HD
- •Use
--cookies cookies.txtfor authenticated access - •Explicitly specify quality with
-fparameter
Issue: Download very slow
Solution:
- •Check internet connection
- •Try different time of day (peak hours affect speed)
- •Use
--concurrent-fragmentsfor faster downloads
Issue: "Video unavailable" or geo-restricted
Solution:
- •Video may be region-locked
- •Use proxy/VPN if legally permitted
- •Check if video is still available on platform
Issue: Transcription API key error
Solution:
- •Verify API key starts with
sk- - •Get free key from: https://cloud.siliconflow.cn/account/ak
- •Set environment variable:
SILICONFLOW_API_KEY=sk-xxx
Issue: Transcription returns empty text
Solution:
- •Check if audio/video has clear speech
- •Verify file format is supported
- •File may be too short or contain only music
Common Commands
Quality Presets
# 4K (2160p) yt-dlp -f "bestvideo[height<=2160]+bestaudio/best[height<=2160]" --merge-output-format mp4 "VIDEO_URL" # 1080p (Full HD) yt-dlp -f "bestvideo[height<=1080]+bestaudio/best[height<=1080]" --merge-output-format mp4 "VIDEO_URL" # 720p (HD) yt-dlp -f "bestvideo[height<=720]+bestaudio/best[height<=720]" --merge-output-format mp4 "VIDEO_URL" # 480p (SD) yt-dlp -f "bestvideo[height<=480]+bestaudio/best[height<=480]" --merge-output-format mp4 "VIDEO_URL"
Audio Extraction
# Extract as MP3 yt-dlp -x --audio-format mp3 -o "%(title)s.%(ext)s" "VIDEO_URL" # Extract as M4A (better quality) yt-dlp -x --audio-format m4a -o "%(title)s.%(ext)s" "VIDEO_URL"
Batch Downloads
# Download multiple URLs from file yt-dlp -a urls.txt # Download playlist with custom naming yt-dlp -o "%(playlist)s/%(playlist_index)s - %(title)s.%(ext)s" "PLAYLIST_URL" # Download channel's videos yt-dlp -o "%(uploader)s/%(title)s.%(ext)s" "CHANNEL_URL"
Bundled Resources
Configuration
.env.example
Template for environment variables. Copy to .env and add your SiliconFlow API key.
Scripts
scripts/transcribe_siliconflow.py
AI-powered transcription script using SiliconFlow's free API.
Usage:
python scripts/transcribe_siliconflow.py --file <audio/video> [--api-key <key>] [--output <path>]
Parameters:
- •
--file, -f: Input audio/video file (required) - •
--api-key, -k: SiliconFlow API key (or useSILICONFLOW_API_KEYenv var) - •
--output, -o: Output markdown file path (default:YYYY-MM-DD-filename.md) - •
--model, -m: Model to use (default:FunAudioLLM/SenseVoiceSmall)
References
references/supported_platforms.md
Comprehensive list of 1000+ supported platforms with platform-specific notes and requirements.
references/quality_formats.md
Detailed explanation of video formats, codecs, and quality selection strategies.
Tips for Best Results
- •Always specify quality if user has preference - saves bandwidth and storage
- •Batch downloads save time - use playlist URLs when possible
- •Audio extraction is faster - recommend for podcast/music content
- •Check file size before downloading - warn user for very large files (>1GB)
- •Transcription works best with clear audio - consider extracting audio first for better results