Transcribe any YouTube video or MP4 to text. Powered by yt-dlp + OpenAI Whisper.
Transcribe any YouTube video or direct MP4/video URL to text using yt-dlp for audio extraction and OpenAI Whisper for transcription. Supports multiple output formats: plain text, SRT subtitles, or JSON with timestamps. Auto-detects language or accepts a specific language code.
| Parameter | Type | Description |
|---|---|---|
| url | string (required) | YouTube URL or direct MP4/video URL |
| language | string (optional) | Language code (auto, en, pt, es, fr, deβ¦). Default: auto |
| format | string (optional) | Output format: text (default), srt (subtitles), json (with segments) |
$0.020 per video up to 30 minutes
$0.040 per video up to 2 hours
Payments via x402 micropayment on Base (USDC).