jarvis-video-stt

Jarvis-Video-STT - 批量视频语音转文字工具。 基于Faster-Whisper,支持多进程并行、进度条、汇总报告。 **触发场景**: - 用户需要将视频中的语音转换为文字/字幕 - 批量处理多个视频 - 需要生成SRT字幕或纯文本 - 需要处理报告查看结果统计 **使用方式**: 1. 确认已安装依赖: pip install faster-whisper tqdm 2. 确认ffmpeg已安装 3. 执行命令或调用主脚本 **支持格式**:MP4, MKV, AVI, MOV **输出**: - .srt 文件:带时间戳的字幕格式 - .txt 文件:纯文本全文 - report.json:机器可读的汇总报告 - report.md:人类可读的汇总报告

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "jarvis-video-stt" with this command: npx skills add chongjie-ran/jarvis-video-stt

Jarvis-Video-STT Skill

快速开始

1. 安装依赖

pip install faster-whisper tqdm 确保ffmpeg已安装 (brew install ffmpeg on macOS)

2. 基本用法

medium模式(高精度,推荐): python ~/.openclaw/workspace-researcher/tools/jarvis-video-stt/batch_whisper.py -i videos/*.mp4 -o results -m medium

small模式(快速): python ~/.openclaw/workspace-researcher/tools/jarvis-video-stt/batch_whisper.py -i videos/*.mp4 -o results -m small

指定语言(略快): python batch_whisper.py -i videos/ -o results -m medium -l zh

调整并行数: python batch_whisper.py -i videos/ -o results -w 4

3. 参数说明

参数简写说明默认值
--input-i视频路径/文件夹/通配符必填
--output-o输出目录output
--model-msmall/mediummedium
--language-l语言代码,None=自动None
--workers-w并行进程数3
--cpu-强制使用CPUFalse

4. 输出文件

每个视频生成:

  • 视频名.srt - 带时间戳字幕
  • 视频名.txt - 纯文本

整体生成:

  • report.json - JSON汇总报告
  • report.md - Markdown汇总报告

性能参考

模型一小时视频(单进程)推荐并行
small~2分钟4进程
medium~5分钟3进程
large-v3~8分钟2进程

适用场景

  • 课程视频转文字
  • 电影/纪录片字幕生成
  • 播客/访谈转录
  • 短视频内容分析
  • 视频内容检索预处理

故障排除

Q: 报 faster-whisper 找不到? pip install faster-whisper

Q: 报 ffmpeg 找不到? brew install ffmpeg (macOS) apt install ffmpeg (Ubuntu)

Q: Mac显存不足? 减少并行数:-w 2

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

视频字幕自动生成器——免费的才是最好的

自动提取视频音频,识别生成带时间戳的文字稿,输出SRT/VTT字幕及带字幕的视频,并智能提炼视频标题。

Registry Source
2281Profile unavailable
General

Gettr Transcribe

Download audio from a GETTR post or streaming page and transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT). Use when given a GE...

Registry SourceRecently Updated
5030Profile unavailable
General

Fanqie Novel AI Video Drama Batch Generator

Batch generates complete AI video dramas from multi-chapter novels with covers, titles, tags, and outputs for TikTok serialization.

Registry SourceRecently Updated
800Profile unavailable
General

Local Transcription

Local speech-to-text transcription with Qwen ASR — transcription routed across your Apple Silicon fleet. Transcribe meetings, voice notes, podcasts with loca...

Registry SourceRecently Updated
1581Profile unavailable