Initial commit with translated description
This commit is contained in:
49
SKILL.md
Normal file
49
SKILL.md
Normal file
@@ -0,0 +1,49 @@
|
||||
---
|
||||
name: local-whisper
|
||||
description: "使用OpenAI Whisper进行本地语音转文字。模型下载后可完全离线运行。具备多种模型尺寸的高质量转录。"
|
||||
metadata: {"clawdbot":{"emoji":"🎙️","requires":{"bins":["ffmpeg"]}}}
|
||||
---
|
||||
|
||||
# Local Whisper STT
|
||||
|
||||
Local speech-to-text using OpenAI's Whisper. **Fully offline** after initial model download.
|
||||
|
||||
## Usage
|
||||
|
||||
```bash
|
||||
# Basic
|
||||
~/.clawdbot/skills/local-whisper/scripts/local-whisper audio.wav
|
||||
|
||||
# Better model
|
||||
~/.clawdbot/skills/local-whisper/scripts/local-whisper audio.wav --model turbo
|
||||
|
||||
# With timestamps
|
||||
~/.clawdbot/skills/local-whisper/scripts/local-whisper audio.wav --timestamps --json
|
||||
```
|
||||
|
||||
## Models
|
||||
|
||||
| Model | Size | Notes |
|
||||
|-------|------|-------|
|
||||
| `tiny` | 39M | Fastest |
|
||||
| `base` | 74M | **Default** |
|
||||
| `small` | 244M | Good balance |
|
||||
| `turbo` | 809M | Best speed/quality |
|
||||
| `large-v3` | 1.5GB | Maximum accuracy |
|
||||
|
||||
## Options
|
||||
|
||||
- `--model/-m` — Model size (default: base)
|
||||
- `--language/-l` — Language code (auto-detect if omitted)
|
||||
- `--timestamps/-t` — Include word timestamps
|
||||
- `--json/-j` — JSON output
|
||||
- `--quiet/-q` — Suppress progress
|
||||
|
||||
## Setup
|
||||
|
||||
Uses uv-managed venv at `.venv/`. To reinstall:
|
||||
```bash
|
||||
cd ~/.clawdbot/skills/local-whisper
|
||||
uv venv .venv --python 3.12
|
||||
uv pip install --python .venv/bin/python click openai-whisper torch --index-url https://download.pytorch.org/whl/cpu
|
||||
```
|
||||
Reference in New Issue
Block a user