Audio Transcription Pipeline
VAD-first, evidence-backed Whisper transcription running locally on Apple Silicon. Produces high-quality transcripts with zero hallucinations, correct domain terminology, and LLM-polished output.
Architecture: Audio → ffmpeg (16kHz WAV) → Silero-VAD (speech segmentation) → MLX Whisper → Dictionary replacement → Claude LLM correction → Output.
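As a rough illustration of the first and fourth stages above, here is a minimal sketch of audio normalization and dictionary replacement. The function names (`build_ffmpeg_command`, `to_wav`, `apply_dictionary`) and the sample dictionary entries are hypothetical, not taken from the project. Mono output and 16-bit PCM encoding are assumptions; the architecture line only specifies 16 kHz WAV.

```python
import re
import subprocess
from pathlib import Path


def build_ffmpeg_command(src, dst):
    """Return ffmpeg arguments that convert any input to 16 kHz WAV."""
    return [
        "ffmpeg",
        "-y",                 # overwrite dst if it already exists
        "-i", str(src),
        "-ac", "1",           # mono (assumption, not stated in the source)
        "-ar", "16000",       # 16 kHz sample rate, as in the architecture line
        "-c:a", "pcm_s16le",  # 16-bit PCM (assumption)
        str(dst),
    ]


def to_wav(src, dst):
    """Run ffmpeg; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
    return Path(dst)


def apply_dictionary(text, replacements):
    """Replace known mis-transcriptions, whole words only, ignoring case."""
    if not replacements:
        return text
    # Longest keys first so multi-word entries win over shorter ones.
    keys = sorted(replacements, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(k) for k in keys) + r")(?!\w)",
        re.IGNORECASE,
    )
    lookup = {k.lower(): v for k, v in replacements.items()}
    return pattern.sub(lambda m: lookup[m.group(1).lower()], text)


if __name__ == "__main__":
    # Illustrative dictionary; a real one would hold domain terminology.
    terms = {"silero vad": "Silero-VAD", "m l x": "MLX"}
    print(apply_dictionary("we ran silero vad on m l x whisper", terms))
```

The VAD, Whisper, and LLM correction stages are left out here because they depend on downloaded models and an API key. Matching on word boundaries keeps a short dictionary key from rewriting the inside of a longer, unrelated word.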
Prerequisites
- macOS with Apple Silicon (M1/M2/M3/M4) — required for MLX
- Python 3.9+ — Xcode Python at `/L
[Description truncated. See the full README on GitHub.]