Local Whisper transcription with VTT output. Used by /audio transcribe and the AAD pipeline; supports diarisation.
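As an illustration of the VTT output mentioned above, the following is a minimal sketch of how timed segments, optionally labelled with a speaker from diarisation, could be serialised as WebVTT. It does not call Whisper and is not this tool's actual code; `Segment`, `format_timestamp`, and `segments_to_vtt` are hypothetical names introduced only for this example.

```python
from typing import Iterable, NamedTuple, Optional


class Segment(NamedTuple):
    """Hypothetical container for one transcribed span (not the tool's own type)."""
    start: float                   # start time in seconds
    end: float                     # end time in seconds
    text: str                      # transcribed text
    speaker: Optional[str] = None  # speaker label, if diarisation supplied one


def format_timestamp(seconds: float) -> str:
    """Render seconds as a WebVTT timestamp, HH:MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def segments_to_vtt(segments: Iterable[Segment]) -> str:
    """Build a WebVTT document: header, blank line, then one cue per segment."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        lines.append(f"{format_timestamp(seg.start)} --> {format_timestamp(seg.end)}")
        text = seg.text.strip()
        if seg.speaker:
            # WebVTT voice span: marks who is speaking in this cue.
            text = f"<v {seg.speaker}>{text}"
        lines.append(text)
        lines.append("")  # a blank line terminates each cue
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [
        Segment(0.0, 1.25, "Hello there.", "Alice"),
        Segment(1.25, 3.0, "Hi, how are you?", "Bob"),
    ]
    print(segments_to_vtt(demo))
```

The sketch leaves out escaping of `&` and `<` in cue text and does not guard against text containing `-->`, both of which a real writer would need to handle.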