Hacker News new | ask | show | jobs
by ethan_smith 329 days ago
WhisperX works better because it implements a robust VAD (Voice Activity Detection) preprocessing step that effectively filters out silence segments before they reach the model, preventing the hallucination triggers entirely.