| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by zaptrem 2129 days ago
	Would it be possible to substitute this for GPT2/BERT? Or is that a different type of language model? Can the pre-trained language model be fine-tuned? I’m using DeepSpeech to transcribe long-form lecture audio, and have just assumed there would be a massive improvement once they noise-harden the models with 1.0.

1 comments

nshm 2124 days ago

GPT2 is not a good language model but there are things like XLM. Mozilla DeepSpeech doesn't support XLM rescoring, other toolkits do and it gives great improvement in accuracy. If you care about accurate transcription you'd better consider alternatives.

link

zaptrem 2120 days ago

I didn't know any other ML-based open source transcription engines existed? I can't seem to find them on Google.

link