by throwaway2245 2066 days ago
But computers must have access to far more than a single (fluent) person's exposure to Glaswegian/Scots.

Yes, BUT that's not taken into account in these tests. What you get to train on is fixed beforehand, and, shall we say, "there are known problems" on this front.

So the benchmarks say how well model X does on this exact transcription task given this exact training data, and no other knowledge.

Even basic things, like the mix of female and male voices, don't match between the train and test sets.