|
|
|
|
|
by allan_s
4279 days ago
|
|
for the case of "one sentence detection" you can use Tatoeba project database dump http://tatoeba.org/eng/downloads you have a CSV of iso code => sentence , which should be 99% accurate (as it gets user proofed), so on in which you can compare your tool with. I think for longer text one could use Wikipedia dump or alike ? |
|