Hacker News
by marc0 4547 days ago
I must say I find this fascinating, especially two aspects. First, it is a great idea to train on whole number sequences instead of single characters. That is certainly a lesson that applies in many other situations, too. Second, 11 levels and deep learning is a bold approach, since (from what I know) the paradigm is that deep networks generally do not work well. They even found that performance improves with the number of levels (they just stopped at 11, probably because of resource limitations). As mentioned in the paper, this is probably because the network is trained on a huge dataset, so the size of the dataset really is the relevant factor.