|
|
|
|
|
by leminimal
1083 days ago
|
|
Are there project-based tutorial that talks more about neural net architecture, hyperparameters selection and debugging? Something that walks through getting poor results and make explicit the reasoning for tweaking? When I try to use transformers or any AI thing on a toy problem I come up with, it never works. Even Fizz-Buzz which I thought was easy doesn't work (because division or modulo is apparently hard to represent for NNs). And there's this blackbox of training that's hard to debug into. Yes, for the available resources, if you pick the exact same problem, the exact same NN architecture and exact same hyperparameters, it all works out. But surely they didn't get that on the first try. So what's the tweaking process? Somehow this point isn't often talked about in courses and consequently the ones who've passed this hurdle don't get their experience transferred. I'd follow an entire course on this if it were available. An HN commenter linked me to this https://karpathy.github.io/2019/04/25/recipe/ which is exactly on point. But it'd be great if it were one or more tutorials with a specific example, wrapped in code and peppered with many failures. |
|
https://mlu-explain.github.io/neural-networks/
See also here:
http://playground.tensorflow.org/