Hacker News new | ask | show | jobs
by thomasahle 491 days ago
I guess the most interpretable is to have as shallow a model as possible, but with longer cot. It would be quite interesting seeing the trade-off between the two. Though, unfortunately, deeper is probably better.