Hacker News
by machiaweliczny, 656 days ago
Yes, exactly. The trick is to have enough tough data that you find the optimal one. I think that as we scale models back down to smaller sizes, we will discover viable/correct representations.