Y
Hacker News
new
|
ask
|
show
|
jobs
by
QuadrupleA
1182 days ago
Yeah seems spotty. Especially considering recent "chinchilla scaling" laws suggesting training set size is generally the current bottleneck, the mileage llama/alpaca gets out of 7b/13b, the huge inference cost of 1T, etc.