| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by mirekrusin 39 days ago

non synthetic pre-training text got exhausted long time ago, now focus is more on quality and rl/post-training.

cost will keep going down and more powerful chips will be available, like it always was.

reinforcement learning doesn't have fixed ceiling, advancements in software will keep happening as well.

things like distilling smaller models will likely become free – ie. through speculative decoding which speeds up larger models (incentive to run during inference) where you have free access to strong distillation (large model's logits are calculated, ie. it's free for pickup for distillation) etc.

1 comments

LtWorf 39 days ago

> cost will keep going down and more powerful chips will be available, like it always was.

cost is not a monotonous function.

link

mirekrusin 38 days ago

compute and memory cost is monotonic with temporary hiccups

link

LtWorf 37 days ago

I figure you aren't aware of why there are other kinds of interpolation other than linear.

link