Hacker News
by pu_pe 262 days ago
I'm curious about it too. I think there are two bottlenecks: one is that training a relatively large LLM can be resource-intensive (so people go for RAG and other shortcuts), and the other is that making it finetuned to your use cases might make it dumber overall.

> making it finetuned to your use cases might make it dumber overall.

LoRA doesn't overwrite weights.
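
To make that concrete, here is a rough sketch in plain Python, not how any particular library implements it. The sizes, the names (`W`, `A`, `B`, `effective_weights`) and the "trained" values of `B` are all made up for illustration, and the usual `alpha / r` scaling factor is left out. The base matrix stays frozen and only the small factors would be trained, but the weights actually used in the forward pass are `W + B @ A`.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def effective_weights(W, A, B):
    """Weights used in the forward pass: frozen W plus the low-rank update B @ A."""
    delta = matmul(B, A)
    return [[w + d for w, d in zip(rw, rd)] for rw, rd in zip(W, delta)]

random.seed(0)
d, r = 4, 1  # hypothetical sizes: a d x d layer with a rank-r adapter

# Frozen base weights; a copy is kept so we can check they never change.
W = [[random.gauss(0, 1) for _ in range(d)] for _ in range(d)]
W_before = [row[:] for row in W]

# LoRA factors: A is r x d, B is d x r. B starts at zero, so the adapter is a no-op.
A = [[random.gauss(0, 1) for _ in range(d)] for _ in range(r)]
B = [[0.0] for _ in range(d)]
W_eff_initial = effective_weights(W, A, B)

# Stand-in for training: only B is changed here (values are invented).
B_trained = [[0.5], [-0.5], [0.0], [0.25]]
W_eff_trained = effective_weights(W, A, B_trained)

print("base weights untouched:", W == W_before)
print("effective weights changed:", W_eff_trained != W)
```

So nothing is overwritten and the adapter can be dropped to get the original model back, but while it is applied the layer computes with different effective weights.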

Do you need to overwrite weights to produce the effect I mentioned above?
Good point