|
|
|
|
|
by rohanmehta1
849 days ago
|
|
> But this is because larger models tend to be better, not because throwing more compute at an existing model helps it produce better results. There's a caveat here - allowing the model to produce more tokens (i.e. giving it more compute time to "think") can produce better results. E.g. asking a model to reason before producing an answer, leads to better answers. And the extra tokens = more compute. |
|
Different prompting techniques like what you're describing are one way, and RAG [0] and ART [1] are also in a similar category.
[0] https://stackoverflow.blog/2023/10/18/retrieval-augmented-ge...
[1] https://www.promptingguide.ai/techniques/art