|
|
|
|
|
by Havoc
1042 days ago
|
|
Don't think I've ever heard anyone call it "GRAM" instead of VRAM. Another cost saving tip: On API, do combo calls where possible to dual use the input tokens. e.g. """You are an AI assistant that summarizes text given. After the summarized text, add the word END. After that answer the following questions with Yes or NO: Is the text about Donald Trump? Is the text about Space?
""" Down side is now you need code to parse the output pieces & error handling around that |
|
Just FYI in case anyone reading your comment tries your suggestion and has same issue, that with more firm instructions the problem can be avoided. Though I've not felt the need to experiment enough to understand exactly where the line is to avoid it trying to start one task too early without being wastefully verbose in the prompt.