Hacker News new | ask | show | jobs
by Havoc 1042 days ago
Don't think I've ever heard anyone call it "GRAM" instead of VRAM.

Another cost saving tip: On API, do combo calls where possible to dual use the input tokens. e.g.

"""You are an AI assistant that summarizes text given.

After the summarized text, add the word END.

After that answer the following questions with Yes or NO:

Is the text about Donald Trump?

Is the text about Space? """

Down side is now you need code to parse the output pieces & error handling around that

1 comments

I've found - when using ChatGPT with GPT4 - that sometimes when I ask it to do two things like that it will ignore my request to do one before the other and try to do both at the same time before providing a combined answer, unless I give even more specific instructions along the lines of "do not <do whatever> until after you have entirely finished and answered to completion <first thing>".

Just FYI in case anyone reading your comment tries your suggestion and has same issue, that with more firm instructions the problem can be avoided. Though I've not felt the need to experiment enough to understand exactly where the line is to avoid it trying to start one task too early without being wastefully verbose in the prompt.

Works much better with function calling when you just give it one function it has to call with two non optional arguments.