|
|
|
|
|
by tlrobinson
1184 days ago
|
|
Or perhaps a progressive summary, where the most recent messages are full fidelity, and older messages get “compressed” into a summary. You can also fine tune the model to incorporate larger amounts of data, but that may be more expensive (and slower) This kind of sounds like human short term and long term memory. Maybe “fine tuning” is analogous what happens to our memory when we sleep. |
|
Alternatives are maybe architectures using langchain or toolformer to retrieve "memories" from a database by smart fuzzy search. But that's worse, because reasoning would only be done on that context, instead of all memories it ever had.