by r_lee
120 days ago
Because it's not easy to identify exactly when to read or write memory, especially when you'd need an LLM to decide when and whether to do that. You'd also have to scale it so you don't need a whole custom model loaded for one user, which is financially unviable. Just my immediate thoughts, could be wrong though.