| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by r_lee 120 days ago

because it's not easy to identify exactly when to r/w memory accordingly, especially when you'd need to have an LLM decide when and if to do that

and to scale it in a way where you don't need a whole custom model loaded for 1 user (financially unviable)

just my immediate thoughts, could be wrong though.