I believe it is not only garbage collection. It is also doing backpropagation on the previous day's memories. After 8 hours you get an updated, more optimized service.
This is the insight that everyone comparing LLM parameter counts to human neurons or synapses is missing. The human model gets a new version every day, while the digital one takes a year and $5B of energy to do the same.