by robrenaud 35 days ago
There is some recent work on modularizing knowledge in LLMs.

https://arxiv.org/html/2605.06663v1

It might be possible to train a big generalist that is a composition of modules, some of which can be dropped dynamically at inference time, depending on the prompt.
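To make the idea concrete, here is a toy sketch of prompt-dependent module dropping. It is not the method from the linked paper: the `Module` class, the keyword-overlap gate, and the module names are all hypothetical stand-ins for whatever learned routing a real system would use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    """Hypothetical knowledge module; keywords stand in for a learned gate."""
    name: str
    keywords: frozenset = frozenset()
    always_on: bool = False  # e.g. a shared core that is never dropped


def select_modules(prompt, modules, min_overlap=1):
    """Keep modules whose keywords overlap the prompt; drop the rest."""
    tokens = set(prompt.lower().split())
    return [
        m for m in modules
        if m.always_on or len(tokens & m.keywords) >= min_overlap
    ]


modules = [
    Module("core", always_on=True),
    Module("chemistry", frozenset({"chemical", "molecule", "reaction"})),
    Module("law", frozenset({"contract", "statute", "liability"})),
]

active = select_modules("balance this chemical reaction", modules)
active_names = [m.name for m in active]
print(active_names)
```

Only the modules in `active` would need to be loaded or executed for this prompt, which is where the inference-time savings would come from.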

1 comments

Cool, thanks for sharing. I am thinking about creating a series of smaller models for specific purposes and then orchestrating them so that they mirror the human brain, which is a collection of subsystems that each give their own opinion about the same stimulus.
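A rough sketch of what that orchestration could look like, assuming each small model returns a label plus a confidence and the orchestrator combines them with a confidence-weighted vote. The specialist names and the `orchestrate` function are made up for illustration; real specialists would be model calls rather than lambdas.

```python
from collections import defaultdict


def orchestrate(stimulus, specialists):
    """Ask every specialist about the same stimulus, then combine opinions."""
    # Each specialist returns (label, confidence in [0, 1]).
    opinions = {name: fn(stimulus) for name, fn in specialists.items()}

    # Confidence-weighted vote: sum the confidence behind each label.
    totals = defaultdict(float)
    for label, confidence in opinions.values():
        totals[label] += confidence

    decision = max(totals, key=totals.get)
    return decision, opinions


# Hypothetical specialists with hard-coded opinions.
specialists = {
    "vision": lambda s: ("approach", 0.6),
    "threat": lambda s: ("avoid", 0.9),
    "memory": lambda s: ("approach", 0.5),
}

decision, opinions = orchestrate("unfamiliar dog", specialists)
print(decision, opinions)
```

Keeping the individual opinions around, rather than only the final decision, makes it possible to see when the subsystems disagree.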
Interesting direction. I’ve also been thinking about modular / subsystem-based approaches for specialized tasks in small AI systems.