Hacker News
by dash2 1103 days ago
I don't understand why you can't get explainable models simply by training two LLMs. The first one has to tell the second one what to do (in English). The second one follows English instructions.
1 comment

The work seems to generate per-instance weights that describe the features based on the effect that they have on the outcome. How would you propose to do that with two LLMs?
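
To make "per-instance weights" concrete, here is a minimal sketch. It assumes a plain linear model, which is almost certainly not what the work itself uses; the feature names, weights, and baseline are all made up for illustration. The point is only that each input gets its own set of numbers saying how much each feature moved the output, and those numbers add up to the change in the prediction.

```python
# Hypothetical linear model: names and values below are illustrative assumptions,
# not taken from the work being discussed.
FEATURES = ["age", "income", "tenure"]
WEIGHTS = [0.5, -2.0, 1.0]
BIAS = 0.25


def predict(x):
    """Linear prediction: bias plus the weighted sum of the features."""
    return BIAS + sum(w * v for w, v in zip(WEIGHTS, x))


def per_instance_weights(x, baseline):
    """Contribution of each feature for this one input, relative to a baseline input."""
    return {
        name: w * (v - b)
        for name, w, v, b in zip(FEATURES, WEIGHTS, x, baseline)
    }


x = [2.0, 1.0, 3.0]
baseline = [0.0, 0.0, 0.0]  # assumed reference point
contrib = per_instance_weights(x, baseline)

for name, value in contrib.items():
    print(f"{name}: {value:+.2f}")

# The contributions account for the whole difference from the baseline prediction.
print(predict(x) - predict(baseline), sum(contrib.values()))
```

A different input would give different contributions, which is what makes them per-instance. An English instruction passed between two LLMs does not obviously give you numbers with that additive property.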