Y
Hacker News
new
|
ask
|
show
|
jobs
by
GaggiX
1060 days ago
Not just a projection layer but also Q-former, in this case it was already trained for that specific vision encoder but if you change it you would need to train a Q-former from scratch.
1 comments
famouswaffles
1060 days ago
Not for mini gpt-4 but it's just a projection layer for many others(like Llava). The Qformer isn't a necessary part of the equation.
link