by MrZander
377 days ago
This doesn't square with my understanding of transformers at all. I'm not aware of any human labeling in the training. What would labeling even do for an LLM (setting multimodal aside)? The whole point of attention is that it uses the existing text to determine which tokens are related to which other tokens, no?
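To make concrete what I mean by "uses existing text": as I understand the next-token pretraining objective, the training targets are just the text itself shifted by one position. A rough sketch, where `next_token_pairs` and `context_size` are names I made up for illustration, and whitespace splitting stands in for a real tokenizer:

```python
def next_token_pairs(tokens, context_size):
    """Build (context, target) pairs from raw tokens alone.

    Hypothetical helper for illustration: each target is simply the
    token that follows its context in the original text.
    """
    pairs = []
    for i in range(1, len(tokens)):
        # Context is up to `context_size` tokens immediately before position i.
        context = tokens[max(0, i - context_size):i]
        pairs.append((context, tokens[i]))
    return pairs


# Whitespace splitting is an assumption; real models use subword tokenizers.
tokens = "the cat sat on the mat".split()
pairs = next_token_pairs(tokens, context_size=3)

for context, target in pairs:
    print(context, "->", target)
```

Every target here comes straight out of the source text, so nobody has to annotate anything for this particular objective.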