|
|
|
|
|
by plaidthunder
3 hours ago
|
|
It seems like there's an opportunity to embed identity information into tokens themselves, the way we embed sequence information. The trouble is... it's quite a challenge to train. Sequence is easy to derive for any corpus of data, but identity is not. https://usize.github.io/blog/2026/april/why-no-ai-coworkers.... > In similar fashion to how sequence information is embedded within input tensors, an approach called “Instructional Segment Embedding”2 adds a parallel embedding channel for identity information. This gives models real awareness of provenance. And it works. But they only tested three fixed categories: system, user, data. Interesting paper that touches on the idea here: https://arxiv.org/abs/2410.09102 |
|