|
|
|
|
|
by grey-area
100 days ago
|
|
At this point it’s pretty easy to detect unaltered LLM output because it is such bad writing. That will change over time with training I would hope. At some point I imagine it will be hard to tell. I honestly don’t know what sites like this will do when that happens and the only way of detecting LLMs is that they are subtly wrong or post too much, we’d be overrun with them. Not sure if we should be hopefully or fearful that they will improve to be undetectable but I suspect they will. |
|
There's precious little training material left that isn't generated by LLMs themselves.
Consider this to be model collapse (i.e. we might be at the best SOTA possible with the approach we use today - any further training is going to degrade it).