|
|
|
|
|
by RayVR
1119 days ago
|
|
restricting the distribution of potential output imposes a cost. "Alignment" here likely refers to aligning the model to the desired safety parameters. I'm not in the llm research business but I would expect that the best and worst/most dangerous outputs come from the tails of distributions. I imagine the tuning for safety often results in fewer really good and really bad answers by trimming these tails. Edit:
I asked chatGPT4: https://chat.openai.com/share/a2c7d380-c6eb-4745-b91d-c3996a... |
|