|
|
|
|
|
by pgkr
504 days ago
|
|
Correct. The bias is baked into the weights of both V3 and R1, even in the largest 671B parameter model. We're currently conducting analysis on the 671B model running locally to cut through the speculation, and we're seeing interesting biases, including differences between V3 and R1. Meanwhile, we've released the first part of our research including the dataset: https://news.ycombinator.com/item?id=42879698 |
|