by dimitry12
553 days ago
"1B solver + 8B verifier + search" beating 0-shot 70B is nice, agreed. But "1B solver + 8B verifier + search" beating 1B 0-shot or 1B majority voting as baselines isn't illustrative imo. In other words, by using a larger verifier, HF's replication fails to establish a "fair" baseline. Still an awesome blog post and release/repository from HF's group - I love it!