|
|
|
|
|
by algo_trader
779 days ago
|
|
If we had a magical (fast) oracle for grading responses, have people done search/expert iteration for LLMs? Specifically for codegen, i am playing with an iterative interpreter that can quickly (re)evaluate a tree of similar responses |
|