|
|
|
|
|
by pu_pe
149 days ago
|
|
Every time I see some complex orchestration like this, I feel that the authors should have compared it to simpler alternatives. One of the metrics they use is that human review suggests the system is right 83% of the time. How much performance would they achieve by just having a reasoning "judge" decide without all the other procedure? |
|