|
|
|
|
|
by obblekk
482 days ago
|
|
The open source ranking library is really interesting. It's using a type of merge sort where the comparator function is an llm comparing (but doing batches >2 for fewer calls). Reducing problems to document ranking is effectively a type of test-time search - also very interesting! I wonder if this approach could be combined with GRPO to create more efficient chain of thought search... https://github.com/BishopFox/raink?tab=readme-ov-file#descri... |
|
https://news.ycombinator.com/item?id=43175658