|
|
|
|
|
by m00dy
178 days ago
|
|
May I ask your internal benchmark ? I'm building a new set of benchmarks and testing suite for agentic workflows using deepwalker [0]. How do you design your benchmark suite ? would be really cool if you can give more details. [0] https://deepwalker.xyz |
|
But pretty rudimentary, nothing special. Also did not know about deepwalker, looks quite interesting - you building it?