by imtringued 702 days ago
LLMs are reaching saturation on even some of the latest benchmarks and yet I am still a little disappointed by how they perform in practice.

They are by no means bad, but I am now mostly interested in long-context competency. We need benchmarks that force the LLM to complete multiple tasks simultaneously in one very long session.

1 comment

I don't know anything about AI, but there's one thing I want it to do for me: design a long-term, full-body exercise program based on the parameters I give it, such as available equipment, past workouts, and goals. I haven't had much success with ChatGPT, but I assume what you're talking about is relevant to my goals.
Aren't there apps that already do this, like Fitbod?
Fitbod might do the trick. Thanks! The availability of equipment was a difficult thing for me to incorporate into a fitness program.