Hacker News new | ask | show | jobs
by wlsaidhi 496 days ago
Is it possible to setup a MLLM pipeline to play other roblox games and use that as another evaluation?
1 comments

I think it's totally possible. Multimodal reasoning eval would be fun to consider too