Hacker News new | ask | show | jobs
I ran GLM-5.1 on a 16GB RAM machine (github.com)
10 points by drunkonvinyl 22 days ago
1 comments

Beautiful. Twenty seconds per token is funny, but the only direction is up from here!