Hacker News new | ask | show | jobs
by sebzim4500 847 days ago
If consumer cards can run the big models, then datacenter cards will be able to efficiently run the really big models.
1 comments

Some tasks we are using LLMs for are performing very close to GPT-4 levels using 7B models, so really depends on what value you are looking to get.