Hacker News new | ask | show | jobs
by nperez 1001 days ago
Ah, 175b it is then. That's actually kind of encouraging that an open source trillion+ param model isn't needed to get there.

A 180b Falcon model very recently came out https://huggingface.co/tiiuae/falcon-180B A finetune of this might be competitive, but I haven't tried it.