| > Can these models feasibly be run locally? Bluntly, no. The models which are small enough to run locally perform so badly it’s not worth bothering. To run inference on the large models the perform decently you need the equivalent of two or three top end graphics cards. If you're serious about looking into it now, consider looking at this project that lets you run a bunch of independent machines as a cluster for inference using Bloom: https://github.com/bigscience-workshop/petals/wiki/Launch-yo... (You'll need around 200GB of GPU memory across the machines in the swarm) |
I am asking as I once had a Markov-chain IRC bot* and while it often struggled to string together a sentence, it was quite hilarious sometimes. Absolutely pointless other than the occasional laugh.
Can it form sentences or are those small models completely unusable for anything?
I'm not thinking OpenAI level uses - sort of compare a Postgres cluster to a SQLite file (not literally, conceptually I guess). Can it be used for single tasks in any way?
Could it figure out how to map search terms to URLs for a knowledge base type thing?
Forgive me if these are silly questions. The extent of my knowledge in this field is asking ChatGPT questions and going "that's so cool" when it answers.
* Your phone's predictive text except it finishes the sentence itself based on a word someone in chat used so that it felt on topic.
In my case it also learned how to form sentences from other people talking in chat, in hindsight it's amazing I never had a Tay issue.
https://en.m.wikipedia.org/wiki/Tay_(bot)