by dserban 1118 days ago
For a side project I'm using a fine-tuned vicuna-13b. I'm using it to generate search queries from natural language, and it outperforms all other open-source models at deep intent recognition.

Do you run Vicuna locally, or have you had any success deploying it somewhere? I have been thinking of using a fine-tuned model but haven't found anywhere cost-effective to deploy it. Either you keep the serving machine running continuously and the cost is in the hundreds, or you accept extremely slow starts.
I have a monster of a Linux cluster at home, which is busy running Cassandra and Spark for my other side projects. I'm piggybacking on that while this particular project is still in development.

At the rate LLMs are evolving, a reasonably priced cloud offering will probably exist by the time I need one for production.