Hacker News
by scottydog51834 1175 days ago
I'd love a tool where I can upload a private dataset and RLHF a model (even better if the tool provides the pre-trained model) without having to worry about GPUs, memory, commercial access, or even writing any Python code. I'd happily pay several hundred, maybe even several thousand, for access to this.
3 comments

Even better would be a crowdfunding platform where you can collect some money from a bunch of people and then use it to fine-tune the bigger models (30B and 65B in the case of LLaMA).
OpenAI allows you to fine-tune any of their LLMs. I'm not sure if your requirements included "without paying OpenAI something", but if you are OK with that, fine-tuning with them is pretty straightforward. IANAL, but if I recall correctly they even promise they won't use your data for their training.
They let you do this with their GPT-3 derived models, but not the GPT-3.5 or GPT-4 ones.

That would be unsafe lol

I think that tool would have broad appeal, but I wonder if the most likely case is that it would be buried inside other, higher-level systems, such as customer support automation SaaS.