Hacker News new | ask | show | jobs
by ivanfioravanti 883 days ago
I posted about the Python library few hours after release. Great experience. Easy, fast and works well.

I create a GIST with a quick and dirty way of generating a dataset for fine-tuning Mistral model using Instruction Format on a given topic: https://gist.github.com/ivanfioravanti/bcacc48ef68b02e9b7a40...

5 comments

How does this fine-tuning work? I can see that you are loading a train.jsonl file and the some instructions but is the output model generated or this is some kind of a new way of training the models?
The gist is only to create the dataset not to fine tune
What's your observations about finetunes - are they really useful for anything practical? :)
Does olana support fine-tuning? I assume not. (Not asking about finetuned models that I know they support)
can we use it on cloud or I gotta download it locally? it might not work on my MacBook 2015 with 8GB ram
Gist isn't an acronym, it's a word. (e.g. "get the gist of things")