Hacker News new | ask | show | jobs
by bradhilton 473 days ago
We trained all the parameters. Those would definitely be interesting ablations. I would also like to see how much of a performance hit we would take with PEFT methods like LoRA.