Y
Hacker News
new
|
ask
|
show
|
jobs
by
bradhilton
473 days ago
We trained all the parameters. Those would definitely be interesting ablations. I would also like to see how much of a performance hit we would take with PEFT methods like LoRA.