Fine-Tuning LLMs to 1.58bit

	Fine-Tuning LLMs to 1.58bit (huggingface.co)
	52 points by galeos 638 days ago

2 comments

patleeman 638 days ago

That's awesome. The original discussion of BitNet made it seem like you needed to train a model from scratch, but it's neat that they were able to adapt an existing model. This is quite exciting.

cpldcpu 637 days ago

The performance is still a bit degraded, though.

amilios 638 days ago

Very exciting, although it was a bit disappointing to see that they're hitting just Llama 1 7B performance by quantizing Llama 3. But I'm sure the performance gap will close over time!
