| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by wren6991 13 days ago
	> you could do substantially the same thing by just prompting/LoRA finetuning the model to produce a single-token output ("yes" or "no") You could probably achieve this with logit masking. Or equivalently, comparing the "yes" vs "no" logprobs in the final dis-embedded vector.