| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jauntbox 1040 days ago
	It depends on what your goal is, but I've had success reproducing specific output formatting by fine-tuning the base LLaMA2 models instead of the RLHF'd models. My use cases were simpler - information extraction/synthesis from text rather than creative writing. The base models might not be good fits for your task.