| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by renchuw 864 days ago
	Hi, OP here. So you evaluate LLMs on corpuses to evaluate their performance right? Bayesian optimization is here to select points (in the latent space) and tell the LLM where to evaluate next. To be precise, entropy search is used here (coupled with some latent space reduction techniques like N-sphere representation and embedding whitening). Hope that makes sense!

1 comments

hackerlight 864 days ago

The definition of "evaluate" isn't clear. Do you mean inference?

link

renchuw 864 days ago

Perhaps I should clarify it in the project README. It's the phase to evaluate how well your model is performing. So the pipeline goes training -> evaluation -> deployment (inference) corresponding to the datasets in supervised training, training (training) -> evaluation (validation) -> deployment (testing).

link