| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jackylupino23 1111 days ago
	Very interesting. Wondering what is the state of the art in Hyperparameter Optimization at the moment. Does this method apply to all Deep Learning systems?

4 comments

blackbear_ 1111 days ago

For a general overview, this could be a good starting point [1]. As for deep learning, you may wanna start from here [2], but I personally had good results with Hyperband [3] for DL.

[1] https://wires.onlinelibrary.wiley.com/doi/full/10.1002/widm....

[2] https://github.com/google-research/tuning_playbook

[3] https://jmlr.csail.mit.edu/papers/v18/16-558.html

link

gillesjacobs 1111 days ago

Hyperband [1] has been my go-to hyperparam optimization method over the past few years. Handily beats Bayesian search wherever I applied it, also implemented in most frameworks.

1. https://arxiv.org/abs/1603.06560

link

Lindizz 1111 days ago

The work compares against Hyperband and the new method is significantly better (Figure 2, Hypothesis 2).

link

rch 1111 days ago

I just got back into hyperopt a couple weeks ago. It's easy enough and worked for me, but I was thinking there had to be some new things I'm not aware of.

link

mlminer 1111 days ago

I think so, they apply it to Computer Vision datasets as well

link