| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by sdan 200 days ago
	A few months ago I made a (theoretically) infinitely learning geo-guessing model that updated the policy with each user guess: https://geospot.sdan.io/ Hoping to implement a simple RL loop here and optimize whats generated by the LLM to create the perfect slop machine :)