| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by khafra 397 days ago
	LLM model interpretability also uses Sparse Autoencoders to find concept representations (https://openai.com/index/extracting-concepts-from-gpt-4/), and, more recently, linear probes.