| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mhitza 59 days ago
	In theory, locally you'd use these where lossiness is acceptable for audio transcription and image labeling (as simple examples). In practice I haven't got around to building something around multimodality since I'm primarily using their text generation capabilities.