| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by skottenborg 875 days ago
	Cool! Can webLLM handle inference of models with any meaningful size? Can I ask what model is used?

1 comments

Thanks! It's using Llama 2 7B, It supports bigger models but those take longer to download and also infer (if run at all depending on the device)