by skottenborg 875 days ago
Cool! Can WebLLM handle inference for models of any meaningful size?

Can I ask what model is used?

1 comment

Thanks! It's using Llama 2 7B. It supports bigger models, but those take longer to download and to run inference on (if they run at all, depending on the device).
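
For a rough sense of why bigger models take longer to download, here is a back-of-the-envelope sketch of weight size versus parameter count. The bit widths (16 and 4) are assumptions for illustration; the thread doesn't say which quantization is used, and the function name is hypothetical. It only counts the weights, not tokenizer files or runtime buffers.

```python
def estimated_weight_bytes(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone (no metadata, no runtime buffers)."""
    return num_params * bits_per_weight / 8


GIB = 1024 ** 3

# Parameter counts are illustrative; 7B matches the model named above.
for name, params in [("7B", 7e9), ("13B", 13e9), ("70B", 70e9)]:
    # Bit widths are assumed, not taken from the thread.
    for bits in (16, 4):
        size_gib = estimated_weight_bytes(params, bits) / GIB
        print(f"{name} at {bits}-bit: ~{size_gib:.1f} GiB")
```

Size grows linearly with parameter count, so under the same assumed bit width a model with twice the parameters is roughly twice the download.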