by DrNosferatu 814 days ago
Any performance benchmarks against 'llamafile'[0] or others?

[0] - https://github.com/mozilla-Ocho/llamafile

1 comment

You can already use Intel GPUs (both Arc and iGPUs) with llama.cpp through several backends:

- SYCL [1]

- Vulkan

- OpenCL

I don't own the hardware, but I imagine SYCL is the more performant option on Arc, because it's the backend Intel is pushing for its datacenter hardware.

[1]: https://www.intel.com/content/www/us/en/developer/articles/t...