| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ggerganov 962 days ago
	Heh, funny to see this popup here :) The performance on Apple Silicon should be much better today compared to what is shown in the video as whisper.cpp now runs fully on the GPU and there have been significant improvements in llama.cpp generation speed over the last few months.

5 comments

boesboes 962 days ago

13 minutes between this and the commit of a new demo video, not bad :D

And impressive performance indeed!

tomtom1337 961 days ago

Ah, forget the other message, I watched the videos in the wrong order! And I can’t delete or edit using the Hack app!

tomtom1337 961 days ago

Is it just me, or is the gpu version actually slower to respond?

A4ET8a8uTh0 962 days ago

You are kinda famous now man. Odds are, people follow your github religiously.

MuffinFlavored 962 days ago

Is ggerganov to LLM what Fabrice Bellard is to QuickJS/QEMU/FFMPEG?

actionfromafar 961 days ago

That's a big burden to place on anyone.

asadm 962 days ago

I have sent a PR to move that new demo to the top. I think the new demo is significantly better.

sgt 961 days ago

Is running this on Apple Silicon the most cost effective way to run this, or can it be done cheaper on a beefed up homelab Linux server?

v3ss0n 962 days ago

will this work with latested distilled llama?