| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ArcaneMoose 211 days ago
	I've been really impressed with this model specifically because of how insanely cheap it is: https://replicate.com/ibm-granite/granite-vision-3.3-2b I didn't expect IBM to be making relevant AI models but this thing is priced at $1 per 4,000,000 output tokens... I'm using it to transcribe handwritten input text and it works very well and super fast.

4 comments

rubikscubeguy 211 days ago

I'm the dev who made this:) We are looking into adding granite!

link

intalentive 210 days ago

IBM and Nvidia speech to text models are also SOTA (according to HF leaderboard) and relatively lightweight. Replicate hosts those too, although some (like Parakeet) run easily on consumer GPU.

link

nicman23 211 days ago

English only :( . it seems only 2 orders of magnitude larger models have support for ie greek :/

link

irjustin 211 days ago

Thanks for this! Will test this model out because we do a lot of in between steps to get around the output token limits.

Super nice if it worked for our use case to simply get full output.

link