Hacker News new | ask | show | jobs
by ArcaneMoose 211 days ago
I've been really impressed with this model specifically because of how insanely cheap it is: https://replicate.com/ibm-granite/granite-vision-3.3-2b

I didn't expect IBM to be making relevant AI models but this thing is priced at $1 per 4,000,000 output tokens... I'm using it to transcribe handwritten input text and it works very well and super fast.

4 comments

I'm the dev who made this:) We are looking into adding granite!
IBM and Nvidia speech to text models are also SOTA (according to HF leaderboard) and relatively lightweight. Replicate hosts those too, although some (like Parakeet) run easily on consumer GPU.
English only :( . it seems only 2 orders of magnitude larger models have support for ie greek :/
Thanks for this! Will test this model out because we do a lot of in between steps to get around the output token limits.

Super nice if it worked for our use case to simply get full output.