by delaminator 162 days ago

I have a machine in our server room with a 3090 (24GB), twin Xeons and 64GB RAM.

I do local AI with Qwen, Whisper and another I can't remember right now.

These are all Qwen:

We do AI Invoice OCR - PDF -> Image -> Excel. Works much better than other solutions because it has invoice context, so it looks for particular data to extract and ignores the rest. Why local? I proved it worked, so there's no need to send our data outside for processing.

We deal with photos of food packaging - I do a "photograph ingredients list and check them against our expected ingredients" - downside is it takes 2 mins per photo, I might actually push this one outside.
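For anyone curious, the "check them against our expected ingredients" step can be sketched roughly like this once the model has returned the label text. This is a minimal sketch, not our actual code: the function names and the comma/semicolon-separated format are assumptions, and it would not cope with nested lists like "chocolate (cocoa, sugar)".

```python
import re


def normalise(name: str) -> str:
    """Lower-case, drop punctuation and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9\s]", " ", name.lower())
    return " ".join(cleaned.split())


def split_ingredients(text: str) -> list[str]:
    """Split label text on commas and semicolons (assumed format)."""
    parts = re.split(r"[,;]", text)
    return [n for n in (normalise(p) for p in parts) if n]


def compare_ingredients(extracted_text: str, expected: list[str]) -> dict[str, list[str]]:
    """Report expected ingredients that are missing and extracted ones that are unexpected."""
    found = set(split_ingredients(extracted_text))
    wanted = {normalise(e) for e in expected}
    wanted.discard("")
    return {
        "missing": sorted(wanted - found),
        "unexpected": sorted(found - wanted),
    }


if __name__ == "__main__":
    print(compare_ingredients("Wheat Flour, Sugar; Palm Oil.",
                              ["wheat flour", "sugar", "salt"]))
```

The comparison itself is instant; the 2 minutes is all in the model reading the photo.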

Ingredients classifier - is it animal (if so what species), vegetarian, vegan, halal, kosher, alcoholic, is nut based, peanuts and more - simply no need to send it outside.
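A rough idea of what consuming the classifier's output could look like, assuming you ask the model to reply in JSON. The field names and the validation rules here are hypothetical, and treating halal/kosher as plain true/false is a simplification.

```python
import json
from dataclasses import dataclass
from typing import Optional

# Hypothetical field names, loosely following the categories listed above.
BOOL_FIELDS = ("is_animal", "vegetarian", "vegan", "halal", "kosher",
               "alcoholic", "nut_based", "peanut")


@dataclass
class IngredientClass:
    name: str
    species: Optional[str]  # only set for animal ingredients
    flags: dict             # field name -> bool


def parse_classification(name: str, reply: str) -> IngredientClass:
    """Validate a model reply; raise ValueError on anything malformed."""
    data = json.loads(reply)  # JSONDecodeError is a subclass of ValueError
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    flags = {}
    for field in BOOL_FIELDS:
        value = data.get(field)
        if not isinstance(value, bool):
            raise ValueError(f"{field!r} must be true or false, got {value!r}")
        flags[field] = value
    species = data.get("species")
    if flags["is_animal"]:
        if not isinstance(species, str) or not species.strip():
            raise ValueError("animal ingredients need a species")
        species = species.strip().lower()
    else:
        species = None  # ignore any species given for non-animal ingredients
    if flags["vegan"] and not flags["vegetarian"]:
        raise ValueError("vegan implies vegetarian")
    return IngredientClass(name=name, species=species, flags=flags)
```

Rejecting malformed replies outright means a bad answer can be retried instead of ending up in the data.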

I've got a Linux chatbot helper on the "test this" pile with Qwen Coder - I haven't evaluated it yet but the idea will be "type command, get it wrong, ask Qwen for the answer" - I use Claude for this but it seems a bit heavyweight and I'm curious.

tbh some of it is solution hunting - we spent $1000 on the kit to evaluate if it was worth it so I try and get some value out of it.

But it is slow: 3 hours for a recent task that took the Claude API 2 minutes.

My favourite use is Whisper. I voice->text almost all of my typing now.

I've also bought an Nvidia Orin Nano but I haven't set it up yet - I want to run Whisper in the car to take voice dictation as I drive.