Hacker News new | ask | show | jobs
by sgbeal 724 days ago
Wow! We're now just a hair's-width away from finally being able to say, "Computer, enhance image!" without sounding like we're in a bad sci-fi show.
3 comments

Think the only thing historical science fiction/Blade Runner photo inspect scene[0] didn't forsee was voically having AI assist/analyze photo to summerize list of items/objects avaliable to zoom/view. (vs. pan/zoom around). Although altavista glasses / hand gestures[3] would have been a future concept at the time, too.

----

[0] : https://scifiinterfaces.com/2020/04/29/deckards-photo-inspec...

[1] 'mirror reality' image / TERI[2] : https://www.hackster.io/news/blade-runner-s-image-enhancemen...

[2] : TERI, almost IRL blade runner move image enhancement tool : https://news.ycombinator.com/edit?id=40844595 / https://github.com/iscilab2020/TERI-3DNLOS/tree/TERI

[3] : Gest : https://news.ycombinator.com/edit?id=40844704

Using Whisper as the voice interface, an LLM to understand the prompt and issue function call commands and an image upscaler you could build this in a weekend. Would it be useful? Not especially by itself but I think there is a lot of promise in voice interaction with LLM operated software.
Make it so!
gMake it, you gAught it. (once there's enough bandwidth to go around[0])

[0] : Intel CPU with OCI Chiplet Demoed with 4Tbps of Bandwidth and 100M Reach : https://news.ycombinator.com/item?id=40844616