Hacker News new | ask | show | jobs
by breadislove 10 days ago
very bad take. with most modern multomodal models you get way better performance then going to text first
1 comments

it's a cost/latency trade-off in production + very use-case dependent