Hacker News
by bm-rf 832 days ago
Maybe you could use something like GPT-4 Vision to include a text description of the image in the transcript.
1 comment

Filtering full-color images down to a halftone suitable for book publishing is a mature technology; setting up an ImageMagick pipeline to do so would not be among the hard parts of preparing a book like this. Picking the right still frame out of GIFs and video is a bit trickier, but not by much.
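
For what it's worth, here is a rough sketch of what such a pipeline could look like, driven from Python. It assumes ImageMagick 7 is installed and exposed as `magick`; the helper names `halftone_command` and `run_halftone`, the `h4x4a` threshold map, and the file names are illustrative choices of mine, not anything from the book in question. A real print workflow would likely also need resizing and screen-angle tuning.

```python
import shutil
import subprocess


def halftone_command(src, dst, threshold_map="h4x4a", frame=None, binary="magick"):
    """Build an ImageMagick argument list that renders src as a halftone.

    frame is an optional zero-based index into a GIF, using ImageMagick's
    "file[N]" input syntax. Which frame is the "right" one is still a
    human judgment call; this only extracts the one you name.
    """
    source = f"{src}[{frame}]" if frame is not None else src
    return [
        binary,
        source,
        "-colorspace", "Gray",             # drop color before dithering
        "-ordered-dither", threshold_map,  # built-in halftone threshold map
        dst,
    ]


def run_halftone(src, dst, **options):
    """Run the command if `magick` is on PATH; return False if it is missing."""
    binary = shutil.which("magick")  # assumption: ImageMagick 7 naming
    if binary is None:
        return False
    subprocess.run(halftone_command(src, dst, binary=binary, **options), check=True)
    return True
```

Something like `run_halftone("clip.gif", "page.png", frame=12)` would then pull a single frame and halftone it in one step.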