Hacker News new | ask | show | jobs
by nutrientharvest 426 days ago
It's cropping the original image then tokenizing it again with less downsampling, not cropping its internal representation.