Hacker News new | ask | show | jobs
by serjester 491 days ago
Correct, it's with batching Vertex pricing with slightly lower output tokens per page since a lot of pages are somewhat empty in real world docs - I wanted a fair comparison to providers that charge per page.

Regardless of what assumptions you use - it's still an order of magnitude + improvement over anything else.