by maggreenWAI
592 days ago
Context length + API cost are right now the main bottleneck for huge HTML + CSS files. The extraction here is already quite efficient, but still:
with past messages + system prompt + sometimes extracted text + extracted interactive elements, you are quickly at around 2500 tokens already (about $0.01 for gpt-4o). If you extract the entire HTML and CSS, your cost + inference time quickly go up 10x.
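To make the arithmetic concrete, here is a rough sketch of the numbers above. The per-token price is not an official figure; it is just what the ~2500 tokens for ~$0.01 estimate implies. The step count is a made-up value for illustration, and output tokens are ignored.

```python
# Back-of-the-envelope input cost, using only the figures from this comment.
# Check current pricing before relying on any of these numbers.

def cost_usd(tokens: int, usd_per_token: float) -> float:
    """Input-side cost of one request (output tokens not counted)."""
    return tokens * usd_per_token

# Assumption: price implied by "~2500 tokens for ~$0.01", not an official rate.
USD_PER_TOKEN = 0.01 / 2500

compact_tokens = 2500                    # messages + system prompt + extracted elements
full_page_tokens = 10 * compact_tokens   # the "10x" case: entire HTML + CSS

compact_cost = cost_usd(compact_tokens, USD_PER_TOKEN)
full_cost = cost_usd(full_page_tokens, USD_PER_TOKEN)

steps = 20  # hypothetical number of agent steps in one task
print(f"compact:   ${compact_cost:.3f}/step, ${compact_cost * steps:.2f}/task")
print(f"full page: ${full_cost:.3f}/step, ${full_cost * steps:.2f}/task")
```

The per-step difference looks small, but it multiplies with every step the agent takes, and the larger prompt is also what drives the slower inference.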