by vikp
509 days ago
Hey, I'm the author of marker - thanks for sharing. Most of the processing time is model inference right now. I've recently been retraining some of the models (layout, tables, LaTeX OCR) on new architectures to improve speed. We recently integrated Gemini Flash (via the --use_llm flag), which maybe moves us towards the "hybrid system" you mentioned. Hoping to add support for other APIs soon, but focusing on improving quality/speed for now. Happy to chat if anyone wants to talk about the difficulties of parsing PDFs, or has feedback - email in profile.