by wongarsu
2 days ago
I can get great results from a YOLO model with 30M to maybe 300M params. To get decent CV from an LLM, 8B params is the absolute minimum, and closer to 30B for interesting tasks. I might be on board with LLMs being the future of OCR (though many would disagree), but for general CV they are very inefficient for very limited benefit.
|
Also, even if they are better, you can have a flow where a cheap model handles most inputs and only the marginal cases go to a more complex model (or a chain of these).
The YOLO models are really shockingly good for their cost, and for how well they work without much training data.
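
A rough sketch of that kind of cascade, assuming each model returns a label plus a confidence score. The names `cascade`, `cheap_model`, and `expensive_model` are made up for illustration, the two models are stand-ins rather than real YOLO or LLM calls, and the 0.8 threshold is arbitrary:

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class Prediction:
    label: str
    confidence: float  # assumed to be in [0, 1]


# A "model" here is any callable that maps an input to a Prediction.
Model = Callable[[object], Prediction]


def cascade(
    x: object,
    stages: Sequence[Tuple[Model, float]],
    fallback: Model,
) -> Tuple[Prediction, int]:
    """Try each (model, threshold) stage in order, cheapest first.

    Returns the first prediction whose confidence meets its threshold,
    plus the index of the stage that produced it. If no stage is
    confident enough, the input goes to the fallback model.
    """
    for index, (model, threshold) in enumerate(stages):
        prediction = model(x)
        if prediction.confidence >= threshold:
            return prediction, index
    return fallback(x), len(stages)


# Hypothetical stand-ins: a real setup would wrap a small detector
# and a larger (slower, pricier) model behind the same interface.
def cheap_model(x: dict) -> Prediction:
    return Prediction(label="cat", confidence=x["score"])


def expensive_model(x: dict) -> Prediction:
    return Prediction(label="cat (verified)", confidence=0.99)


stages = [(cheap_model, 0.8)]
easy, easy_stage = cascade({"score": 0.9}, stages, expensive_model)
hard, hard_stage = cascade({"score": 0.5}, stages, expensive_model)
```

Here the confident input stays on the cheap model (stage 0) and only the marginal one reaches the expensive fallback. In practice the threshold would need tuning against how often the cheap model is confidently wrong.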