by jszymborski
338 days ago
I wonder how well suited some of the smaller LLMs like Qwen 0.6B would be to this... it doesn't sound like a super complicated task. I also feel like you could train a model on this task by using the zero-shot outputs of larger models to create a dataset, making something very zippy.
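To make the second idea concrete, here's a rough sketch of the loop I have in mind: a big "teacher" labels raw text zero-shot, and a small "student" trains on those labels. Everything in it is an assumption for illustration. `teacher_label` is a hypothetical stub standing in for a call to a larger model, the question/statement task is a toy and not the actual task discussed here, and the naive Bayes `TinyStudent` is only a stand-in for fine-tuning something like a 0.6B model.

```python
import math
from collections import Counter, defaultdict


def teacher_label(text: str) -> str:
    """Hypothetical stub for a zero-shot call to a large model (toy rule)."""
    return "question" if text.rstrip().endswith("?") else "statement"


def build_dataset(texts):
    """Label raw texts with the teacher to get (text, label) training pairs."""
    return [(t, teacher_label(t)) for t in texts]


class TinyStudent:
    """Naive Bayes over whitespace tokens; stand-in for a small fine-tuned model."""

    def fit(self, dataset):
        self.word_counts = defaultdict(Counter)
        self.label_counts = Counter()
        self.vocab = set()
        for text, label in dataset:
            tokens = text.lower().split()
            self.label_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text: str) -> str:
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            # Log prior plus add-one smoothed log likelihood of each token.
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for tok in text.lower().split():
                score += math.log((self.word_counts[label][tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Made-up unlabeled inputs; in practice these would be real task inputs.
UNLABELED = [
    "how does this work?",
    "what is the latency?",
    "how fast is the small model?",
    "the small model is fast",
    "this works well",
    "the latency is low",
]

student = TinyStudent().fit(build_dataset(UNLABELED))
print(student.predict("how does it work?"))
print(student.predict("the model works well"))
```

The student never sees a human label, only the teacher's outputs, so it inherits the teacher's mistakes too. Spot-checking a sample of the generated labels before training seems prudent.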