|
|
|
|
|
by magicalhippo
516 days ago
|
|
There's an Ask HN thread going[1] asking about what people have done with small LLMs. This seems like a possible application. I asked Granite 3.1 MOE 3B to generate a title based on the abstract and it came up with: Tensor Product Attention: A Memory-Efficient Solution for Longer Input Sequences in Language Models Maybe a Greasemonkey script to pass arXiv abstracts to a local Ollama could be something... [1]: https://news.ycombinator.com/item?id=42784365 |
|