Hacker News new | ask | show | jobs
by magicalhippo 516 days ago
There's an Ask HN thread going[1] asking about what people have done with small LLMs. This seems like a possible application. I asked Granite 3.1 MOE 3B to generate a title based on the abstract and it came up with:

Tensor Product Attention: A Memory-Efficient Solution for Longer Input Sequences in Language Models

Maybe a Greasemonkey script to pass arXiv abstracts to a local Ollama could be something...

[1]: https://news.ycombinator.com/item?id=42784365