| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by magicalhippo 516 days ago

There's an Ask HN thread going[1] asking about what people have done with small LLMs. This seems like a possible application. I asked Granite 3.1 MOE 3B to generate a title based on the abstract and it came up with:

Tensor Product Attention: A Memory-Efficient Solution for Longer Input Sequences in Language Models

Maybe a Greasemonkey script to pass arXiv abstracts to a local Ollama could be something...

[1]: https://news.ycombinator.com/item?id=42784365