Hacker News new | ask | show | jobs
by freeone3000 1765 days ago
No more than 1GB, and inference would be dependent on the speed but no more than 10 seconds. Completely reasonable as a standalone program.
2 comments

Is there a guide on running it locally? I am currently using some classic algorithms to summarize my RSS feeds, but they aren’t very good.
Sure! Here's the weights https://huggingface.co/sshleifer/distilbart-cnn-12-6 , here's the model https://huggingface.co/transformers/model_doc/bart.html?#tra... . Uses standard pytorch loading, so write your python program around that and it should be fine.
Would that require running a separate server locally? That's all kinds of non-techie hostile :(