Hacker News
PFlash: 10x prefill speedup over llama.cpp at 128K on an RTX 3090 (github.com)
3 points by GreenGames 47 days ago
1 comment

Is this repo legit? Has anyone audited it?