Hacker News new | ask | show | jobs
by wruza 492 days ago
Sadly no lora/finetune benchmarks. Phoronix sort of missed the whole idea, imo.
1 comments

Closest would be prompt processing from https://www.phoronix.com/review/nvidia-rtx5090-llama-cpp/2, and it's barely +20%