Hacker News new | ask | show | jobs
by zkmon 15 days ago
I'm waiting for FP8 quant, preferably from Google.
1 comments

Do they run well on vLLM?