Hacker News new | ask | show | jobs
by kristianp 1093 days ago
Isn't there a problem with the Falcon models being too slow? At least I have seen reports of the quantised model being very slow [1]

https://huggingface.co/TheBloke/WizardLM-Uncensored-Falcon-4...