Hacker News
by
ayushkaushal
1185 days ago
We will add quantized CodeGen for fast inference on CPUs up on cformers (https://github.com/NolanoOrg/cformers/) by later today.
3 comments
meghan_rain
1184 days ago
> by later today
Wow, that's the timeframe things are moving at right now, we better get used to it!
syntaxing
1184 days ago
Whoa, is there a PR or wiki about this?
underlines
1184 days ago
4bit GPTQ maybe?