Y
Hacker News
new
|
ask
|
show
|
jobs
by
rockinghigh
61 days ago
The MoE experts are quantized to int4, all other weights like the shared expert weights are excluded from quantization and use bf16.