by santander_cl
79 days ago
Starred immediately. This is exactly the kind of practical quantization work that makes running longer-context models on consumer GPUs actually feasible. Looking forward to seeing it generalized beyond the one model. Great stuff, g023.
edit: just added Mirostat v2 to clean up repetitive output from the model