Hacker News new | ask | show | jobs
user: volodia
created: 2008-05-28
karma: 545

submissions:

Mercury 2 on PinchBench: Diffusion LLM benchmarked on real OpenClaw agent tasks
2 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Mercury 2: Best-in-class speed-optimized intelligence at 1,200 tok/SEC
1 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
2 points | 0 comments
LLMTune: 4-Bit finetuning of 65B LLAMA models on a single consumer GPU
3 points | 0 comments
LLMTune: 4-Bit Finetuning of LLMs on a Consumer GPI
2 points | 0 comments
0 points | 0 comments
Don't have a $5k MacBook to run LLAMA65B? MiniLLM runs LLMs on GPUs in <500 LOC
3 points | 2 comments