Hacker News new | ask | show | jobs
by SlavikCA 112 days ago
So, only Americans can use data against others?

By the way, I'm running 400B model on my computer with 72GB VRAM: Qwen3.5-397B-A17B-GGUF/UD-Q4_K_XL getting 13 t/s. Subjectively, I feel it's runs at the level of Anthropic Claude, just slower.

2 comments

Question for you, that 13t/s, is that pretty solid even with high context/tokens?

I know Apple marketing says 'look at our 20t/s' but they sent less than 40 tokens.

256 GB of RAM?