| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by xs83 253 days ago
	Now this looks much more interesting! Is the top one input tokens and the second one output tokens? So 38.54 t/s on 120B? Have you tested filling the context too?

1 comments