Hacker News new | ask | show | jobs
by leoh 160 days ago
Not for inference, right?
1 comments

correct - h100 can do like 100 tokens per second on a gpt4 like model, but you'd need to account for regular fine-tuning to accurately compare to a person, hence 4 or so. of course the whole comparison is inane since computers and humans are obviously so different ha...