by neximo64 849 days ago
Both

It takes time to train them, and more training time generally gives a better model. Usually it's about 6 months or so. More processing power lets the model cram more capability in.

1 comment

The OP asks about request time (and, I imagine, processing power), not training.