Hacker News new | ask | show | jobs
by zyl1n 1064 days ago
I am not familiar with Replicate, but based on their website, they charge per GPU type. I didn't see the GPU type set in the example. Is it baked in as part of the "a16z-infra/llama13b-v2-chat" model?
1 comments

There's info about that here: https://replicate.com/a16z-infra/llama13b-v2-chat

> Run time and cost

> Predictions run on Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 9 seconds.