|
|
|
|
|
by irthomasthomas
33 days ago
|
|
No its super easy. I think the confusion is due to the serving and hosting APIs that let you add your own GPUs to a pool and earn money. But for regular inference they have an openai responses API a basic chat app. You can signup to a $3 subscription, or deposit $5 and use your api key. https://chutes.ai/app/chute/2ff25e81-4586-5ec8-b892-3a6f3426... curl -X POST \
https://llm.chutes.ai/v1/chat/completions \
-H "Authorization: Bearer $CHUTES_API_TOKEN" \
-H "Content-Type: application/json" \
-d ' {
"model": "moonshotai/Kimi-K2.5-TEE",
"messages": [
{
"role": "user",
"content": "Tell me a 250 word story."
}
],
"stream": true,
"max_tokens": 1024,
"temperature": 0.7
}' |
|