|
|
|
|
|
by riku_iki
218 days ago
|
|
> How do the Chinese train these models if they don't have access to the GPUs to train them? they may be taking some western models: llama, chatgpt-oss, gemma, mistral, etc, and do postraining, which required way less resources. |
|