How is an open source project going to download the entire Internet? The model requires 10x20k cards to run. You are dreaming; this is a factor+ more complex than Stable Diffusion. Big players only.
According to Altman, each chat costs a few cents to evaluate. Let's also assume that there are some performance breakthroughs. Also, maybe I don't need the whole Internet; for me it would be enough if it was trained on a scientific corpus. Also, it only needs to be trained once by someone.