Hacker News new | ask | show | jobs
by jiggawatts 743 days ago
My reason for believing that is that it was trained from scratch, and was not a fine-tuning or other optimisation of the existing GPT-4 model. We know this because OpenAI has publicly stated that they're using a different tokenizer, which would have forced them to start the model training from step one.