|
|
|
|
|
by anentropic
1157 days ago
|
|
the GPT-J-6B one is Dolly 1.0, previously released Dolly 2.0 is Pythia-12B fine-tuned on this new dataset on their hugging face page [1] they admit the performance may not be much or any better than the original model (I am guessing this may be a weakness of Pythia-12B, which was intended for model-training research rather than best results) the main point of Dolly 2.0 is the new dataset is unencumbered legally [2] whereas Alpaca et al were trained on ChatGPT transcripts, so commercialising those models would contradict OpenAI licensing terms [1] https://huggingface.co/databricks/dolly-v2-12b [2] https://www.databricks.com/blog/2023/04/12/dolly-first-open-... |
|