by xatalytic
1157 days ago
15,000 instruction-tuning records generated by Databricks employees in seven of the behavior categories outlined in the InstructGPT paper (predecessor to ChatGPT). The release coincides with that of Dolly 2.0, which is trained exclusively on this dataset and demonstrates high-quality (but not state-of-the-art) instruction-following behavior. The data and models are licensed for commercial use, setting them apart from recent releases trained on data from OpenAI.
|
"Trained exclusively on this dataset" is not correct. Dolly 2.0 was fine-tuned on this dataset, but the underlying model is EleutherAI's 12B Pythia model.