Hacker News new | ask | show | jobs
by _5hxt 1107 days ago
This is old and dates from the period surrounding the launch of the 6b model from Eleuther, but have a gander, sheds some light on their sources: https://arxiv.org/abs/2101.00027

GPT4 technical paper doesn't seem to disclose it (or I didn't dive deep enough): https://arxiv.org/abs/2303.08774

One could ask it niche questions about the stuff you mentioned to gain insight, but may stem from discussions about it.