Hacker News new | ask | show | jobs
by threethirtytwo 166 days ago
The datasets going into LLMs have to have an element of human-ness to it.

For example I can’t just feed it weather data from the past decade and expect it to understand weather. It needs input and output pairs with the output being human language. So you can feed it weather data but it has to be paired with human description of said data. So if we give it data of a rain storm there has to be an english description paired with it saying it’s a rainstorm.