by tsthename 1279 days ago
I tried to do this with Hacker News data [0]. I wanted to feed the model the entire community's discourse and then ask it questions (like simulating an interview with an HN user). The main problems I ran into were:

1. Token limit: You can only input a limited amount of text at once, so the challenge becomes compressing the data to fit into the window, and that compression can be lossy.

2. Trust: This is the main one. It's hard to tell whether the output is based on the material you supplied or on the large amount of data the model was originally trained on. There are techniques that can help, but they add a lot of extra work and don't guarantee good results.
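
To make point 1 concrete, here is a minimal sketch of fitting comments into a fixed window. It is not the approach from the linked article. The names `estimate_tokens` and `pack_comments` are hypothetical, and counting whitespace-separated words is only a crude stand-in for a real tokenizer, which will count differently. Oversized comments are simply truncated, which is exactly the kind of lossy step described above.

```python
from typing import List


def estimate_tokens(text: str) -> int:
    # Assumption: one token per whitespace-separated word.
    # Real tokenizers split text differently, so treat this as a rough proxy.
    return len(text.split())


def pack_comments(comments: List[str], budget: int) -> List[List[str]]:
    """Greedily group comments into chunks whose estimated size fits the budget."""
    chunks: List[List[str]] = []
    current: List[str] = []
    used = 0
    for comment in comments:
        cost = estimate_tokens(comment)
        if cost > budget:
            # Lossy step: cut an oversized comment down to the budget.
            comment = " ".join(comment.split()[:budget])
            cost = estimate_tokens(comment)
        if current and used + cost > budget:
            # The current chunk is full, so start a new one.
            chunks.append(current)
            current, used = [], 0
        current.append(comment)
        used += cost
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    sample = ["a b c", "d e", "f g h i"]
    for chunk in pack_comments(sample, budget=5):
        print(chunk)
```

Each chunk would then be sent to the model separately, or summarized first, which is where more information gets lost.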

[0] https://idiotlamborghini.com/articles/using_gpt3_and_hacker_...