Y
Hacker News
new
|
ask
|
show
|
jobs
by
101008
704 days ago
What's the model behind it? I asked a simple question (that others LLM got it right without a problem) and this answered somethign completely wrong (and curious, since I don't know where the hallucination came from)
1 comments
vaasuu
703 days ago
Looks like it's using llama3-8b-8192 as the LLM [1], which is a relatively small model, so hallucination is quite likely.
[1]:
https://github.com/ai-ng/swift/blob/7d1f993b095abc4a51cf9c70...
link
[1]: https://github.com/ai-ng/swift/blob/7d1f993b095abc4a51cf9c70...