by JimmyRuska 816 days ago
Looking forward to an 8-bit instruct version on llama.cpp so I can try out problems with the insane context length.

It would be interesting if all these models were fine-tuned on basic Datalog, which is a very simple language. That way they could demonstrate their logic/reasoning capabilities, as well as their ability to learn from mistakes and iterate.
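
To give a sense of how small Datalog is, here is a rough sketch of the classic ancestor program and a naive bottom-up evaluator for it in Python. The facts, the names (`parent`, `ancestors`), and the evaluation strategy are all my own illustration, not anything a particular model or Datalog engine does; real engines use smarter strategies such as semi-naive evaluation.

```python
# Hypothetical program, as it would be written in Datalog:
#   parent(alice, bob). parent(bob, carol). parent(carol, dave).
#   ancestor(X, Y) :- parent(X, Y).
#   ancestor(X, Z) :- parent(X, Y), ancestor(Y, Z).

def ancestors(parent_facts):
    """Naive bottom-up evaluation: reapply the rules until nothing new is derived."""
    # Rule 1: every parent pair is an ancestor pair.
    ancestor = set(parent_facts)
    while True:
        # Rule 2: join parent(X, Y) with ancestor(Y, Z) to derive ancestor(X, Z).
        derived = {
            (x, z)
            for (x, y) in parent_facts
            for (y2, z) in ancestor
            if y == y2
        }
        new_facts = derived - ancestor
        if not new_facts:
            return ancestor  # fixpoint reached
        ancestor |= new_facts


# Illustrative facts only.
parent = {("alice", "bob"), ("bob", "carol"), ("carol", "dave")}
result = ancestors(parent)
```

Because the answer is just a set of facts, a model's output on a problem like this can be checked mechanically, which is what would make it usable for the "learn from mistakes and iterate" loop.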