Hacker News new | ask | show | jobs
by malborodog 1182 days ago
This kind of problem is (trivially?) solvable through the ReAct framework, like LangChain etc. Basically you get good data, vector embed it, and make sure the LLM knows where to look for accurate information.
3 comments

Slight tangent but preferably i would like an AI that can go against the status quo, the elite classes, the mainstream discourses and ignore the circus.

I think one of that most interesting things that can come of bots like GPT-X is that it can make new connections, unravel "stuff", do extremely intricate deductive reasoning.

Be the data driven arbiter of the the truth for everyone, not just the tiny established classes or cultural hegemonies.

The ideological and cultural noise increasingly smokescreen any realpolitical, material or resource-oriented analysis of actual economic power structures in the world in the last years, AI could be a godsend (or the opposite unfortunately).

I remember reading a sci-fi years ago about the stuff an AI concluded when asked philosophical questions that were so bizarre and frightening that people shut it down, and i'm sure we're in the same territory with political and scientific analysis.

It's either dangerous to the orders of the world, or not that interesting and borg like on a philosophical level.

> Slight tangent but preferably i would like an AI that can go against the status quo, the elite classes, the mainstream discourses and ignore the circus

There's a Charlie Brown comic about this sort of thing. Although I think it's an edit, not an original comic. Something like "They are never going to give you the education you need to overthrow them"

Similarly they are never going to give you an AI that will side with a you against them.

I would also love such an AI though.

Agreed... An example would be the recent Jordan Peterson mentions in Twitter around the bias in the system. Even without leaning one way or the other politically speaking it should be concerning... because any similar bias can easily be used to target "you".
Who - WHO - thought that naming something new 'the ReAct framework' in 2023 was a reasonable choice.
This approach has some issues

- it chunks inputs, with some overlap, but this can destroy context

- the retrieved passages, when they come from different documents, have no apparent relation or could be mistakenly considered related

- the model struggles to correlate data between the document snippets, taking half an idea from one side and half from the other side and mixing them up in something that doesn't really make sense

Implementation details. Check out what's going on with LangChain, augmented retrieval, etc. We'll be able to create knowledge bases on specific subjects with vetted data, and get the bot to retrieve and summarize appropriate results while providing a citation to the original source.