Hacker News new | ask | show | jobs
AI will make the Semantic Web possible (theaimaze.com)
27 points by 3dm4r 1212 days ago
6 comments

Yep i agree and it gets even better.

In order for AI to actually be usefull as a knowledge aggregator it NEEDS to create the sematic web.

The question is if anybody is going to give you acces to their semantic databse though, because im pretty sure that google itself has a nice semantic index alerady.

Plese stop talking about adding semantic annotations to every webpage as "semantic web". Semantic Web was originally to intended to make machine-to-machine communication a first class citizen on the web, to complement a human-friendly web. See https://videos.cern.ch/record/2671957 if you are interested in a bit of history (this video precedes cloud by ca. 12 years, IoT by 14 years, and digital twins by 18-19 years).

SOME people assumed this would require annotating human-targeted pages (artificial general intelligence and commonsense reasoning academics). This turned out to be wrong, by the way. IoT enabled machine-to-machine communication all right without annotations, not to mention that your IoT device is not really interested in those semantically annotated fake news posts and cat GIFs anyway. In fact, Wikipedia pages pull in some data from Wikidata, not the other way around (sorry DBpedia).

Now, the problem is that most IoT devices from different vendors can't talk to each other unless vendors team up explicitly. That's where semantics (on the API endpoints, for example) could help.

> ChatGPT cannot navigate the web yet, but it will soon. Combining the reading comprehension capability of modern AI systems with web navigation will allow us to have agents roaming the web and finding the information we want.

Won't these agents also be influenced or even biased by the prompts they're given? Maybe I am missing something, but when a misleading link based on a suboptimal prompt is formed and reinforced via recurring prompts, how can we make sure that we are getting back the actual connection of pieces of information?

AIs automatically posting content will surely be used for SEO purposes. This is a very disheartening thing, but it's inevitable. We should have regulation to know if content was written by an AI, a human, or both. Otherwise AI will train on AI written content, what will that end with? AIs don't care if the content they train on is BS, so it would end with a very poor system.
I think we will need to create personalized agents on top of indexes. There is a principal agent problem between users and search engine robots that will become even larger as those systems become smarter.
Isn't it better than having humans do it instead individually. Yes, it will have bias, but it will all be biased the same from 1 models perspective. The data could then be adjusted for bias.

Humans doing it would instead create bias in many different directions. An AI attempting it is a good starting point.

While I agree with the author, there's an issue is that the 'semantic annotations' are centralised within each AI, so if the AI get it wrong, well though luck you can't do anything about it..
Google has been effectively attempting this for years. Many searches produce immediate results parsed out of web pages. It works okay-ish.
So we are back to thinking about the semantic web? What's old is new again, I guess.