Hacker News new | ask | show | jobs
by Bender 15 days ago
Adding to that some sites already create sitemap.xml. Seems like the bots should be able to parse that. I would imagine some people are probably generating their llms.txt from their existing sitemap.xml. The bots should have just been adapted to parse the xml.
1 comments

Exactly, and even without sitemap ai can crawl the webpage content for internal links.

Not to mention the environmental impact of creating extra files just for AI to crawl and eventually use for training