Hacker News new | ask | show | jobs
by coder543 529 days ago
Google's research blog does not seem to provide this, but many blogs include the Open Graph metadata[0] around when the article was published or modified:

    article:published_time - datetime - When the article was first published.
    article:modified_time - datetime - When the article was last changed.
For example, I pulled up a random article on another website, and found these <meta> tags in the <head>:

    <meta property="article:published_time" content="2025-01-11T13:00:00.000Z">
    <meta property="article:modified_time" content="2025-01-11T13:00:00.000Z">
For pages that contain this metadata, it would be a cheaper/faster implementation than using an LLM, but using an LLM as a fallback could easily provide you with the publication date of this Google article.

[0]: https://ogp.me/