|
|
|
|
|
by rgiar
4708 days ago
|
|
so this is just when a given site was crawled? "_id": "b919f02c8f053c41e8ee86311ca9b0f6,
"url": "https://www.example.com/",
"host": "www.example.com",
"root": "example.com",
"time_spent": [
{
"sec": 45,
"seen_at": ISODate("2013-06-23T00: 41: 44.0Z")
},
{
"sec": 5,
"seen_at": ISODate("2013-07-01T14: 41: 44.0Z")
}
|
|
yes, as it is said in the blogpost, the only thing missing is the full text of the page for indexing & searching in it, we don't dare to release it because of copyright issues (he, you distribute the full text of my page!).
With this data you could for example built a new alexa and find out what was the most visited page last week :)