xxx = crawl and cache, yyy = reachable pages on the web xxx = scan and preview, yyy = published books etc