Hacker News new | ask | show | jobs
by hahla 5325 days ago
Does anyone else find it disturbing that google employees are bending over for pg/hn? Seriously, if any other webmaster blocked googles bots they wouldn't change their algorithms to accommodate, or see how they could use less of our resources.

Its pg's fault not googles, and I dont see why they should care. Maybe from their standpoint it would be more beneficial to google users who are used to typing in 'hacker news' to visit this site, but since when did that matter to google?

Also don't get me wrong I love both google and hackernews. I just find whats going on in this thread interesting..

15 comments

From my personal point of view, I help webmasters when I see an issue. It's part of my job and it's something that gets me fired up. I help when I go on our forums, I do that here, and I'm regularly helping webmasters on Google+ (my team does regular webmaster hangouts that anyone can join - see my profile: https://plus.google.com/115984868678744352358 ) and also on Twitter. We can't cover everything, we can't be everywhere, but when we come across something we can help with, we try to help!

Another important point about this thread: this is a very common issue that regularly comes up, and no site is immune from it. I regularly see major sites have a firewall that auto-configures itself to block Googlebot, and the webmaster doesn't know what's going on. Raising awareness about this problem and how to fix it adds to the importance of replying.

Anyone know if this is also a common issue with BingBot? Our site is doing fine in Google at the moment, but was completely deindexed by Bing, and we're totally baffled.
It's not disturbing at all. I am glad Google is a company made up of humans and not uncaring robots. Human nature is that people want to help others when they are in a position to do so. I would do the same if HN needed my help.

Of course, there are people who want to take advantage of people like Matt to give themselves an economic advantage, and I have no sympathy for those people if they can't get someone in Google to help them as easily.

It comes down to karma.

> I am glad Google is a company made up of humans and not uncaring robots

That's exactly the problem. Google is notorious for providing horrible customer support, so why some people get personalized help, while the rest of us are stuck with the uncaring robots?

Actually, much as I've blasted[1] Google over the years for not doing anything (like abuse-desk staffing) that requires People Not Algorithms...

Giving occasional personalized help is a great way to avoid a total black hole while still creating an escalation path. It's like having EXPLAIN QUERY stack traces on NewRelic. You don't need a deep trace of every transaction; you need a deep trace from a few representative squeaky wheels, and you use that information to improve the underlying self-serve process. This is something Google, with their "1/3 of the company parses logs", excels at.

You'll note neither Matt nor Pierre offered to do anything special for PG. They didn't say "Oh! I love HN, and we want it to succeed; I'll go push out a manual override that makes sure we use only the one crawler you unblocked." They gave him exactly the info he'd get from Webmaster Tools plus general knowledge, and told him how to use Webmaster Tools to fix it. Maybe they'll go figure out how to implement that "you're blocked!" hover-over in SERPs, but again, that would benefit everyone, not just HN. There's no special treatment here.

[1] http://blog.wordtothewise.com/2008/04/troubleshooting-a-post...

Google does have bad customer service when they're working through the normal channels. But Google Employees are awesome. I got personal attention from core G+ devs when I broke my G+ profile.

The point is if you can build relationships with Google employees rather than with Google itself then you too can get this sort of amazing help.

There's nothing sinister or crazy about this. Rather than being one in a million, become one in a circle of friends or acquaintances.

To be fair, I've seen Cutts jump in on "no-name" threads where there is an issue with Googlebot or some search-related thing. It does not appear to be favoritism, beyond choice of forums in which to participate.
Fairness is what it's all about. I like yiur choice of wording. A lot of people have a problem with Matt Cutts jumping in because they feel it's not fair. They're mistaking equality for fairness. This is perfectly fair but not equal. Equality sucks. Fairness is better. Fairness is treatment based on merit and equality is just equal treatment no matter what the situation. Equality can really screw you if you're on the wrong side of it.
I see your point and maybe my attitude toward it is totally biased or morally/ethically off balance but I think it's just fine.

PG knows his shit. He knows how to block crawlers, he knows how get SEO done, he's no dummy. "So what?" you say. Well I work in web development and I meet with people all the time and explain to them how to run a site and get rankings up, etc. I tell them I'll give them the tools, show them how to use them and even leave detailed manuals on how to work everything. These people are so into it at our first meeting and they're excited to get into it and work this stuff.

Then the job is done, the tools are handed over, and the lessons are learned. What do they do? Nothing. Then they do nothing. Then even more nothing. The site is never updated and they do no maintenance whatsoever and eventually it becomes outdated and dead for all intents and purposes. A year later they want to know why they're not ranking in Google. I ask if they've checked or even once used any of the tools I gave them and the answer is always no. Then they demand I fix it. I kindly remind them they're the ones who wanted to have Larry in accounting run the site because he's "good at computers" and to save money by not hiring an IT guy.

So the point I'm making is that I'm sure there are a ton of people out there who are trying to get customer service without trying to help themselves or using the resources readily available. I think it's great that Google is showing they're willing to help those who help themselves.

I think google isn't treating everyone equally but they are treating people fairly. There's a difference. I'd prefer fairness. Not all "customers" are created equal so you compensate by treating them with fairness instead.

PG knows his shit. He knows how to block crawlers, he knows how get SEO done, he's no dummy.

Apparently he doesn't use Google Webmaster Tools, though.

It seems he doesn't want to. From what I understand there's the possibility that he'd prefer to accomplish his goal without the use of Wemaster Tools. Like I said, he knows his shit, so he's going to do things on his terms if he can. Webmaster Tools might be what the average guy wants to use but then there are some who follow the beat of their own drummer whenever possible for whatever reasons we may never know.
This fiasco is not good evidence for pg knowing his shit. If he wants to block bots and he doesn't want to use the tools provided to make that work, maybe HN should be #10. It's even behind the wikipedia article describing it. Which is actually not bad for a dead site.
Speaking of knowing ones shit your site listed in your profile billpatrianakos.com is dead - no dns being returned.
PG: "A couple weeks ago I banned all Google crawler IPs except one"

With regard to this issue doing the above is not evidence of knowing your shit in the area in question.

To be honest, I disagree. The fact is that people who know their shit shoot themselves in the foot occasionally. That's how they come by the shit they know. Timid people rarely know their shit because they are too afraid to make mistakes.

Being bold is an important aspect of getting to know your shit.

Also, besides this, this action has gotten us talking about how Google throttles crawling, and this discussion has further reduced my opinion of the World's Largest Adware Vendor.

Now if I were to ban Google searchbots based on what PG is saying I would simply limit them to the first page or two of listing. I think this can be done in a robots.txt?

Yes, it's VERY disturbing, no question about it. Matt can sound all helpful, and cuddly, and oxytocin-inducing all he wants, but he's basically stepping in and helping 1 site over another, when the whole process should NOT favor anyone. There are lots of webmasters out there that do stupid stuff like override a robots.txt accidentally, yet I don't see Matt sending them an email, asking them to check up on it.. Come'n... this is lame!
What do you think should happen in this scenario? Matt Cutts is aware that a reasonably high-profile site is not being indexed properly. Should he just ignore that?

I think it's particularly good that this is being done publicly, too. There's billions of sites out there and Google can't provide this for all of them, but since they're doing it publicly others can learn from it and know how to address it.

If Matt was artificially boosting HN's ranking that would be disturbing, but they're fixing an indexing issue.

Are you kidding? Matt is basically a superhero who helps anyone he can find. He's solved problems for people complaining on Twitter, on Google+, he hosts regular video office hours when anyone can come to him with problems, etc. It may be the first time YOU'VE seen him rush in to help someone, but this is Matt Cutts' standard MO.
No, he's not a super hero.. and no he doesn't help everyone that asks for it. I've brought up many issues over the years to him via Twitter, blog comments, emails, etc, and none of them.. count them.. ZERO have been addressed by him. He just gives preferential treatment to those sites that make a lot of noise, and might give Google or Google's quality team a lot of negative press. That's all he does. I've even brought up 100% apparent, clear spammy sites and he doesn't respond, and doesn't do anything about them. They're still there.

Lots of folks here like to give Matt Cutts this aura of an angel, or some sort of saint. Those who have known his actions over the past 5-10 years know better. Just ask guys like Aaron Wall and Rand. Matt Cutts is just a pawn that is there to do damage control for Google. That's all he is.

Hi AznHisoka. A lot of people send me spam reports, and I'm happy to pass them on. But normally we investigate the spam reports without sending back specific feedback of what action we took in response.

P.S. Just curious--is this the same AznHisoka from BlackHat World, the "Blackhat SEO forum"? http://www.blackhatworld.com/blackhat-seo/members/137345-azn...

> normally we investigate the spam reports without sending back specific feedback of what action we took in response

I understand why you have to do this, but I wish there was at least some transparency regarding confirmation that an issue has at least been looked at, as opposed to just filed as a spam report.

I've only reported one issue to Google before, and the site in question, though incredibly obviously bad, is still the top link on the SERP for the business.

Specifically, there's a pizza place here in Halifax, NS that looks like it didn't renew it's domain in time, and some people are squatting it with an old copy of the site plus their own spammy links. The site[1] displays a copy of the site from 2008 or so, with a banner on the top which reads:

> To previous domain owner: We bought this domain after expiration so it's not our fault that you lost it. We put old content for this domain only to avoid losing good quality of it from SEO point of view. If it's a problem for you contact us ASAP!

I reported it to Google months ago and never heard a word about it, but today when I search for the business name - which I'd imagine a lot of people do when looking for menus, phone numbers, etc. - the "compromised" site still ranks first on the SERP[2]. I wonder how many people even notice the banner telling customers that <jedi>this is not the page you're looking for</jedi>.

[1] http://jessys.ca

[2] https://encrypted.google.com/search?hl=en&q=jessys%20piz...

If it is, apparently the person in question doesn't enjoy the Blackhat World forum very much, having made one post total.
And yet AznHisoka still seems to have time to recommend a colon cleansing product on Yahoo Answers to lots of different people: http://answers.yahoo.com/activity?show=EYIYH5R8aa :)
Oh noes; Matt Cuts is going to think BHW is just spammers - siiigh
Oh snap :D
Google should simply deliver the best results, no matter what algorithm they use to do so.
If that were truly their goal, their ads would never be the useful, because if they were the best results, they would be duplicates of the organic search results (hence useless). Or, if they weren't the best results, then the best results are in the organic search.

Clearly that's not the case, as advertising makes 99% of their revenues.

If you're looking to buy shoes, why should the organic results for [your favorite model] (likely wikipedia, the brand website, etc.) be better than ads?
That's exactly my point. I was responding to the parent's suggestion that "Google should simply deliver the best results, no matter what algorithm they use to do so." In the scenario you describe, the best results would be the stores (for example) where you would buy those shoes.

Remember, organic doesn't have to mean non-commercial. In fact, if I search for best buy, the first organic result is bestbuy.com

It is obvious that the best organic result for a nav query like [best buy] is bestbuy.com. The issue is with queries like [some product], should it show amazon first? or bestbuy? or some local reseller?
the point of ads is not to be maximally useful, it's to maximise some function of (usefulness to you, profitability to the advertiser).
Of course that's the point. Again, I was responding to the parent ("Google should simply deliver the best results, no matter what algorithm they use to do so"), which I understood to be what results Google should choose to show. Perhaps I misunderstood his meaning for "best", but "best" in this context to me means the result that's most likely to satisfy the user.

I'm not disagreeing with the fact that ads are subject to a whole different objective function. I'm saying that if the best results were already in the organic, then the ads would either be redundant (for the user) or otherwise less useful than the organic (by definition, really).

yep, i was just pointing out that the ads being less useful than the organic result was pretty much by definition.
Damned if you and damned if you dont.

All the people in this thread, including Google-related users, are trying to do is figure out what changed. That could just as easily have been at Google or HN's! It could also just as easily been a correct change.

If it turns out it was something HN changed and they did not use or even aware of the proper Google tools available for everyone to optimize their site, then I hardly think any reasonable person could object if it is pointed out to them - even if it's by Google personnel.

>Does anyone else find it disturbing that google employees are bending over for pg/hn?

Umm...no. They're not tweaking the algo, they're explaining to PG that he should stop blocking the crawlers, or that he should verify the site in google webmaster tools, then change the crawl rate.

Umm...Yes.

Seems like "bending over" to me:

"but I'm pinging the right people to ask whether we can get this fixed pretty quickly"

"get this fixed pretty quickly" != "change the search algorithms"

I can usually get the hacker news by just googling for "hn"

If you use Google Webmaster Tools it will in-fact tell you all of this information without having to ask someone at Google.
Interesting, but not disturbing. I think it's just a consequence of people at Google initially reading HN for the stories, then finding that the HN community forms a useful 'brains trust'. I for one am glad to see that HN has a hotline to the Kremlin ;)
Baseless comment. If they don't help, you will say - HN is one of the top sources of tech news in the world and it is surprising no one from Google is responding to such an obvious mistake. If they help, you comment as stated.

Secondly, HN is one of the most popular sources of tech news. By responding to this issue, Google is helping bring out possible solutions to a common problem - that crawlers are too frequent and affect the performance of a site. People like you and me can read the responses and learn what can be done to control this.

Third, popular websites like HN deserve attention like this because of the following they have garnered over the years. If,say "hahla blog" doesn't show as a top result in Google - it will take time for Google to verify whether it should even be shown as a top site. But, if something like HN or Amazon.com doesn't show as top result when users explicitly searches for it - that is a darn good use case for Google to fix the issue. It is an indication of something is definitely wrong somewhere that needs to be fixed.

> Maybe from their standpoint it would be more beneficial to google users who are used to typing in 'hacker news' to visit this site, but since when did that matter to google?

from google's point of view, a user who types "hacker news" into google is almost certainly looking for news.ycombinator. therefore, it is highly desirable for the site to be the #1 result.

Disturbing yes but not surprising, HN is influential. Now whether it's hacking news or entrepreneur news is another issue.
What's more interesting is that people ONLY feel the way you do when the company in question is big. If it's a tiny startup doing some sort of unscalable marketing or customer support, nobody blinks an eye. But if a Google-sized organization does it, it's shocking or disturbing.

I think your best bet is to realize that, regardless of size, companies are always made up of people. And people are opinionated, subjective, and prone to making decisions that fall outside of some standard set of rules.

Completely agree with you Hahla - amazing to see the Googlers jump in including Matt Cutts so quickly to support PG and HN and YC yet 99.99999999% would never ever see support like this.
Let's be real here. Do 99.9999...% of users need or deserve this kind of support? It's easy to have sour grapes and say "oh the big shots get attention but I don't! Not fair!". But then you have to remember that people like PG worked their ass off to get a site like this to become so valuable and popular. We're not talking about some guy's blog about his cat that no one cares about, we're talking about HN here. Is it equal treatment? Of course not but is it fair? I say it is fair.

If fairness is treating people based on merit then it's totally fine. Matt isn't in here saying he's going to manually manipulate the result or change anything on Google's end (save for maybe correcting a flaw that would benefit everyone who's in a similar situation as HN). All he's doing is suggesting possible causes, trying to make a diagnosis, and just troubleshooting.

Most people who want this attention don't deserve it. They're the type of people who won't use the resources available like the google help documents or even learning how to administrate a site for best results in search engines. Plus, most people's website just aren't worthy of this attention.

I could go on but lets look at the reality here. We're all knowledgeable of how the web works here so let's just admit that HN is a damn popular site and it definitely seems like a fluke to have it rank as it was when this was submitted.

If you don't know somebody that works for Google at this point in your career, you're working in the wrong industry.
~30k employees at google .. what makes you think they hold all the keys to the millions of web developers across the globe?
What algorithm changes?
Was the query for a domain name or just hacker+news? How do we know?

Does <title> make any difference?

I don't think of this site as "Hacker News". I think of it as ycombinator, and the subdomain, news.

Should users of hackernews.com think of that site as something else, e.g. whatever is between the title tags?

A searchable list of domain names, ranked by popularity. Or even a searchable list of main page titles. Is that how some users are using Google? If so, Google does not need a full, current cached copy of the crawlable web to provide that.

Unsure. On several occasions I got the distinct impression that a lot of members aren't actually interested in "hacking" (for any of its definitions), instead you get people asking about the value of jailbreaking a phone: http://news.ycombinator.com/item?id=3169000

I suppose it is because the top hits below HN are perhaps even less about "hacker news" ?