Hacker News new | ask | show | jobs
by pbhjpbhj 6102 days ago
I know you're trying to be correct but using data in this way is annoying - "the data shows", not because it is necessarily right (it depends) but because it sounds right. If it helps then just consider data the singular noun for a block of information.

"The data shows murder causes death" sounds fine to me, nothing jarring about that except that it's probably not true "the data shows murder is a cause of death" (murder doesn't cause all deaths usually depending on the locus).

2 comments

data shows murder causes death

Are you making the claim that the word "data" is singular? It's not. And when you use a collective noun, you use the singular form (datum, in this case).

Along this line, though, what does sound jarring to me is the recent trend among the tech lit (and maybe others) to use company names as plural, as in, "Microsoft are keeping quiet about their new product." This is totally non-standard. First of all, Microsoft is a singular entity, identified by its articles of incorporation. What these authors are doing is using a lazy shorthand for, "Members of Microsoft management are keeping quiet about their new product."

See, for instance, "The Bedford Handbook for Writers" http://www.bedfordstmartins.com/newcatalog.aspx?isbn=0312595...

To pile on the pedantry: "data" is one of those words that can be used in either a singular or plural construction. The OED has references going as far back as this one, from 1807:

W. IRVING Salmag. xviii. 366 My grandfather..took a data from his own excellent heart.

(Further proof that English is not Latin.)

I agree that it's a tough game to standardize usage, especially when different "experts" don't agree (I would guess that you could find writing by other authors circa 1807 that used "data" as a plural). Currently, you find the use of "gender" confused with the use of "sex." Over time "gender" will probably come be used generally for male/female, and sex for reproductive references. But the first such uses were non-standard.

Perhaps it's best to leave it at, "know your audience."

At the least, "The data show murder causes death"