Hacker News new | ask | show | jobs
by chrismonsanto 5600 days ago
Of course I've done a Google Scholar search for Alice ML. There is roughly one admissible source -- "Through the looking glass" in Trends in Functional Programming. Small workshop paper, 12 citations according to CiteSeer. Doesn't show up in the ACM Digital Library. There are a number of other papers on Saarland University's website, but they are either 1) not peer reviewed (tech reports) 2) not cited or 3) not actually about Alice ML. To illustrate #3, consider "A concurrent lambda-calculus with Futures". This is a relatively well-cited paper, and it mentions Alice ML (a sentence or two?), but it isn't actually about Alice ML! It describes a new language construct and models the semantics in the lambda-calculus. In this case, wouldn't it be the construct, not the language that is notable?

I gave the following example in my Wikipedia talk page: the first paper on functional reactive programming is very famous. It describes a DSL called Fran. However, Fran could have not been given a name and the paper still would have been influential, because it wasn't the language itself that had the impact, it was the idea of functional reactive programming that had an impact.

(For the record, I got an A in PL! We used Coq. And proved a lot of theorems, mechanically.)

11 comments

I think you're confusing the "reliability" of a source (in the peer-reviewed article sense) with its reliability as an indicator of the notability of some subject X.

The peer-review system basically establishes (a) the reviewers believe the result is non-trivial and (b) the reviewers believe the result is correct.

However, I'd say that the very existence of documents such as third-party tutorials/introductions/discussions, etc., can be adduced as evidence of notability. Think of determining notability more as "sociology" than some academic judgement about the intrinsic worth or originality of the topic.

For instance, "article citations as notability" is patently absurd in the case of, say, celebrity or TV show articles (the existence of both of which seems accepted on Wikipedia) or even news topics, and only somewhat less so in the case of PLs, especially those developed outside of academia. (It's arguably a bad criteria anywhere where results aren't very expensive to reproduce, such as population surveys and lab results -- surely many important results in math and physics living only on the arXiv are more notable than some minor topic which generates higher citation counts through constant re-citing by the same group of devotees).

In contrast, the three languages mentioned on the front page were all ones I'd previously heard of.

I see there is a paragraph in the Wikipedia guidelines for notability regarding refereed paper citations, but in line with my above comments I'd suggest that this not be read too literally. Perhaps it's personal preference, but I find such "long tail" articles useful, even if that means WP contains millions of articles on topics I personally find irrelevant.

Perhaps the reason you've incited such anger is that people feel you're imposing what can be seen as elitist and subjective views about how notability is defined (certainly it's not true that everyone's pet/undergrad PL project should have a page, but the three languages mentioned certainly have received wide attention). From this perspective, the question here is not about any particular property of these PLs but what the WP criteria are or should be, so it would certainly be best to err on the side of non-deletion in all cases.

I hope you reconsider your views on this topic (disclaimer: I am not associated with any of the projects mentioned).

> Perhaps it's personal preference, but I find such "long tail" articles useful, even if that means WP contains millions of articles on topics I personally find irrelevant.

So do I. With an encyclopedia of universal scope, there are bound to be articles on things that any particular person doesn't care about. The important thing is that for each article, some people find the information useful.

Wiki is not paper.

Alice has influenced programming languages and programming language research. http://lambda-the-ultimate.org/node/440 being just one example.

PL researchers are most excited about the ideas in the language. It is not surprising at all that they do not go tooting their horn with gay abandon about a particular implementation (or coursework grades). The vehicle of that idea is no less important.The notion of an electron may be more important than the instrument used to discover it, but that does not mean that the instrument has no place.

I would argue that the deleted articles are more important than an articles on, say, Java which does not bring any new ideas and I hear about it all the time. I don't need Wikipedia for it. But I do need it for languages like Pure.

I think it is more important that articles about relatively obscure but influential languages like Alice are preserved. It lets a potential CS student get excited/interested in something and pursue it further. Not everyone will be digging up old PL folklore. Wikipedia is one of the few, largely spam free, venues where on could come in contact with them.

Wikipedia serves an important purpose of disseminating knowledge in an accessible form to all. If any purpose was at all served by the deletion, it is hardly much different from dilettante vandalism: destruction of potential value to a reader.

What about human languages. There could be a human language that has remained isolated for the most part and only few speak it, that in itself will be a fascinating thing to know about. Does it mean that Wikipedia will not have space for an article on it.

Peripherally: many if not most grad students would get an A in PL if they are taking that course, and will prove theorems if one works on theorem proving or verification. I don't see anything special about that.

did you ever stop to think what positive impact the deletion of these articles actually makes? I cant come up with any positive contribution you're making, but I can see that if I saw one of these languages mentioned somewhere (hacker news for instance), you've now ensured I have one less place that I can go to to find out about them.

you're a drain on wikipedia, a source of negative knowledge.

> Did you ever stop to think what positive impact the deletion of these articles actually makes?

As far as I can tell, the only positive contribution that deletionism makes to the welfare of humanity is it makes individual deletionists happy when they destroy other people's work.

I wonder if anyone has done research into possible links between deletionism and mental disorders such as OCPD?

There's lots of pages on Wikipedia that are basically self-promotion, or descriptions of inconsequential "news" events, or "micro-celebrities," or moderately rich people, or other stuff that will have no relevance in 10 years, except to people with OCPD who like to hoard information.
You're arguing that many pages in Wikipedia aren't useful. I don't think anyone is debating that, but are these pages actually harmful?
Yes, some are harmful in that promotion disguised as objectivity is misleading.

That's a whole separate issue from notability, though. On this and many other issues I think the notability bar is set too high. I'm much rather see a wikipedia page on an obscure language than not, and I have read a few in the past.

I like to click on links in Wikipedia articles to find out more about things related to the main article. I consider wasting my time harmful.
I think this is far too harsh.

>"you're a drain on wikipedia, a source of negative knowledge."

Notability requirements are there for a reason. If you think it's notable then there are clear methods on Wikipedia to press your case and establish the notability without stooping to direct personal criticism.

I've not looked in detail but from the couple of posts here it seems the languages in question are esoteric and not especially notable in themselves. Wikipedia is not intended to be a dumping ground for every last bit of trivia.

Unless Alice, for example, has made some profound contribution to CS then it doesn't deserve it's own entry based on the limited number of papers available - even then it may not, the contribution could be listed elsewhere, perhaps on a general page about similar contribution or on the page for the language that has been most impacted.

Wikipedia isn't supposed to be a replacement for use of a search engine.

IMO adding millions of articles which are not going to get beyond stub status and not going to be maintained and take resources (editors) away from other articles is not a wise move.

If you genuinely believe that this is worth effort then there is nothing stopping you from taking the removed articles and starting/contributing to a programming language wiki. http://esoteric.voxelperfect.net/wiki/Language_list appears to be a good place to put this stuff. Or, as I said at the start demonstrating the notability and having the articles reinstated in Wikipedia.

> Wikipedia is not intended to be a dumping ground for every last bit of trivia.

One person's trivia is another person's useful information.

> Unless Alice, for example, has made some profound contribution to CS then it doesn't deserve it's own entry based on the limited number of papers available

If significant numbers of people are interested in the subject, then having an article about it promotes the public good. Deleting the article harms these people.

> even then it may not, the contribution could be listed elsewhere, perhaps on a general page about similar contribution or on the page for the language that has been most impacted.

If you have a phenomenon with its own name, it's better for a subject to have it's own article, rather than be a note in another article, because then it's easier to find.

> Or, as I said at the start demonstrating the notability and having the articles reinstated in Wikipedia.

Why should the onus be on hard-working Wikipedia editors to show their articles are worthwhile? Instead, it should be on the vandals who want to destroy work.

>If you have a phenomenon with its own name, it's better for a subject to have it's own article, rather than be a note in another article, because then it's easier to find.

In a world without word indexing that "easier to find" matters a lot.

Your statement directly contradicts Wikipedia guidelines as linked in my previous entry.

>Why should the onus be on hard-working Wikipedia editors to show their articles are worthwhile? Instead, it should be on the vandals who want to destroy work.

Save your spin and lets stick to the issue. The simple answer is "to maintain Wikipedia as an encyclopedic tome with a usable quality".

If you wish to create a quality work of notable articles then it's necessary, IMO, to require an argument for inclusion rather than demand an argument to exclude something.

AFAIK no one is here to destroy worthwhile information, I certainly haven't contributed to Wikipedia to that end, but for some types of information Wikipedia is not the right place to put it. You'll see I linked to a wiki for programming languages, the info fits there well, it can be linked to from a Wikipedia page on more general programming information; why is it essential to duplicate such information?

As far as I can tell, Wikipedia does not have clear methods (or at least not effective ones) for establishing the notability of things like programming languages, and some editors have failed to use their heads to adapt the guidelines to the situation. The fact that Heather Kuzmich (to save you the trouble of looking up her yet extant Wiki page: She was a contestant on "America's Next Top Model" who suffered from Asperger's) appears to be more notable than Alice ML shows the uneven standards of notability pretty clearly.
35 citations according to Google Scholar.

So to count for notability:

- An article can't be from a workshop?

- It must solely be about a topic?

- It must be peer reviewed?

At this point you must acknowledge that your own complicated rules for notability have diverged a long way from those stated anywhere else.

I agree with peer review as a criterion for being a reliable source, and that you need reliable sources for notability.

But the other criteria are silly.

Is Strachey's non-mainstream but very influential GPM notable? It has only one journal article about it, with a paltry 65 Google Scholar cites, and the other articles that mention it are all mostly concerned with something else.

Ha, I actually looked through the SERPs that Google Scholar gave me listing 33 citations - 18 were obvious false positives. The other listings appeared a priori not to be a completely different field of discourse (though several looked very dubious from the titles.

I've also now reviewed the AfD comments. It seems that no reason was given for why Alice ML was notable beyond it's inclusion and reference in other articles. The proposer was clearly knowledgable about the subject and had reviewed the available literature. The deletion consideration seemed logical and well founded. The failure if there was one was there not being someone else who knew the subject matter and could provide a good reason for inclusion.

I'm sure that someone has already mentioned this, but "A concurrent lambda-calculus with Futures" is a paper about AliceML. The problem is that the full syntax and semantics of StandardML contain a lot of features which are not very relevant to the discussion of the semantics of futures.

Instead, the authors describe the semantics as an extension of lambda calculus. This is both more compact and more useful to other researchers (who might not be working on an ML dialect).

That said I would be grateful to you or anyone with a backup who could restore the article. Looking at the google cache for this page, it was more than a stub and actually did link to a page with references...

Alice is still one of the first semester languages for CS students at Saarland university. I think it would be advantageous if you could find information about related languages and additional features on Wikipedia.

Wikipedia has lost its way. Thanks to the ${nearsighted} people like you who impose artificial constraints on what's in, what's out. It used to be a crowd-sourced collection, now it has managers and pretends to be an encyclopedia designed top-down and filled bottom-up.

/* I would like to use a different word instead of nearsighted, but have to maintain decency. */

Stop deleting knowledge, period. It's not your personal job to police what the world is allowed to know.

I was unbelievably pissed off when I searched for "Alice ML" and found a hit on wikipedia via google and come to see the page deleted.

Deleting a well-written, well-sourced article on the basis of notability reduces the total information of Wikipedia.

I didn't say that. Someone else said it. I just heard it.

I got involved with the deletion debate over the Go! programming language, and was among a handful of people who actually tried to reason out how Wikipedia's notability policies could be improved.

The fact is that Wikipedia is full of articles about people's own pet research projects which have no users and no impact.

Your arguments are solid and worth thinking about, but as you can see, nobody wants to address the wider problem of Wikipedia's poor notability guidelines directly, and have let this devolve into a flamewar.

Yeah, this is probably the most disappointing HN flamewar yet, full of outrage and attacks but very little information or people wanting to educate themselves. There's lots of things that can be improved on Wikipedia, but it requires people to: 1) become familiar with how things work; 2) become familiar with why things came to be that way; and 3) come up with workable proposals to improve things that take into account various problems those proposals themselves could cause.

But pitchforks and flamage are easier, I guess. It's embarrassing how similar this response is to what happens when some minor band's article gets proposed for deletion, and all the angry folks from the band's mailing list show up arguing that Wikipedia Fascists Are Damaging Knowledge.

You're assuming that people don't know how Wikipedia works. I think most people who are angry with arbitrary deletion on Wikipedia have probably at some point read through enough of the WP: articles to understand why it's offensive.
Judging by the discussion so far, I would be willing to wager that most people in fact are unfamiliar and have not read anything.

One experiment that could be useful is for people to just follow the "Articles for deletion" page for a few days when it's not currently focused on something they're personally big fans of. Say, next Monday through Wednesday. Then everyone would be much better informed to explain what's wrong about the process and what to do about it. They'd also have a better understanding about the kind of crap that gets regularly deleted, i.e. why the rules exist in the first place, which is what any reform proposal would have to also account for.

But I think most people are just jumping in because someone said something bad about a thing they're a fan of, hence the mob style.

Except that this happens over and over again.

Wikipedia's policies piss everyone off in turn, but it is in turn. The fact that each new community isn't aware of wikipedia's ludicrous policies isn't their fault, since they've had no cause to wrestle with the problems or nuances.

The fact that wikipedians are tired of the debate doesn't mean that they're right, it means that they've done a bad job communicating the problem, dealing with the consequences, and are frankly out of touch with the majority of wikipedia's readers (who outside of a subset of wikipedians thinks deletionism is a good idea?).

The fact that people defending wikipedia on these issues always treat the aggrieved fans/community with such contempt really is part of the problem.

There are certainly some problems, but I don't think they're all on one side. I think that the fact that people who don't spend any effort understanding the problem or trying to improve things treat Wikipedians with such contempt is also part of the problem.

Figuring out how to write an encyclopedia isn't a particularly easy problem, and a lot of people have spent some effort trying to balance things like, on the one hand, wanting coverage of everything, and on the other hand, not wanting physics kooks spamming up Wikipedia with their fringe theories. Lots of people disagree on how to do it, but I think people should at least make some effort to understand things rather than seeing it through the narrow tunnel vision of "the thing I'm a fan of deserves special treatment and Wikipedians don't see that so they're idiots". As in open-source, it's to some extent a contrib-ocracy; if you don't help improve the encyclopedia at all, but only show up when some external fan community is aggrieved, that sort of "contribution" isn't appreciated any more than it would be at LKML (they get such influxes now and then too, e.g. from fans of a module that wasn't accepted for merging).

> who outside of a subset of wikipedians thinks deletionism is a good idea

I actually see the opposite criticism at least as often! Wikipedia's often attacked in academic literature, and some news stories, for being filled with "Star Wars and Pokemon cruft", and not paying enough attention to reliable sources or people with expertise in various fields. There are also periodic controversies about it including articles on borderline-notable people who object to their inclusion. Most forks, like Larry Sanger's "Citizendium", have been based on the Wikipedia-is-too-loose-with-its-standards criticism, rather than the opposite one. Not that I agree with it, but it does seem to be the most common criticism outside of fan communities.

> They'd also have a better understanding about the kind of crap that gets regularly deleted

Most of it isn't crap.

For example, I looked at the top 5 articles listed on 13th February -- http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion...

#1 is an internet marketing company. It has a website which verifiably exists. People considering doing business with them would find this useful. It should be kept.

#2 is an academic. That makes the article useful. OTOH, she's hasd death threats and wants the article down. I'm not sure what I'd do here.

#3 is a sportsman. Should be kept: even if he isn't well known, those who do know about him will likely find the article useful.

#4 is about Swiss people in Sri Lanka. If I was Swiss and was considering relocation to Sri Lanka, I might find this useful. Should be kept.

#5 is about "XMLmosaic", something I've never heard of, and has been speedily deleted.

A majority of these articles (at least 60%) are net-useful to humanity and should be kept. If I did a larger sample, I'm sure I'd get similar results.

About 100 articles a day are posted on AfD and most of these (75-80%) are deleted. What good is served by destroying tens of thousands of useful articles every year? None.

One good thing is that it keeps Wikipedia a tertiary source. I personally have a fairly inclusionist perspective, but my perspective is more or less: Wikipedia should cover anything on which any decent third-party source can be found (and it should point the reader to the source, so they can look it up themselves if they want).

There is of course stuff on which internet users can collect useful information, where no current sources exist to cite, but there's no reason Wikipedia has to be the only wiki on the internet. For example, Know Your Meme is an excellent project to document current internet memes, including via original sleuthing by its editors, who try to track down the history of memes and reconstruct their paths. It's not really a tertiary source summarizing the existing literature though; it's a sociological/historical research project. And it's great that it exists. Why does it have to exist on wikipedia.org, a project that has different goals, which don't include doing original research? Of course, memes should also be covered on Wikipedia, but only when Wikipedia can cite some existing published source documenting the information (a news article, a sociology conference paper, anything really).

I guess it seems perfectly fine to me that there's more than one project on the internet dedicated to collecting knowledge, with different goals; I don't see why Wikipedia has to be the union of all possible wikis. If anything, I would prefer more projects with different goals and approaches to exist, rather than everything being so centralized. That way I can go to Wikipedia if I want the tertiary-source take on internet memes, and I can go to Know Your Meme if I want the primary-source take on internet memes. Same with a Wikipedia article on a location versus a Wikitravel article on a location: both useful, but I don't see why they have to be merged into one project (or why that would be helpful).

Would you describe yourself as a hoarder? Applying your argument to personal belongings would appear to mean that unless it's actually completely unusable for anything it should be kept - even excrement is useful (very), as are old drinks cans (perhaps you could make a solar reflector), as are worn out clothes (make rags for cleaning), as are cellophane wrappings (could be heat welded and used as transparent insulation), ... sorry I'm digressing wildly.

Anyhow, the measure of inclusion on WP is not "isn't crap" but "is it notable" and whilst that gives a huge range to argue over it's is still clearly different to the question of "is it crap".

Take your sportsman - do you think that WP should be exhaustive? What is it about them that's notable and can be confirmed from other references?

> The fact is that Wikipedia is full of articles about people's own pet research projects which have no users and no impact.

I browse WP a fair bit. I've never come across any such articles.

Even if they do exist, what's the harm? People not interested in the topic won't read them, people who are interested will find them useful.

Thanks for posting here and addressing the topic.
So after all this outrage, do you agree that maybe you took it just a little bit too far?