Show HN: Duplicate Word Finder (duplicateword.com)
49 points by finnhvman 1217 days ago
8 comments

I always wondered whether they use something like this for BBC Radio 4’s “Just a minute”.

For those not familiar, it’s a radio panel game where the panelists must speak on a subject for up to a minute without hesitation, deviation or repetition.

I imagine they would need a screen like this to highlight repeated words for the host (perhaps in combination with some speech recognition, or stenography).

The competitors are free to protest and try to gaslight you into believing you made a mistake. Part of the fun is being able to give a good explanation as to why the protest is wrong. The game is a friendly social one, at least the one in Swedish is done like that.

https://en.m.wikipedia.org/wiki/P%C3%A5_minuten

The original BBC one is like that. It’s supposed to be fun and not too pedantic, and as far as I know Nicholas Parsons didn’t rely on anything other than the whistle to know when the minute was over.

It’s a very fun radio show. I recommend the radio collections; the available episodes are on Audible (or can be found by other means).

As a side note, do you know where one can get hold of the Swedish one? I know SR doesn’t have it anymore, at least.

Speaking on a subject probably makes it more complex, but reading your description I had a sudden memory of Timmy Mallett's word-association game, from the UK back in the day.

The aim there is to reply spontaneously to random words, without hesitation or repetition. If you fail you get hit on the head by a foam mallet.

That would ruin the entire thing.
Duplicate word removal seems like a naive, coarse-grained method to improve writing style. Smells like the high school trope of throwing a thesaurus at your essay and hoping for the best.
Yep, it's actually what we were taught in high school.

Still, sometimes recognizing duplicate words points out some bad/imprecise wording where you're repeating yourself and may not like that.

I find myself using the "edit" feature on comments online a decent amount to catch/add things I only found when reading it back in complete form.

Uncritical duplicate removal, yes. But I edit a lot of work, and duplicated words are one of the things that I see time and again, really dragging writing down. IMO, removing duplicates is one of the cheapest wins, a very easy way of making improvements so long as you do it at all reasonably.
It doesn't even find inflections, such as plural/singular. But as a writing tool, it can only lead to contrived constructions that seem to be the realm of sports journalists: "The multiple medal winner" ... "Last year's 500 meter champion" ... "The Londoner" ... "The former law student" ... when referring to one and the same person during a race.

I do like that it lists them on the side, and allows you to remove them.
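
On the inflection point, here is a rough sketch of how singular/plural pairs could be grouped before counting. This is not how the tool works; `normalize` and `duplicate_groups` are hypothetical names, and the suffix rule is a deliberately crude assumption rather than real stemming (it mangles words like "boxes" and "this").

```python
import re
from collections import defaultdict


def normalize(word):
    """Crude singularization (an assumption, not a real stemmer)."""
    w = word.lower()
    if len(w) > 4 and w.endswith("ies"):
        return w[:-3] + "y"          # "stories" -> "story"
    if len(w) > 3 and w.endswith("s") and not w.endswith("ss"):
        return w[:-1]                # "winners" -> "winner"
    return w


def duplicate_groups(text):
    """Group the words of `text` by normalized form; keep groups with 2+ members."""
    groups = defaultdict(list)
    for word in re.findall(r"[A-Za-z]+", text):
        groups[normalize(word)].append(word)
    return {key: words for key, words in groups.items() if len(words) > 1}
```

A proper stemmer or lemmatizer would be the obvious replacement for `normalize` if this were taken further.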

Someone commented something similar on reddit a while ago, and it is a very good argument to be honest. Still, you might find some situations where the tool might be useful and it's already deployed, so people can use it if they want
ChatGPT: "Are any of these sentences repetition of each other"

Most educational literature from the US is rife with sentences that might not be verbatim repetitions but most certainly say the same thing with a different word soup.

Feature request: find duplicates within words that contain underscores or hyphens.

I want to use this for reading documentation. I often find myself struggling with acronyms like JWT_AUTH and JWT_OKTA so this would help me visually identify and mentally map the use of common terms.
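
A minimal sketch of what this request might look like, assuming compound tokens are runs of letters and digits joined by underscores or hyphens. `shared_parts` is a hypothetical helper, not something the tool provides.

```python
import re
from collections import defaultdict


def shared_parts(text):
    """Map each sub-part to the compound tokens containing it (2+ tokens only)."""
    # Compound tokens: alphanumeric runs joined by "_" or "-".
    compounds = set(re.findall(r"[A-Za-z0-9]+(?:[_-][A-Za-z0-9]+)+", text))
    groups = defaultdict(set)
    for token in compounds:
        for part in re.split(r"[_-]", token):
            groups[part.lower()].add(token)
    return {
        part: sorted(tokens)
        for part, tokens in groups.items()
        if len(tokens) > 1
    }
```

For text mentioning both JWT_AUTH and JWT_OKTA, this reports the shared part "jwt" together with the two tokens that contain it.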

There is no no use for that.
I'm sure you've thought about every single possible situation to conclude your insightful comment.

Otherwise...

There is no use for your comment.

Re-read their comment.
If they didn't notice the duplication, there is no use in removing it?
But this tool would have helped them spot it in the first place.
But for what purpose, then?
Write "the the the the the the"

Result: "No duplicate words found"

Seems reasonable that those types of words are eliminated for the purpose of the tool. Either by having a letter limit or by whitelisting them. Of course there will be duplicates of "the" and "and".
Unfortunately, it's just a letter limit; lower it and the "the"s get highlighted. Support for stop/common words would definitely be a good upgrade.
They're called "stop words".
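
To make the difference concrete, here is a sketch of the two filters discussed above: a minimum word length and a stop-word list. `find_duplicates` and the tiny `STOP_WORDS` set are assumptions for illustration, not the tool's actual code or word list.

```python
import re
from collections import Counter

# Assumed, deliberately tiny stop-word list; a real one would be much longer.
STOP_WORDS = frozenset({"the", "and", "a", "an", "of", "to", "in", "is", "it"})


def find_duplicates(text, min_length=1, stop_words=STOP_WORDS):
    """Return {word: count} for words that occur more than once."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(
        w for w in words
        if len(w) >= min_length and w not in stop_words
    )
    return {w: n for w, n in counts.items() if n > 1}
```

With `min_length=4` and an empty stop-word set, "the the the" yields nothing, which matches the behaviour reported in this thread. A stop-word list gives the same result for "the" without also hiding every other three-letter word.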
Perhaps a better model might be to look at word frequency analysis, and highlight words that are used substantially more frequently than in typical English text.
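
A rough sketch of that idea, assuming the caller supplies a `baseline` mapping of word to its relative frequency in typical English. `overused_words` is a hypothetical helper, and the baseline numbers in the usage note are made-up placeholders, not real corpus statistics.

```python
import re
from collections import Counter


def overused_words(text, baseline, ratio=5.0, min_count=2):
    """Flag words used at least `ratio` times more often than the baseline rate."""
    words = re.findall(r"[a-z']+", text.lower())
    total = len(words)
    if total == 0:
        return {}
    flagged = {}
    for word, n in Counter(words).items():
        expected = baseline.get(word)
        if n < min_count or expected is None:
            continue  # too rare here, or no baseline: skip rather than guess
        observed = n / total
        if observed / expected >= ratio:
            flagged[word] = round(observed / expected, 1)
    return flagged
```

For example, with the placeholder baseline `{"the": 0.05, "really": 0.001}`, the sentence "the plan is really really good and the team is really happy" flags "really" but not "the". On short texts nearly every repeated word looks overused, so the threshold would need tuning against real frequency data.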
Tried “Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo” and it highlighted every word. Tried “James while John had had had had had had had had had had had a better effect on the teacher” and it had no issues with it.
you can use the first slider on the toolbar to set the minimum word length, default is 4 (or the third button on the toolbar if you're on phone or smaller screens)
You might have to adjust the minimum word length to 3.
When I press the cmd key on my mac on Chrome, all the text disappears.
The initial text on the site is the instructions. I can agree it's a somewhat questionable design, but I wanted to make sure that the instructions are there for newcomers and easy to clear for the regular users. So any action on the editor will clear the instructions, but the text you type is safe.
It’s very surprising, the instructions disappear and there’s no obvious way to get them back. Undo doesn’t work, for example.

Maybe consider selecting all the text on the first click/focus in the textarea. That way it’s still there but a regular user can just paste, type, or hit backspace to get rid of the instruction. Undo would let you get it back.

Nice to see Svelte in the wild here, by the way.

Good ideas on the instructions, will work on it!

Yeah, coming from React I really liked coding in Svelte, hope to see it grow more!

Find duplicate words and eliminate them from your writings
Added text, page hangs.
was it a long text?
268 lines.