Are there any tools or tips to help speed up the "which JOIN is duplicating data" hunt?
Usually my biggest problem is getting all the query parameters lined up to reproduce the issue! (Being able to flip on extended logging or a profiler can make this easy.)
Cutting out the result columns when disabling JOINs to narrow it down is straightforward but tracking columns down in WHERE clauses quickly tends not to be.
Frequently this is trivial, sometimes it's not.
If there will be multiple hits but it doesn't matter that much, there's the obvious TOP 1 or MIN(col) and such.
It's a tradeoff between accidentally breaking the query and returning unexpected data.
Note that if you used join you could have bigger issues as the join would succeed but now you got multiple rows where you didn't expect.