Hacker News new | ask | show | jobs
by rplnt 2262 days ago
There was a study, IIRC from Israel, which said this is still reliable with 64 samples mixed.

The next question is, do you need to test the individual samples then? Not sure how much available material for testing you have, but you might be able to just divide the mix to eliminate bigger groups first.

Or even mix parts with new samples. There must be an ideal procedure (throughput-wise) if you know how the range of positives to expect.

2 comments

You can do a binary search to efficiently locate the positive sample.
Note that optimal branch factor would probably depend on the expected negative rate. If 98% of your samples are coming back negative, it may be more optimal to divide by into 4 or 8 groups instead of 2 groups.
But what if there are two (or more?) positive samples? It's an interesting problem.
If the initial test size is 64 and contains a positive, split into two groups of 32 and test each. Continue splitting and testing when a positive is encountered.
Yeah, but this version is not necessarily optimal. If you have 4 positive samples, then you will test 2+2+1+1+1+1 = 8, instead of 4 in the naive case.

It feels like a problem for information theory.

I didn’t say it was optimal, but it is a huge improvement on doing 64 tests for 64 samples.
Oh, sure!
You keep binary searching each part. I.e. if both 32/32 subsamples turn positive, you perform binary search on those 32 samples.