|
|
|
|
|
by brookst
1213 days ago
|
|
I’ve got a fun little side project that uses GPT. I tested gptzero against 10 of my projects’ writings and 10 of my own. It detected 6 out of 10 correctly in both cases (4 gpt-written bits were declared human, 4 human-written were declared gpt). Which is better than 50% but not nearly good enough to base any kind of decision on. |
|
Unrelated: p-value for getting 12 from 20 correct just by chance is ~0.4 that is there is not enough data for the conclusion "better" in this case.
Null hypothesis: 50%/50%, the result random, normal distribution: