The detection rate is worthless, an algorithm that says anything is Chatgpt would have a detection rate of 100%. What would be more interesting than that is the false positive rate but they never talk about that.
The detector provides an assessment of how likely it is that all or part of the document was written by ChatGPT. Given a sufficient amount of text, the method is said to be 99.9 percent effective.
That means given 100 pieces of text and asked if they are made by ChatGPT or not, it gets maybe one of them wrong.
Allegedly, that is, and with the caveat of “sufficient amount of text”, whatever that means.
A false positive is when it incorrectly determines that a human written text is written by AI. While a detection rate of 99.9% sounds impressive, it’s not very reliable if it comes with a false positive rate of 20%.
The detection rate is worthless, an algorithm that says anything is Chatgpt would have a detection rate of 100%. What would be more interesting than that is the false positive rate but they never talk about that.
That means given 100 pieces of text and asked if they are made by ChatGPT or not, it gets maybe one of them wrong. Allegedly, that is, and with the caveat of “sufficient amount of text”, whatever that means.
It’s actually 1 in 1000, 99.0% would be 1/100.
A false positive is when it incorrectly determines that a human written text is written by AI. While a detection rate of 99.9% sounds impressive, it’s not very reliable if it comes with a false positive rate of 20%.
I know what a false positive is, and it’s not a thing when talking about effectiveness, they claim it gets it right 99.9% of the time.
Right, I see what you mean now. I misread your comment as explaining something that was already clear.