The original researcher knows in advance what the ratio is, yes, that's my point. I'm illustrating that the research is not very good. They couldn't even identify women to take part in the study. Given the numbers involved, it certainly isn't Facebook-ready.
In general, I don't believe it is possible to distinguish male and female typing patterns.
What you might be recognising is how people learned to type combined with the size of their hands - that might partly but not exactly break along gender lines. Bucketing people on that basis is just a recipe for awkwardness.
Fabricating facts and using ad-hominem is not a very good way of backing up your arguments.
Quote from the paper:
We use the public GREYC keystroke benchmark database for this work. It is one of the largest databases (in term of number of users and sessions) in keystroke dynamics. To out knowledge, no existing database contains more individuals. In order to reduce the bias due to this high quantity of male information, we only kept the first n male samples( where n is the number of female samples).
( Don't bother with your response, I won't be reading it. )
>We use the public GREYC keystroke benchmark database
Yes. That's their own database which they're talking up, the one that they made to do this research. That's what I was talking about.
>In order to reduce the bias due to this high quantity of male information, we only kept the first n male samples( where n is the number of female samples).
It happens that I didn't read this part.
On reflection, what I understand now is far worse than what I originally understood:
- They have 35 females and 98 males, they take many handwriting samples from each.
- Since the participants provided many samples, these samples appear both in the training set data and in the test set data.
- I use the training set data to figure out if I can recognise the handwriting of the 35 female participants.
- Then I look through the test data to see if I can identify those participants again.
Basically what you've shown is you can identify the handwriting of 35 people if you've already seen it - 88% of the time.
Splitting groups into 'female' and 'male' is a red herring. This method would presumably work, even if I split them into two random groups.
In general, I don't believe it is possible to distinguish male and female typing patterns.
What you might be recognising is how people learned to type combined with the size of their hands - that might partly but not exactly break along gender lines. Bucketing people on that basis is just a recipe for awkwardness.