It's crazy how vast and powerful databases of even the most simple things turn o...

rytill · on Dec 27, 2020

I once did an experiment on identifying based solely on keystroke timing:

https://github.com/mturnshek/keystroke-timing-identifier

I did not test it on a large group, but it worked pretty well for a group of ~8 styles and minimal data. I believe a deeper model with enough data could identify on a large scale through timing alone.

ksaj · on Dec 27, 2020

I believe this would work at most scales.

Have you noticed how, for example, you can identify a guitar player easily, no matter what guitar they're playing on?

When guitar players are first learning or are transcribing, they often slow the music down. If you slow it down enough, you can hear a pattern of the player slowing down and speeding up (behind and ahead of the beat). I believe that has a lot to do with that instant-identifiability, even though it is so hard to detect at speed.

fouronnes3 · on Dec 26, 2020

All valid points, but to be fair you can already do all those things with regular documents.

BorisMelnik · on Dec 26, 2020

the difference being the database in this case is a culmination of god knows how many accounts on the www several TB-PB of data so it bit more scary.

Xenograph · on Dec 26, 2020

The point is that all the data is centralized in a single database in this case.

bluntfang · on Dec 26, 2020

sure and i can find out your real time location by hiring a PI to follow you around all day, but it's a lot cheaper and easier if i just pay google for it. plus i cant hire a PI to follow everyone around, there's a cost to that.

saadalem · on Dec 26, 2020

Carmack2Vec: Identifying and quantifying productivity through keystroke patterns.

o-__-o · on Dec 27, 2020

I studied this vector for attack in college[0] almost 20 years ago based on work done in 1986[1]!

[0] - https://dl.acm.org/doi/abs/10.1016/S0167-4048(03)00010-5

[1] - https://patents.google.com/patent/US4805222A/en