Gyroscope-based Smartphone Keylogging Attack

August 18, 2011

smartphone_keylogging_with_gyroscopes

A pair of security researchers have recently unveiled an interesting new keylogging method (PDF Research Paper) that makes use of a very unlikely smartphone component, your gyroscope.

Most smart phones now come equipped with gyroscopes, which can be accessed by any application at any time. [Hao Chen and Lian Cai] were able to use an Android phone’s orientation data to pin down what buttons were being pressed by the user. The attack is not perfect, as the researchers were only able to discern the correct keypress about 72% of the time, but it certainly is a good start.

This side channel attack works because it turns out that each button on a smart phone has a unique “signature”, in that the phone will consistently be tilted in a certain way with each keypress. The pair does admit that the software becomes far less accurate when working with a full qwerty keyboard due to button proximity, but a 10 digit pad and keypads found on tablets can be sniffed with relatively good results.

We don’t think this is anything you should really be worried about, but it’s an interesting attack nonetheless.

[Thanks, der_picknicker]

31 thoughts on “Gyroscope-based Smartphone Keylogging Attack”

Rob says:

August 18, 2011 at 6:23 am

Interesting indeed. My first thought was if handedness would affect accuracy, or would measurements simply need to be reversed.

“The motion of the
smartphone during keystroke is affected by many factors, such as the typing force, the resistance force of the
holding hand, the original orientation of the device, and
the location where the supporting hand holds the device.”

I didn’t read the WHOLE research paper, but from this it seems they’re looking at the same hand doing the typing and holding the device. One would think the rotational forces would be the same when you touch the screen in the same spot from the opposite hand holding the device, but they would be significantly smaller.

Report comment

Reply
gman says:

August 18, 2011 at 6:34 am

If this could be more accurate it would be cool to see how it would work emulating keystrokes from a dummy keyboard with no electronics.

Report comment

Reply
1. kernelcode says:
  
  August 18, 2011 at 7:37 am
  
  You mean like have a plank of dumb wood with letters painted on then stick a gyro to the back of it and make it a keyboard?
  Cool idea, would definitely be interesting to see how well it worked.
  
  Report comment
  
  Reply
2. Maave says:
  
  August 18, 2011 at 8:43 pm
  
  That sounds like a very interesting way to make a fold-out keyboard. Perhaps you could make a prototype or find somebody who would.
  
  Report comment
  
  Reply
3. athenic says:
  
  January 7, 2015 at 5:18 pm
  
  Then how do you press Ctrl+C? And when you put the keyboard on the desk it maybe more difficult for your PC to recognize the key. It’s a cool idea, and it is just an idea.
  
  Report comment
  
  Reply
Peter says:

August 18, 2011 at 7:08 am

I don’t think this is using a gyroscope– linked article even uses the work accelometer. I can’t think of a single phone that has a gyroscope. The new Wii (and maybe the new Playstation) controllers have gyros. Most smartphones have accelerometers.

Accelerometers measure linear acceleration. Gyros measure rotational acceleration.

Report comment

Reply
1. kernelcode says:
  
  August 18, 2011 at 7:40 am
  
  I haven’t read the article, but phones with gyros are out there.
  The Nexus S and iPhone 4 to name two.
  
  Report comment
  
  Reply
2. darkirby says:
  
  August 18, 2011 at 11:00 am
  
  Gyros actually measure angular velocity, not acceleration.
  
  Report comment
  
  Reply
Khanzerbero says:

August 18, 2011 at 7:16 am

They can increase accuracy by having its output estimated letters compared as “words” usign a “hamming-distance styled” estimator with a dictionary and then those word level estimations can be grammatically parsed to estimate phrases.

It all boils down to the enthropy of the language of the user and the total real information the gyro can get.

Report comment

Reply
1. Maave says:
  
  August 18, 2011 at 8:53 pm
  
  I bet you could even use Android’s source code to get a lot of help. They have both good button-guessing and good dictionary matching.
  
  Report comment
  
  Reply
TinT says:

August 18, 2011 at 7:35 am

Very ingenious!
But it will be (almost) useless with Swype.

Report comment

Reply
1. chrwei says:
  
  August 18, 2011 at 6:57 pm
  
  LOVE the swype!
  
  Report comment
  
  Reply
zrzzz says:

August 18, 2011 at 7:54 am

Having successfully installed the gyroscope sniffing software, isn’t that the actual attack? I mean if you can install that, why not just install a straight-up keylogger? Anyway, I wonder if they can improve accuracy using the same auto-correct algorithm iPhone uses, or that T9word feature on some phones? Maybe that’s what Khanzerbero is talking about.

Report comment

Reply
1. Jessica says:
  
  August 18, 2011 at 8:06 am
  
  Sensor data like this can generally be accessed by any app that wants to. Just have to get them to install the app in the first place.
  
  Keylogger, on the other hand, requires exploitz.
  
  Report comment
  
  Reply
2. Khanzerbero says:
  
  August 19, 2011 at 5:06 am
  
  well kind of like that but T9 is very sensitive to the first guessed letters of the word to estimate the others, what im saying is that you can have a dictionary of words and compare whole words.
  
  Report comment
  
  Reply
kabukicho2001 says:

August 18, 2011 at 7:57 am

then when u type ur phone disable accelerometer and or gyroscope!by now.

Report comment

Reply
D says:

August 18, 2011 at 8:15 am

If you’re trying to use this to get the phone’s password, the 72% isn’t such a big deal because you’ll have plenty of opportunities. Record multiple login attempts, and that should handle the error nicely.

Report comment

Reply
CMP says:

August 18, 2011 at 8:37 am

Easy enough to defeat though. Just lay the device flat on a table while typing.

Without that, it would be interesting to see how the accuracy compares on a tablet vs. a smartphone.

Report comment

Reply
xorpunk says:

August 18, 2011 at 9:02 am

Then physical orientation matching and sound spectrum pattern matching are also holes.

These ‘researchers”gotta be under some trademark or nobody would even be talking about this..

Report comment

Reply
Buzzles says:

August 18, 2011 at 9:19 am

So, your touch screen presses generate distinct patterns, yes, that’s a given as it’s how phone games work for things like rotation.

I see how this works to identify individual presses (aka, you can tell you’ve pressed *a* certain button) on a phone by phone/user by user basis.

However, matching up those distinct presses to a value so you can actually figure out *which* button has been pressed is harder if not near impossible due to the variables involved.

Ie, is phone free standing, on a surface, is left or right hand being used, in transit (vibrating/jittering around), on a motherchuffing boat, etc…

I think this while a nice discovery and may end up causing things like accels/gyros to be accessed only by whitelisted or active apps, I don’t think it’s much of a security problem.

Report comment

Reply
1. Buzzles says:
  
  August 18, 2011 at 9:26 am
  
  Basically, what I’m saying is that graph up there showing “key 1” “key 2” et all, is misleading.
  
  The reason? Either the app has just designated it an arbitrary value of “key 1” when really it could be number 7 on the keypad, or the reseachers will have matched up keypad numbers to the correct profile manually while testing it, the app itself won’t (unless it has some pattern matching).
  
  Report comment
  
  Reply
  1. TacoStand says:
    
    August 18, 2011 at 10:38 am
    
    Did you even read the article?
    
    Report comment
    
    Reply
Mental2k says:

August 18, 2011 at 9:32 am

I wonder how press any key and hold -> drag to key you want -> release would affect the results?

Report comment

Reply
Oren Beck says:

August 18, 2011 at 1:18 pm

Seconded on the “Improve Accuracy” concept.

@Khanzerbero: How did you arrive at the use of Hamming as opposed to several other tools for Error Detect/Correct? My background is more hardware than software and I am trying to learn :]

This hack seems in two parts- accessing the sensor data and parsing it into the desired keystrokes. Hmnm- exploiting a market app for something innocent appearing that needs positional data+communications would serve as the judas data conduit? One part of exploit seems plausible to me. Of the many ways to hack the rest?

One being grepping the keystroke handling of the firmware. As reputedly there are internal math models for device’s using variants of timing detection for several common categories of input error:

berkeley.intel-research.net/~klyons/pubs/autowhiteout++chi08.pdf

autowhiteout, in one phrase for the tl:dr version=detects timing and blanks a suspect keystroke/s which prompts the human to retype.

The semi-related one for speech to text software:

http://www.research.ibm.com/compsci/spotlight/hci/halverson99.pdf

That study by IBM of speech-to-text software offers little depiction of “math tools” so it only serves to help build our understanding of Human>Machine parsing correction “overview” in a limited case. But- it’s on track to taking a 70~% confidence level character stream and someday raising the confidence level far enough to risk burning up blown access attempts.

Oh- I updated a friend’s phone to Android 2.2x and it’s feature of “haptic by vibrate” might offer a way to totally FUBAR this not even fully formed as an exploit keylogger CONCEPT.

Which made me contemplate having the phone begin a soft vibrate after keystroke n to frustrate this proposed exploit rather completely.

Report comment

Reply
1. WhatNow says:
  
  August 18, 2011 at 4:55 pm
  
  Don’t think vibrate would fubar, though I think it might hamper.
  
  Remember Sony when they said the vibrate function would mess with the motion sensing (yes, lame excuse to avoid paying royalties ;) ?
  
  But seriously, the vibrate occurs only AFTER you press a key, and it moves it predictably. All you’d have to do is basically use your gyro/accel to record it vibrating without a keypress and effectively “subtract” that motion.
  
  Report comment
  
  Reply
2. Khanzerbero says:
  
  August 19, 2011 at 5:13 am
  
  I was thinking about they first should do principal component analysis, develop son eigen-letters and then in the principal components space they shloud do an estimate at the letter level, and then at the word level a hamming distance based estimation against a dictionary, and then at the phrase level some semantic analisys.
  
  Why hamming? i guess its faster.
  
  Report comment
  
  Reply
Brett W. (FightCube.com) says:

August 18, 2011 at 4:34 pm

Wow, this is some brilliant thinking!

Report comment

Reply
Roger Wolff says:

August 18, 2011 at 10:58 pm

What everybody (or at least the hackaday poster) misses is that even with a statistical chance of 72% of being right, you weaken the security a lot.

Suppose there is a 4-digit PIN that allows 10 tries, for a 1/1000 chance of breakin by random guesses. You have to steal on average 1000 phones and randomly try PINs before you hit the jackpot.

But with say a 90% (the calculations are easier with a round number) chance of getting the numbers right from the attack means you have only 0.9^4 = 65% chance of getting the pin right on the first try. But if you didn’t get it right on the first try, you have 9 tries left. You often have a “next candidate”. So you can try the 4 “second-best” tries next. If the second-best has a 90% chance of being correct (given the first try was wrong), then we again have 65% chance of success on the next 4 tries… With 5 tries left, we can try a few two-wrong PINs but that doesn’t make a big difference. In this example with imperfect pin-stealing, the average number of phones to steal before the PIN can be guessed goes from 1000 to under two.

With the reported 72% accuracy and only 3 tries, there is still a significant advantage over “blind guessing”. The first guess has a 27% chance of success. With the two next tries this can be raised to almost 40%. On average after stealing 2.5 phones you have guessed the PIN correctly within 3 tries.

Now

Report comment

Reply
1. Mike Nathan says:
  
  August 19, 2011 at 4:01 am
  
  If we had missed that point, we wouldn’t have posted it ;)
  
  Report comment
  
  Reply
2. Diddle says:
  
  August 20, 2011 at 3:52 pm
  
  This, of course, assuming that you have captured the data unique to that handset / user.
  
  What would be easier is watching the owner unlock the phone.
  
  Report comment
  
  Reply
elektrophreak says:

August 18, 2011 at 11:59 pm

now this is something that impresses me! great idea, brilliant!

Report comment

Reply