Grand Theft Auto V Used To Teach Self-Driving AI

For all the complexity involved in driving, it becomes second nature to respond to pedestrians, environmental conditions, even the basic rules of the road. When it comes to AI, teaching machine learning algorithms how to drive in a virtual world makes sense when the real one is packed full of squishy humans and other potential catastrophes. So, why not use the wildly successful virtual world of Grand Theft Auto V to teach machine learning programs to operate a vehicle?

Half and Half GTAV Annotation ThumbThe hard problem with this approach is getting a large enough sample for the machine learning to be viable. The idea is this: the virtual world provides a far more efficient solution to supplying enough data to these programs compared to the time-consuming task of annotating object data from real-world images. In addition to scaling up the amount of data, researchers can manipulate weather, traffic, pedestrians and more to create complex conditions with which to train AI.

It’s pretty easy to teach the “rules of the road” — we do with 16-year-olds all the time. But those earliest drivers have already spent a lifetime observing the real world and watching parents drive. The virtual world inside GTA V is fantastically realistic. Humans are great pattern recognizers and fickle gamers would cry foul at anything that doesn’t analog real life. What we’re left with is a near-perfect source of test cases for machine learning to be applied to the hard part of self-drive: understanding the vastly variable world every vehicle encounters.

A team of researchers from Intel Labs and Darmstadt University in Germany created a program that automatically indexes the virtual world (as seen above), creating useful data for a machine learning program to consume. This isn’t a complete substitute for real-world experience mind you, but the freedom to make a few mistakes before putting an AI behind the wheel of a vehicle has the potential to speed up development of autonomous vehicles. Read the paper the team published Playing for Data: Ground Truth from Video Games.

Before you think this could go horribly wrong, check out this mini-rally truck that taught itself how to powerslide and realize that this method of teaching AI how to drive could actually be totally awesome. Also realize that this research is just characterizing still-images at about 7 seconds per image. This is more than a couple orders of magnitude faster than real-world images — great for learning but we’re still very far away from real-time and real-world implementation.

We really hope that a team of research assistants were paid to play a lot of GTA V in a serious scientific effort to harvest this data set. That brings to mind one serious speed-bump. This game is copyrighted and you can’t just do anything you want with recordings of gameplay, and the researchers do mention that gameplay footage is allowed under certain non-commercial circumstances. That means that Uber, Google, Apple, Tesla, every major auto company, and anyone else developing autonomous vehicles as a business model will be locked out of this data source unless Rockstar Games comes up with a licensing model. Maybe your next car will have a “Powered by GTA” sticker on it.

[MIT Technology Review via Popular Science]

29 thoughts on “Grand Theft Auto V Used To Teach Self-Driving AI

  1. just imagine if Rockstar decided to create a mod where all this annotation is being done INSIDE the engine itself, since they already know what objects are what :)

    They could sell it to all major companies and get another few hundred million out of their already successful game franchise AND further advance the progress of vehicular autonomy.

  2. The trouble with putting an AI in the Grand Theft Auto world to teach it anything is that it may learn sociopathy there faster than anything else. The last thing we need is a rogue AI pulling people out of their cars to take a joyride on the sidewalk. :)

    1. At least you wouldn’t worry about a GTA-trained robocar trying to solve trolley problems by crashing and killing you to avoid hitting that bus full of kindergarteners and nuns. (Whether you have to worry about it activating the driver ejection seat and then aiming for the bus to get you more points is a separate problem.)

    2. It’s a simple matter of switching the scoring rules in the AI’s mind from “Video game” to “Golf” (i.e. lower scores are better) and we’ll have the safest self driving car imaginable.

  3. “Also realize that this research is just characterizing still-images at about 7 seconds per image. This is more than a couple orders of magnitude faster than real-world images — great for learning but we’re still very far away from real-time and real-world implementation.”

    At 7 seconds per image, wouldn’t it be “couple orders of magnitude SLOWER than real-world images”?

    1. **Annotation** of real-world images according to the video takes 60-90 minutes per frame (suspicious). So this is “a couple orders of magnitude faster”, but I agree, that paragraph was poorly worded.

        1. I’m a little late to the party, but reading the paper gives some more insight. They’re talking about annotating the image in the sense of the overlay above – creating a pixel-accurate mask of what’s what. Doing this very accurately is key, since it’s supposed to be a “perfect” dataset to train with. Creating these masks in this sense by hand on a frame by frame basis (like the other datasets they’re comparing against) could easily take that long.

  4. The whole self-driving car thing is taking on dangerous aspects with cheaper and cheaper and more and more amateurish experiments and kickstarters.
    But at least in Europe you can say it was a terrorist if things go really foul I suppose.

  5. I don’t understand why people think this sort of machine learning isn’t possible, just the other day some idiot that uses the nic “LOL” was making that claim, but of course it works, that is how our neural networks get trained too, in controlled and structured environments that are a microcosm of the real world, i.e. The family home and then schools etc.. The advantage AI has over humans is that you can duplicate the net once it is trained as many times you like, imagine having true ancestral memory and being born with all the knowledge your parents gained before you were conceived.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s