Chevron icon It indicates an expandable section or menu, or sometimes previous / next navigation options. HOMEPAGE

Google Is Teaching Machines To ‘See’ Pictures And Describe Them With Startling Accuracy

Google wants to be able to create accurate, automatic captions for complex photos, and, according to a recent blog post, it's getting closer to achieving that goal. 

Advertisement

Google's machine-learning system can "see" a photo and automatically produce descriptive and relevant captions. The system has to be able to get a deeper representation of what's going on in a picture by recognizing how different objects in the picture relate to one another, an then translate that into natural-sounding description.  

For example, the system automatically captioned this photo, "Two pizzas sitting on top of a stove top oven":

PizzaGoogle
Google

"This kind of system could eventually help visually impaired people understand pictures, provide alternate text for images in parts of the world where mobile connections are slow, and make it easier for everyone to search on Google for images," Google research scientists Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan write. 

The key innovation is that Google's team has merged a computer vision system, which classifies objects in images, and a natural language processing model into one system, where it can take an image and directly produce a sentence to describe it. 

Advertisement

In Google's words, the model "combines a vision CNN with a language-generating RNN so it can take in an image and generate a fitting natural-language caption":

GoogleImage2
Google

Not that the new approach isn't fallible. Here's an interesting look at some of its results:

GoogleImage
Google

Google's the first to admit there's still work to be done. 

"We look forward to continuing developments in systems that can read images and generate good natural-language descriptions," the team writes. 

On February 28, Axel Springer, Business Insider's parent company, joined 31 other media groups and filed a $2.3 billion suit against Google in Dutch court, alleging losses suffered due to the company's advertising practices.

Google
Advertisement
Close icon Two crossed lines that form an 'X'. It indicates a way to close an interaction, or dismiss a notification.

Jump to

  1. Main content
  2. Search
  3. Account