Facebook Using Billions of Instagram Photos to Train its Image Recognition AI

8 years ago

In research presented today at its annual F8 developer conference, Facebook revealed that it took billions of public Instagram photos annotated by users with hashtags and used that data to train its own image recognition models. According to TechCrunch, the largest of the tests used 3.5 billion Instagram images spanning 17,000 hashtags.

The social media giant detailed how it relied on hundreds of GPUs running around the clock to parse the data, and was ultimately left with deep learning models that beat industry benchmarks, the best of which achieved 85.4% accuracy on ImageNet.

Facebook notes that while other image recognition benchmarks may rely on millions of photos that humans have personally annotated, it had to find methods for cleaning up what users had submitted that could be executed at scale.

The “pre-training” research focused on developing systems for finding relevant hashtags; that meant discovering which hashtags were synonymous while also learning to prioritize more specific hashtags over the more general ones. This ultimately led to what the research group called the “large-scale hashtag prediction model.”
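To make the two cleanup ideas concrete, here is a minimal Python sketch of merging synonymous hashtags and preferring specific hashtags over general ones. It is not Facebook's method: the `SYNONYMS` and `PARENT` tables, the sample hashtags, and the function names are all hypothetical and written by hand for illustration, whereas the research describes discovering these relationships at scale.

```python
from __future__ import annotations

# Hypothetical lookup tables, invented for illustration only.
# Maps a synonymous hashtag to the single label it should collapse onto.
SYNONYMS = {
    "#doggo": "#dog",
    "#pupper": "#dog",
    "#kitty": "#cat",
}

# Maps a specific hashtag to the more general hashtag directly above it.
PARENT = {
    "#goldenretriever": "#dog",
    "#dog": "#animal",
    "#cat": "#animal",
}


def canonicalize(tag: str) -> str:
    """Lowercase a hashtag and collapse known synonyms onto one label."""
    tag = tag.strip().lower()
    return SYNONYMS.get(tag, tag)


def ancestors(tag: str) -> set[str]:
    """Return every more general hashtag above `tag` in PARENT."""
    found: set[str] = set()
    while tag in PARENT:
        tag = PARENT[tag]
        if tag in found:  # guard against an accidental cycle in the table
            break
        found.add(tag)
    return found


def clean_hashtags(raw_tags: list[str]) -> list[str]:
    """Merge synonyms, then drop labels that a more specific label implies."""
    labels = {canonicalize(t) for t in raw_tags}
    general: set[str] = set()
    for label in labels:
        general |= ancestors(label)
    return sorted(labels - general)


if __name__ == "__main__":
    print(clean_hashtags(["#Doggo", "#goldenretriever", "#animal", "#sunset"]))
```

In this toy example, `#Doggo` collapses to `#dog`, and both `#dog` and `#animal` are dropped because `#goldenretriever` already implies them, while the unrelated `#sunset` is kept.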

The privacy implications here are interesting. On one hand, Facebook is only using what amounts to public data (no private accounts), but when a user posts an Instagram photo, how aware are they that they’re also contributing to a database that’s training deep learning models for a tech mega-corp?

The deep learning models trained on this data will not only be very useful to Facebook, but could also bring better image search and accessibility tools to users.


2 Comments


Park Jihyo

8 years ago

Isn’t that crazy? So if they scan enough photos, they can make a mold of your face and unlock your iPhone X. Highly possible, I think. Facebook’s true power is having information on everything and everyone!

Aleks Oniszczak

Reply to Park Jihyo

8 years ago

Good point! It makes me question the security of Face ID, and not just from Facebook, but from anyone. Since our faces are on display in public, I can imagine someone using a portable 3D scanner to quickly scan a face and then feed the info into a special manikin head that changes shape based on the scanned data.
